A REVIEW PAPER ON SMS TEXT TO PLAIN ENGLISH TRANSLATION (Text Normalization)

Authors

  • Meenakshi Sharma
Abstract

Mobile technology and social networking technology play an important role in communication over the internet. A large amount of information is found in noisy contexts, as the use of texting and chat lingo has grown considerably in the past decade. This noisy information needs to be normalized into standard text so that it can be used by other tools such as text-to-speech programs. This paper presents a review of Short Message Service (SMS) text normalization into plain English text. Here, normalization means translating SMS text into plain English text using techniques such as the rule-based approach and statistical machine translation. This is a research area within Natural Language Processing (NLP).
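
As a rough illustration of the rule-based approach mentioned in the abstract, the following minimal sketch replaces known SMS tokens using a lookup table. It is not taken from the reviewed paper: the `SMS_LEXICON` entries, the tokenizing pattern, and the `normalize_sms` function are all assumptions made for this example, and a real system would need a far larger lexicon plus handling of ambiguity and capitalization.

```python
import re

# Hypothetical lookup table of SMS abbreviations (illustrative entries only).
SMS_LEXICON = {
    "c": "see",
    "u": "you",
    "r": "are",
    "gr8": "great",
    "2moro": "tomorrow",
    "pls": "please",
    "thx": "thanks",
}

# Split text into word-like tokens, punctuation runs, and whitespace runs,
# so that joining the tokens reproduces the original spacing.
TOKEN_RE = re.compile(r"[A-Za-z0-9']+|[^A-Za-z0-9'\s]+|\s+")


def normalize_sms(text, lexicon=SMS_LEXICON):
    """Replace each token found in the lexicon with its plain English form.

    Lookup is case-insensitive; unknown tokens are passed through unchanged.
    """
    tokens = TOKEN_RE.findall(text)
    return "".join(lexicon.get(token.lower(), token) for token in tokens)


if __name__ == "__main__":
    print(normalize_sms("c u 2moro, thx"))
```

A pure lookup like this cannot resolve tokens whose meaning depends on context (for example "2" as "to", "too", or "two"), which is one reason statistical machine translation is considered alongside rule-based methods.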

Similar resources

A Phrase-Based Statistical Model for SMS Text Normalization

Short Messaging Service (SMS) texts behave quite differently from normal written texts and have some very special phenomena. To translate SMS texts, traditional approaches model such irregularities directly in Machine Translation (MT). However, such approaches suffer from a customization problem, as tremendous effort is required to adapt the language model of the existing translation system to han...

SMS Text Normalization Using Hybrid Approach

Text normalization is the task of generating plain text from unnormalized text. Mobile technology has contributed to the evolution of several media of communication such as chats, emails and short message service (SMS) text. This has significantly influenced the traditional standard way of expressing views, from letter writing to a high-tech form of expression known as texting language. In thi...

Text Normalization Using Hybrid Approach

Machine Translation (MT) is an important area of Natural Language Processing that deals with the translation of one natural language into another. In this paper we present research on the translation of short messages into plain English text messages. In today’s world, where communication over the internet has increased through the use of various types of websites and other internet applic...

Syntactic Normalization of Twitter Messages

The use of computer mediated communication such as emailing, microblogs, Short Messaging System (SMS), and chat rooms has created corpora which contain incredibly noisy text. Tweets, messages sent by users on Twitter.com, are an especially noisy form of communication. Twitter.com contains billions of these tweets, but in their current state they contain so much noise that it is difficult to ext...

CS224N: Investigating SMS Text Normalization using Statistical Machine Translation

In this project we explore two approaches to SMS text normalization. First we try a dictionary substitution approach used by most websites that provide such a service, and then modify it with our extension. This is followed by a statistical machine translation (MT) approach using off-the-shelf MT tools. We evaluate the performance of our system on three test sets from different sources and disc...

Publication date: 2014