Search results for: non written script

Number of results: 1397671

2014
Ann Bies Zhiyi Song Mohamed Maamouri Stephen Grimes Haejoong Lee Jonathan Wright Stephanie Strassel Nizar Habash Ramy Eskander Owen Rambow

This paper describes the process of creating a novel resource, a parallel Arabizi-Arabic script corpus of SMS/Chat data. The language used in social media expresses many differences from other written genres: its vocabulary is informal with intentional deviations from standard orthography such as repeated letters for emphasis; typos and nonstandard abbreviations are common; and nonlinguistic co...

2009
Christoph Brau Krunoslav Stingl Christopher Hoffmann Jonathan Wolf Dirk Wildgruber Susanne Trauzettel-Klosinski

Impaired phonological encoding has been assumed to be one reason for reading problems in dyslexics. While normal readers are supposed to use a fast and efficient way of grapheme to phoneme conversion, dyslexics seem to be unable to access this pathway. The present study infers whether normal readers reading words written in a new script can utilise the efficient grapheme to phoneme conversion e...

2002
M. Srinivas

A system for online recognition of handwritten Telugu script is presented. A handwritten character is constructed by executing a sequence of strokes. A structure- or shape-based representation of a stroke is used in which a stroke is represented as a string of shape features. Using this string representation, an unknown stroke is identified by comparing it with a database of strokes. A full chara...

2006
Muhammad Ghulam Abbas Malik

Machine transliteration transcribes a word written in one script into another language with approximate phonetic equivalence. It is useful for machine translation, cross-lingual information retrieval, and multilingual text and speech processing. Punjabi Machine Transliteration (PMT) is a special case of machine transliteration and is a process of converting a word from Shahmukhi (based on Arabic s...

2016
Somnath Banerjee Kunal Chakma Sudip Kumar Naskar Amitava Das Paolo Rosso Sivaji Bandyopadhyay Monojit Choudhury

The shared task on Mixed Script Information Retrieval (MSIR) was organized for the fourth year in FIRE-2016. The track had two subtasks. Subtask-1 was on question classification, where questions were in code-mixed Bengali-English and Bengali was written in transliterated Roman script. Subtask-2 was on ad-hoc retrieval of Hindi film song lyrics, movie reviews and astrology documents, where both t...

2001
Ahmed M. Elgammal Mohamed A. Ismail

Because the characteristics of Arabic differ from those of Romance and Anglo-Saxon languages, recognizing documents written in a hybrid of these languages requires that the language of the text be identified prior to the recognition phase. In this paper, three efficient techniques that can be used to discriminate between text written in Arabic script and text written in English script...

2008
David J. Silva

Soon after their arrival in Korea, Christian missionaries were confronted by decisions regarding how they would present written materials to the Korean people. While many Koreans used their indigenous script (Hangeul) for everyday purposes, higher status literacy materials were expected to be presented using Chinese characters (Hanja), a system unfamiliar to most but considered more prestigious...

2013
Darko Brodić Zoran N. Milivojević Čedomir A. Maluckov

Any document in the Serbian language can be written in either of two scripts: Latin or Cyrillic. Although the characteristics of these scripts are similar, some of their statistical measures are quite different. The paper proposes a method for extracting a particular script from a document according to the occurrence and co-occurrence of the script types. First, each letter is modeled with the certain ...

2012
Singh Kaushal

This paper deals with an Optical Character Recognition system for printed Urdu, a popular Pakistani/Indian script. Urdu is the third most widely understood language in the world, especially in the subcontinent, but few efforts have been made to make it understandable to computers. A lot of work has been done in the field of literature and Islamic studies in Urdu, which has to be computerized. Research h...
