A Rule Based Syllabification Algorithm for Sinhala
نویسندگان
چکیده
This paper presents a study of Sinhala syllable structure and an algorithm for identifying syllables in Sinhala words. After a thorough study of the Syllable structure and linguistic rules for syllabification of Sinhala words and a survey of the relevant literature, a set of rules was identified and implemented as a simple, easy-to-implement algorithm. The algorithm was tested using 30,000 distinct words obtained from a corpus and compared with the same words manually syllabified. The algorithm performs with 99.95 % accuracy.
منابع مشابه
Report on Phonetics and Phonology of Sinhala
This report examines the major characteristics of Sinhala language related to Phonetics and Phonology. The main topics under study are Segmental and Supra-segmental sounds in Spoken Sinhala. The first part presents Sinhala Phonemic Inventory, which describes phonemes with their associated features and phonotactics of Sinhala. Supra-segmental features like Syllabification, Stress, Pitch and Into...
متن کاملFestival-si: A Sinhala Text-to-Speech System
This paper brings together the development of the first Text-to-Speech (TTS) system for Sinhala using the Festival framework and practical applications of it. Construction of a diphone database and implementation of the natural language processing modules are described. The paper also presents the development methodology of direct Sinhala Unicode text input by rewriting Letter-to-Sound rules in...
متن کاملAutomatic Segmentation of Separately Pronounced Sinhala Words into Syllables
Aligned corpora are widely used in various speech applications like automatic speech recognition, speech synthesis, as well as prosodic and phonetic research. The segmentation into syllables can be done manually or automatically. But it consumes significantly more time for a fully manual phonetic segmentation and practically it is a complicated task because in many cases it requires a large ali...
متن کاملAre rule-based syllabification methods adequate for languages with low syllabic complexity? the case of Italian
Syllabification information is a valuable component in speech synthesis systems. Linguistic rule-based methods have been assumed to be the best technique for determining the syllabification of unknown words. This has recently been shown to be incorrect for the English language where data-driven algorithms have been shown to outperform rule-based methods. It may be possible, however, that data-d...
متن کاملA Rule Based Algorithm for Automatic Syllabification of a Word of Bodo Language ISSN 2319 - 2720
The process of syllabification performs the task of Identifying syllables in a word. The correct Syllabification rules and algorithms are mainly used in text-to-speech system to improve naturalness of the synthesized speech. This paper presents a study of Bodo syllable structure and linguistic rules for syllabification as well. An algorithm has been developed for automatic syllabification of Bo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005