A general-purpose IsiZulu speech synthesizer

Authors

  • J. A. Louw
  • M. Davel
  • E. Barnard
Abstract

A general-purpose isiZulu text-to-speech (TTS) system was developed, based on the ‘Multisyn’ unit-selection approach supported by the Festival TTS toolkit. The development involved a number of challenges related to the interface between speech technology and linguistics – for example, choosing an appropriate set of phonetic units, producing reliable pronunciations, and developing appropriate cost functions for selecting and joining diphone units. We show how solutions were found for each of these challenges, and describe a number of other innovations (such as automated fault detection in manual alignments) that were introduced. Initial evaluations suggest that the synthesizer is usable by a wide spectrum of isiZulu speakers.
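
As a rough illustration of what "cost functions for selecting and joining diphone units" means in a unit-selection synthesizer, the sketch below runs a generic Viterbi search that minimizes the sum of a target cost (how well a candidate unit fits the requested diphone) and a join cost (how smoothly two consecutive units concatenate). This is a minimal sketch under stated assumptions, not the cost functions or search used in the paper or in Festival's Multisyn engine; the name `select_units` and the cost-function signatures are hypothetical.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")  # target specification (e.g. a diphone name); hypothetical
U = TypeVar("U")  # candidate unit from the recorded database; hypothetical


def select_units(
    targets: Sequence[T],
    candidates: Sequence[Sequence[U]],
    target_cost: Callable[[T, U], float],
    join_cost: Callable[[U, U], float],
) -> List[U]:
    """Return the candidate sequence with the lowest total cost.

    Generic dynamic-programming (Viterbi) search; the cost functions are
    supplied by the caller and are assumptions of this sketch.
    """
    if not targets:
        return []
    if len(candidates) != len(targets) or any(len(c) == 0 for c in candidates):
        raise ValueError("each target needs at least one candidate unit")

    # best[i][j]: cheapest cost of any path ending in candidates[i][j]
    best = [[target_cost(targets[0], c) for c in candidates[0]]]
    # back[i][j]: index of the predecessor unit on that cheapest path
    back = [[-1] * len(candidates[0])]

    for i in range(1, len(targets)):
        row, ptr = [], []
        for cand in candidates[i]:
            costs = [
                best[i - 1][k] + join_cost(prev, cand)
                for k, prev in enumerate(candidates[i - 1])
            ]
            k_best = min(range(len(costs)), key=costs.__getitem__)
            row.append(costs[k_best] + target_cost(targets[i], cand))
            ptr.append(k_best)
        best.append(row)
        back.append(ptr)

    # Trace back from the cheapest final unit.
    j = min(range(len(best[-1])), key=best[-1].__getitem__)
    path: List[U] = []
    for i in range(len(targets) - 1, -1, -1):
        path.append(candidates[i][j])
        j = back[i][j]
    return path[::-1]
```

In a toy setting, each unit could be a `(diphone_name, pitch)` pair, with a target cost that penalizes a name mismatch and a join cost proportional to the pitch difference at the boundary; real systems typically combine several spectral and prosodic measures.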

Similar resources

A general-purpose IsiZulu Speech Synthesiser

A general-purpose isiZulu text-to-speech (TTS) system was developed, based on the “Multisyn” unit-selection approach supported by the Festival TTS toolkit. The development involved a number of challenges related to the interface between speech technology and linguistics – for example, choosing an appropriate set of phonetic units, producing reliable pronunciations, and developing appropriate co...

Automatic error detection in alignments for speech synthesis

The phonetic segmentation of recorded speech is a crucial factor in the quality of concatenative systems for speech synthesis. We describe a likelihood-based error detection process that can be used to flag possible errors in such a segmentation, with a view towards manual correction. It is shown that this process can be used to assist in the creation of high-accuracy segmentations. In partic...

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One interesting topic in the multimedia domain is enabling computers to produce speech. Speech synthesis grants the computer the human ability to produce speech. The data-based approach and the process-based approach are the two main approaches to speech synthesis, and each has its own challenges. Unit-selection speech synthesis and statistical parametr...

Farsi language prosodic structure, research and implementation using a speech synthesizer

In this research, we investigated the prosodic features of the Farsi (Persian) language and quantified major stress rules and some intonation rules for speech synthesis purposes. The research concentrates mostly on pitch variations and then on durational changes. We implemented the proposed simplified prosodic rules using a Klatt formant synthesizer, specially modified for Farsi phone...

A Simple Malay Speech Synthesizer Using Syllable Concatenation Approach

This paper discusses a Malay speech synthesizer system. It covers the available Malay speech synthesis system, the underlying structure of our system, a brief description of crucial modules, a general evaluation of the system, and the proposed enhancements and future work on the Malay text-to-speech system in the Computer-Aided Translation Unit (UTMK). The objective is to highlight how our system works...

Publication date: 2006