A general-purpose IsiZulu Speech Synthesiser

نویسنده

J. A. Louw

چکیده

A general-purpose isiZulu text-to-speech (TTS) system was developed, based on the “Multisyn” unit-selection approach supported by the Festival TTS toolkit. The development involved a number of challenges related to the interface between speech technology and linguistics – for example, choosing an appropriate set of phonetic units, producing reliable pronunciations, and developing appropriate cost functions for selecting and joining diphone units. We show how solutions were found for each of these challenges, and describe a number of other innovations (such as automated fault detection in manual alignments) that were introduced. Initial evaluations suggest that the synthesizer is usable by a wide spectrum of isiZulu speakers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic intonation modeling with INTSINT

Accurate intonation modeling has become a vital part of modern day speech synthesis systems. This is especially true for tonal languages such as isiZulu, where the intonation of an utterance not only influences the perceived naturalness of the synthetic voice, but may also influence its semantics. In this work we explore the INTSINT intonation modeling algorithm and its application to an isiZul...

متن کامل

A general-purpose IsiZulu speech synthesizer

A general-purpose isiZulu text-to-speech (TTS) system was developed, based on the ‘Multisyn’ unitselection approach supported by the Festival TTS toolkit. The development involved a number of challenges related to the interface between speech technology and linguistics – for example, choosing an appropriate set of phonetic units, producing reliable pronunciations, and developing appropriate cos...

متن کامل

Festival 2 - build your own general purpose unit selection speech synthesiser

This paper describes version 2 of the Festival speech synthesis system. Festival 2 provides a development environment for concatenative speech synthesis, and now includes a general purpose unit selection speech synthesis engine. We discuss various aspects of unit selection speech synthesis, focusing on the research issues that relate to voice design and the automation of the voice development p...

متن کامل

On evaluating synthesised visual speech

This paper describes issues relating to the subjective evaluation of synthesised visual speech. Two approaches to synthesis are compared: a text-driven synthesiser and a speech-driven synthesiser. Both synthesisers are trained using the same data and both use the same model for rendering the synthesised visual speech. Naturalness is used as a performance metric, and the naturalness of real visu...

متن کامل

Current status of the IBM Trainable Speech Synthesis System

This paper describes the current status of the IBM Trainable Speech Synthesis System. The system is a state-of-the-art, trainable, unit-selection based concatenative speech synthesiser. The system uses hidden Markov models (HMMs) to provide a phonetic transcription and HMM state alignment of a database of single-speaker continuous-speech training data. The runtime synthesiser uses the HMM state...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

A general-purpose IsiZulu Speech Synthesiser

نویسنده

چکیده

منابع مشابه

Automatic intonation modeling with INTSINT

A general-purpose IsiZulu speech synthesizer

Festival 2 - build your own general purpose unit selection speech synthesiser

On evaluating synthesised visual speech

Current status of the IBM Trainable Speech Synthesis System

عنوان ژورنال:

اشتراک گذاری