Mandarin/English mixed-lingual name recognition for mobile phone

نویسندگان

  • Xiaolin Ren
  • Xin He
  • Yaxin Zhang
چکیده

Speaker independent name speech recognition has become hot application in handheld devices such as mobile phones and personal digit assistants (PDAs). This paper presents a new mixed-lingual ASR system that will enable Chinese mobile phone users to conduct Mandarin and English name speech recognition simultaneously without switching language modes. We created an elaborately designed mixed acoustic model set which consists of two monolingual acoustic model sets, i.e. context-dependent Mandarin models and context-independent English models. Design of such mixed acoustic models for mobile phones is to balance the complexity and performance of the ASR system. We also described Mandarin and English mixed-lingual letter-to-sound converter in this paper. We demonstrated the effectiveness of the mixed-lingual name speech recognition system through off-line ASR experiments. Furthermore, the mixed-lingual name ASR system has also been integrated in Motorola new PDA phones and showed good performance in real world application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mixed-lingual spoken word recognition by using VQ codebook sequences of variable length segments

We are investigating unsupervised phone modeling. This paper describes a derivation method of VQ codebook sequences of variable length segments from spoken word samples, and also describes evaluation results by applying the method to mixed-lingual speech recognition tasks which include non-native speakers. The VQ codebook is generated based on a piecewise linear segmentation method which includ...

متن کامل

Concatenative Mandarin Tts Accommodating Isolated English Words

An experiment to explore the method realizing a concatenative Chinese TTS accommodating isolated English words is presented. The experiment was based on an existing concatenative Mandarin TTS system, developed in Motorola China Research Center. The experimental system employs an English word synthesizer based on the concatenation of speech segments stored in an English corpus. The original Engl...

متن کامل

Cross-lingual Speaker Adaptation for Hmm-based Speech Synthesis

This paper explores a cross-lingual speaker adaptation technique for HMM-based speech synthesis, where a source voice model for English is transformed into a target speaker model using Mandarin Chinese speech data from the target speaker. A phone mappingbased method is adopted to map Chinese Initial/Finals into English phonemes and two types of mapping rules, including one-to-one and one-to-seq...

متن کامل

Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

In the EMIME project, we developed a mobile device that performs personalized speech-to-speech translation such that a user’s spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user’s voice. We integrated two techniques into a single architecture: unsupervised adaptation for HMM-based TTS using word-based large-vocabulary contin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005