Decision tree based Mandarin tone model and its application to speech recognition

نویسندگان

Yang Cao

Yonggang Deng

Hong Zhang

Taiyi Huang

Bo Xu

چکیده

Tone is an essential language phenomenon for Mandarin Chinese language. Until now, we still do not know exactly how context affects tone pattern variation in continuous Mandarin speech. In this paper, we proposed a decision tree based approach to obtain the quantitative result of tone pattern variation in continuous Mandarin speech. Many possible factors other than tone of neighboring syllables were taken into consideration when the decision tree was constructed. After the tree was established, 29 tone patterns were automatically obtained, and we found that syllable position in the word together with Consonant/Vowel type of the syllable made important contribution to tone pattern variation in continuous utterance. We also presented a novel approach to integrate tone information into search process at word level. Experimental results showed that the character error rate was reduced by 15.2%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Update progress of Sinohear: advanced Mandarin LVCSR system at NLPR

NLPR has been with long efforts on Mandarin speech recognition. This paper reports our recent process in this field with several significant novel characteristics: 1) Very large speech databases are used to learn more robust acoustic model; 2) Acoustic model has evolved from non-tonal class-triphone to tonal class-triphone based on tone-embedded decision tree, namely unified tone & triphone mod...

متن کامل

A Tone Recognition Fram Mandarin

In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) ...

متن کامل

A stochastic polynomial tone model for continuous Mandarin speech

In this paper, a stochastic polynomial tone model is presented for tone modeling in continuous mandarin speech. In this model, the pitch contour is described by a stochastic trajectory. The mean trajectory is represented by a polynomial function of normalized time while the variance is time varying. After that, an effective training and recognition algorithm is developed respectively. Also the ...

متن کامل

Using GMM for voiced/voiceless segmentation and tone decision in Mandarin continuous speech recognition

In this paper, methods of Gaussian Mixture Model (GMM) are presented for both silence/voiced/voiceless segmentation and tone decision in Mandarin continuous speech recognition system. GMM has been used for silence/voiced/voiceless segmentation before, but the feature parameters can be modified to improve both accuracy and speed. As a popular method in pattern recognition, GMM is first proposed ...

متن کامل

Class-triphone Acoustic Modeling Based on Decision Tree for Mandarin Continuous Speech Recognition

Decision tree based acoustic modeling has increasingly become popular for modeling speech spectral variations in continuous speech. In this paper, class-triphone acoustic models based on the decision tree are investigated for mandarin speakerindependent continuous speech recognition. Three main questions are discussed: how to select base phone models, how to generate the question set based on l...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2000

Decision tree based Mandarin tone model and its application to speech recognition

نویسندگان

چکیده

منابع مشابه

Update progress of Sinohear: advanced Mandarin LVCSR system at NLPR

A Tone Recognition Fram Mandarin

A stochastic polynomial tone model for continuous Mandarin speech

Using GMM for voiced/voiceless segmentation and tone decision in Mandarin continuous speech recognition

Class-triphone Acoustic Modeling Based on Decision Tree for Mandarin Continuous Speech Recognition

عنوان ژورنال:

اشتراک گذاری