Decision tree based Mandarin tone model and its application to speech recognition
نویسندگان
چکیده
Tone is an essential language phenomenon for Mandarin Chinese language. Until now, we still do not know exactly how context affects tone pattern variation in continuous Mandarin speech. In this paper, we proposed a decision tree based approach to obtain the quantitative result of tone pattern variation in continuous Mandarin speech. Many possible factors other than tone of neighboring syllables were taken into consideration when the decision tree was constructed. After the tree was established, 29 tone patterns were automatically obtained, and we found that syllable position in the word together with Consonant/Vowel type of the syllable made important contribution to tone pattern variation in continuous utterance. We also presented a novel approach to integrate tone information into search process at word level. Experimental results showed that the character error rate was reduced by 15.2%.
منابع مشابه
Update progress of Sinohear: advanced Mandarin LVCSR system at NLPR
NLPR has been with long efforts on Mandarin speech recognition. This paper reports our recent process in this field with several significant novel characteristics: 1) Very large speech databases are used to learn more robust acoustic model; 2) Acoustic model has evolved from non-tonal class-triphone to tonal class-triphone based on tone-embedded decision tree, namely unified tone & triphone mod...
متن کاملA Tone Recognition Fram Mandarin
In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) ...
متن کاملA stochastic polynomial tone model for continuous Mandarin speech
In this paper, a stochastic polynomial tone model is presented for tone modeling in continuous mandarin speech. In this model, the pitch contour is described by a stochastic trajectory. The mean trajectory is represented by a polynomial function of normalized time while the variance is time varying. After that, an effective training and recognition algorithm is developed respectively. Also the ...
متن کاملUsing GMM for voiced/voiceless segmentation and tone decision in Mandarin continuous speech recognition
In this paper, methods of Gaussian Mixture Model (GMM) are presented for both silence/voiced/voiceless segmentation and tone decision in Mandarin continuous speech recognition system. GMM has been used for silence/voiced/voiceless segmentation before, but the feature parameters can be modified to improve both accuracy and speed. As a popular method in pattern recognition, GMM is first proposed ...
متن کاملClass-triphone Acoustic Modeling Based on Decision Tree for Mandarin Continuous Speech Recognition
Decision tree based acoustic modeling has increasingly become popular for modeling speech spectral variations in continuous speech. In this paper, class-triphone acoustic models based on the decision tree are investigated for mandarin speakerindependent continuous speech recognition. Three main questions are discussed: how to select base phone models, how to generate the question set based on l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000