Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences

نویسنده

  • Paul Mermelstein
چکیده

o Several parametric representations of the acoustic signal were compared as to word recognition performance in a syllableoriented continuous speech recognition system. The vocabulary included many phonetically similar monosyllabic words, therefore the emphasis was on ability to retain phonetically significant acoustic information in the face of syntactic and duration variations. For each ~ arameter set (based on a mel-frequency cepstrum, a linear frequency cepstrum, a linear prediction cepstrum, a linear prediction spectrum, or a set of reflection coefficients), word templates were generated using an efficient dynamic method, and test data were time registered wi th the templates. A set of ten melfrequency cepstrum coefficients computed every 6" 4 ms resulted in the best performance, namely 96 .. 5% and 9500% recognition with each of two speakers.. The superior performance of the mel-frequency cepstrum coefficients may be attributed to the fact that they better represent the perceptually relevant aspects of the short-term speech spectrum ..

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences

Akfract-Several parametric representations of the acoustic signal were compared with regard to word recognition performance in a syllable-oriented continuous speech recognition system. The vocabulary included many phonetically similar monosyllabic words, therefore the emphasis was on the ability to retain phonetically significant acoustic information in the face of syntactic and dura...

متن کامل

Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences

Akfract-Several parametric representations of the acoustic signal were compared with regard to word recognition performance in a syllable-oriented continuous speech recognition system. The vocabulary included many phonetically similar monosyllabic words, therefore the emphasis was on the ability to retain phonetically significant acoustic information in the face of syntactic and dura...

متن کامل

Constraints of lexical stress on lexical access in English: evidence from native and non-native listeners.

Four cross-modal priming experiments and two forced-choice identification experiments investigated the use of suprasegmental cues to stress in the recognition of spoken English words, by native (English-speaking) and non-native (Dutch) listeners. Previous results had indicated that suprasegmental information was exploited in lexical access by Dutch but not by English listeners For both listener...

متن کامل

Exploring the role of lexical stress in lexical recognition.

Three cross-modal priming experiments examined the role of suprasegmental information in the processing of spoken words. All primes consisted of truncated spoken Dutch words. Recognition of visually presented word targets was facilitated by prior auditory presentation of the first two syllables of the same words as primes, but only if they were appropriately stressed (e.g., OKTOBER preceded by ...

متن کامل

Delayed commitment in spoken word recognition: evidence from cross-modal priming.

Using the cross-modal priming paradigm, we attempted to determine whether semantic representations for word-final morphemes embedded in multisyllabic words (e.g.,/lak/in /hemlak/) are independently activated in memory. That is, we attempted to determine whether the auditory prime, /hemlak/, would facilitate lexical decision times to the visual target, KEY, even when the recognition point for /h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009