Speech Recognition for Keyword Spotting using a Set of Modulation Based Features – Preliminary Results

نویسندگان

  • Kaliappan GOPALAN
  • Tao CHU
چکیده

We present the preliminary results of applying a set of parameters of the AM-FM model for recognizing word utterances. By acquiring modulation based parameters from the amplitude envelope (AE) and the instantaneous frequency – both obtained by demodulating at four selected center frequencies – a compact feature set is created for each frame of a word utterance. Applying a dynamic time warping of features, a dissimilarity measure between an unknown and one of several reference utterances is obtained to detect the presence of a keyword in a continuous stream of speech. A feature set consisting of the peak frequencies in AE and weighted formants, among others, shows an overall recognition score of 75 percent or higher – depending on the analysis frequencies used – for an extracted set of word utterances. The low false positive and false negative scores suggest the viability of modulation based parameters for building a keyword spotting system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Keyword Spotting Based On Decision Fusion

Automatic speech recognition (ASR) technology is available now-a-days in all handsets where keyword spotting plays a vital role. Keyword spotting performance significantly degrades when applied to real-world environment due to background noise. As visual features are not affected much by noise this provides better solution. In this paper, audio-visual integration is proposed which combines audi...

متن کامل

Robust Keyword Spotting Using a Multi-Stream Approach

Speech recognition systems are prone to severe degradation in noisy environments due to mismatch between training and testing conditions. A multi-stream approach for keyword spotting is proposed to improve robustness in mismatched conditions. The assumption is that most real world noises are colored and do not affect the full spectrum equally, meaning certain parts of the spectrum can still pro...

متن کامل

Comparison of keyword spotting methods for searching in speech

This paper presents and discusses keyword spotting methods for searching in speech. In contrast with searching in text, the searching in speech or generally in multimedia data still represents a challenge. The aim of the paper is to present a keyword spotting (KWS) method based on a large vocabulary continuous speech recognition (LVCSR) system, based on phonetics decoder, and keyword spotting u...

متن کامل

An Utterance Recognition Technique for Keyword Spotting by Fusion of Bark Energy and MFCC Features

This paper describes the preliminary results of a keyword spotting system using a fusion of spectral and cepstral features. Spectral energy in 16 bands of frequencies on Bark scale and 16 mel-scale warped cepstral coefficients are used independently and in combination with appropriate weights for recognizing word utterances. Results of matching features using Euclidean and cosine distances in a...

متن کامل

An Effective Approach for Chinese Speech Recognition on Small size of Vocabulary

In this paper, an effective approach for Chinese speech recognition on small vocabulary size is proposed the independent speech recognition of Chinese words based on Hidden Markov Model (HMM). The features of speech words are generated by sub-syllable of Chinese characters. Total 640 speech samples are recorded by 4 native males and 4 females with frequently speaking ability. The preliminary re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010