Invariant-integration method for robust feature extraction in speaker-independent speech recognition

نویسندگان

Florian Müller

Alfred Mertins

چکیده

The vocal tract length (VTL) is one of the variabilities that speaker-independent automatic speech recognition (ASR) systems encounter. Standard methods to compensate for the effects of different VTLs within the processing stages of the ASR systems often have a high computational effort. By using an appropriate warping scheme for the frequency centers of the timefrequency analysis, a change in VTL can be approximately described by a translation in the subband-index space. We present a new type of features that is based on the principle of invariant integration, and an according feature selection method is described. ASR experiments show the increased robustness of the proposed features in comparison to standard MFCCs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Auditory-Based Speech Feature Extraction Using Independent Subspace Method

In recent years many approaches have been developed to address the problem of robust speaker recognition in adverse acoustical environments. In this paper we propose a robust auditory-based feature extraction method for speaker recognition according to the characteristics of the auditory periphery and cochlear nucleus. First, speech signals are represented based on frequency selectivity at basi...

متن کامل

Contextual invariant-integration features for improved speaker-independent speech recognition

This work presents a feature-extraction method that is based on the theory of invariant integration. The invariant-integration features are derived from an extended time period, and their computation has a very low complexity. Recognition experiments show a superior performance of the presented feature type compared to cepstral coefficients using a mel filterbank (MFCCs) or a gammatone filterba...

متن کامل

Speaker feature extraction from pitch information based on spectral subtraction for speaker identification

Robust speaker feature extraction under noise conditions is an important issue for application of a speaker recognition system. It is well known that LPC cepstrum, which expresses the spectral envelope, is e ective for speaker recognition. This implies that the spectral rough structure is e ective for speaker recognition. However, LPC cepstrum is a noise-sensitive feature. On the other hand, sp...

متن کامل

Noise Robust Speaker-Independent Speech Recognition with Invariant-Integration Features Using Power-Bias Subtraction

This paper presents new results about the robustness of invariantintegration features (IIF) in noisy conditions. Furthermore, it is shown that a feature-enhancement method known as “powerbias subtraction” for noisy conditions can be combined with the IIF approach to improve its performance in noisy environments while keeping the robustness of the IIFs to mismatching vocaltract length training-t...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Invariant-integration method for robust feature extraction in speaker-independent speech recognition

نویسندگان

چکیده

منابع مشابه

Robust Auditory-Based Speech Feature Extraction Using Independent Subspace Method

Contextual invariant-integration features for improved speaker-independent speech recognition

Speaker feature extraction from pitch information based on spectral subtraction for speaker identification

Noise Robust Speaker-Independent Speech Recognition with Invariant-Integration Features Using Power-Bias Subtraction

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

عنوان ژورنال:

اشتراک گذاری