Application of Blind Source Separation in Speech Processing for Combined Interference Removal and Robust Speaker Detection Using a Two-microphone Setup

Authors

  • Erik Visser
  • Te-Won Lee
Abstract

A speech enhancement scheme is presented that integrates spatial and temporal signal processing methods for blind denoising in non-stationary noise environments. In a first stage, spatially localized interfering point sources are separated from noisy speech signals recorded by two microphones using a Blind Source Separation (BSS) algorithm that assumes no a priori knowledge about the sources involved. Spatially distributed background noise is removed in a second processing step, in which the BSS output channel containing the desired speaker is filtered with a time-varying Wiener filter. Noise power estimates for the filter coefficients are computed from time intervals in which the desired speaker is absent; these intervals are identified by comparing the signal energy of the separated source files from the BSS stage. The scheme's performance is illustrated by speech recognition experiments on real recordings corrupted by babble noise and is compared to conventional beamforming and single-channel denoising techniques.
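The second processing step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the non-overlapping rectangular frames, the energy-ratio rule, the gain floor, and the single averaged noise spectrum are all assumptions chosen for brevity. The sketch takes two already separated BSS output channels, flags frames where the target channel carries less energy than the other channel as speaker-absent, estimates the noise power spectrum from those frames, and applies a per-frame Wiener-style gain.

```python
import numpy as np

def frame_signal(x, frame_len):
    """Split x into non-overlapping frames (trailing samples are dropped)."""
    n_frames = len(x) // frame_len
    return x[:n_frames * frame_len].reshape(n_frames, frame_len)

def speaker_absent_frames(target_frames, interference_frames, ratio=1.0):
    """Flag frames where the target channel has less energy than the other channel.

    The comparison rule and `ratio` are illustrative assumptions.
    """
    e_target = np.sum(target_frames ** 2, axis=1)
    e_interf = np.sum(interference_frames ** 2, axis=1)
    return e_target < ratio * e_interf

def wiener_denoise(target, interference, frame_len=256, gain_floor=0.05):
    """Apply a per-frame Wiener-style gain to the target BSS output channel."""
    t = frame_signal(np.asarray(target, dtype=float), frame_len)
    i = frame_signal(np.asarray(interference, dtype=float), frame_len)
    absent = speaker_absent_frames(t, i)

    spec = np.fft.rfft(t, axis=1)
    power = np.abs(spec) ** 2

    # Noise power estimate: average spectrum over speaker-absent frames.
    if absent.any():
        noise_psd = power[absent].mean(axis=0)
    else:
        noise_psd = np.zeros(power.shape[1])

    # Gain 1 - N/P per frame and frequency bin, clipped to a small floor.
    gain = np.maximum(1.0 - noise_psd / np.maximum(power, 1e-12), gain_floor)
    out = np.fft.irfft(gain * spec, n=frame_len, axis=1)
    return out.reshape(-1), absent
```

Because the gain is recomputed for every frame from that frame's power spectrum, the filter varies over time even though this sketch keeps one fixed noise estimate; a closer approximation of a time-varying scheme would update the noise estimate as new speaker-absent frames arrive.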

Similar resources

Adaptive cross-channel interference cancellation on blind signal separation outputs using source absence/presence detection and spectral subtraction

The performance of blind source separation (BSS) is still not satisfactory for application in real environments. The major obstacles appear to be the finite filter length of the assumed mixing model and nonlinear sensor noise. This paper presents a two-step speech enhancement method with stereo microphone inputs. The first is an ordinary frequency-domain BSS step, and the second is the removal of...

A spatio-temporal speech enhancement scheme for robust speech recognition in noisy environments

A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial pro...

Blind Source Separation for Speech Application Under Real Acoustic Environment

A hands-free speech recognition system [1] is essential for the realization of an intuitive, unconstrained, and stress-free human-machine interface, where users can talk naturally because they require no microphone in their hands. In this system, however, since noise and reverberation always degrade speech quality, it is difficult to achieve high recognition performance, compared with the case ...

Spatio-temporal Speech Enhancement for Robust Speech Recognition

A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial pro...

A comparison of simultaneous 3-channel blind source separation to selective separation on channel pairs using 2-channel BSS

A number of real-life speech applications using BSS have been reported for two channel applications but only a few have been reported for multi-channel (more than 2 channels) applications. Moreover these mostly involve simulation studies or real-life separations in controlled settings. In this paper some practical problems of multichannel applications will be analyzed. A methodology is proposed...

Publication date: 2003