Missing data techniques: Feature reconstruction
Authors
Abstract
Automatic speech recognition (ASR) performance degrades rapidly as speech is corrupted by increasing levels of noise. Missing data techniques (MDTs) constitute a family of methods that tackle noise-robust speech recognition based on the so-called missing data assumption proposed in [1]. MDTs assume that (i) the noisy speech signal can be divided into speech-dominated (reliable) and noise-dominated (unreliable) spectro-temporal components prior to decoding and (ii) the unreliable components do not retain any information about the corresponding clean speech values. This means that the clean speech values corresponding to noise-dominated components are effectively missing, and speech recognition must proceed with partially observed data. Techniques for speech recognition with missing features fall roughly into two categories: marginalization and feature reconstruction. The marginalization approach, discussed in Chapter ??, disregards the missing components when calculating acoustic model likelihoods: the likelihoods that correspond to the missing components are calculated by integrating over the full range of possible missing feature values [2, 3]. In this chapter, we focus on the reconstruction approach, where the missing values are substituted (imputed) with clean speech estimates prior to calculating the acoustic model likelihoods [4, 5, 6]. Since the reconstructed features do not contain any missing data, the likelihood calculation does not need to be modified. In general, all missing feature imputation methods employ a model of clean speech to estimate the missing values. The models range from simple smoothness assumptions [6] to advanced statistical models and exemplar-based approaches, although the acoustic models employed by the recognizer may also be used. Given the clean speech model and a noisy observation, the missing features are estimated as the values that best match the model's assumptions about clean speech at the missing locations.
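As a rough illustration of the reconstruction idea, the Python sketch below imputes unreliable components using about the simplest clean speech model available, a smoothness assumption along time. The function name impute_missing, the channels-by-frames array layout, and the choice of linear interpolation are assumptions made for this example; it is not the method of any of the cited references, and the reliability mask is taken as given.

```python
import numpy as np


def impute_missing(noisy, reliable):
    """Fill unreliable spectro-temporal cells by interpolating along time.

    noisy:    (n_channels, n_frames) array of spectral features (hypothetical layout)
    reliable: boolean array of the same shape, True where speech dominates
    """
    noisy = np.asarray(noisy, dtype=float)
    reliable = np.asarray(reliable, dtype=bool)
    frames = np.arange(noisy.shape[1])
    reconstructed = noisy.copy()

    for ch in range(noisy.shape[0]):
        known = reliable[ch]
        if not known.any():
            # No reliable frame in this channel: nothing to interpolate from,
            # so the noisy values are left unchanged.
            continue
        # Linear interpolation between reliable frames; beyond the first and
        # last reliable frame, np.interp holds the nearest reliable value.
        reconstructed[ch] = np.interp(frames, frames[known], noisy[ch, known])

    return reconstructed


# Toy example: two channels, four frames, with a given reliability mask.
noisy = np.array([[1.0, 9.0, 3.0, 9.0],
                  [2.0, 2.0, 8.0, 8.0]])
reliable = np.array([[True, False, True, False],
                     [True, True, False, False]])
print(impute_missing(noisy, reliable))
```

Because the output has no missing entries, it could be passed to an unmodified likelihood computation, which is the practical appeal of reconstruction over marginalization. The statistical and exemplar-based models mentioned above would replace the interpolation step with a richer clean speech model.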
Similar resources
A new method for robust speech recognition based on missing data using a bidirectional neural network
The performance of speech recognition systems is greatly reduced when speech is corrupted by noise. Missing feature methods are a common approach to robust speech recognition. In these methods, the components of the time-frequency representation of the signal (the spectrogram) that have a low signal-to-noise ratio (SNR) are tagged as missing, deleted, and then replaced based on the remaining components and statistical ...
Closed-Form Solutions for Affine Reconstruction under Missing Data
The known factorization algorithm for maximum likelihood affine reconstruction requires that all feature points used be visible in all views. We derive here a closed-form expression for the 3D coordinates of the feature points and translation vectors given the inhomogeneous affine projection matrices, but no single feature point is required to be visible in all views. The expressio...
Using global climate data to reconstruct statistical gaps in temperature and precipitation data (case study: stations of the Khanmirza watershed)
Introduction: Because data quality is important, the problem of filling in missing data has attracted a great deal of interest. Regeneration methods for missing data can be classified into classical and modern categories. Application of statistical methods such as relationships with nearby stations and approaches based on hydrological, climatological or physiographical simila...
Particle Filter Based Soft-mask Estimation for Missing Feature Reconstruction
In this work, we show how particle filter (PF) based speech feature enhancement can profitably be combined with soft-decision missing feature reconstruction. The combined approach is motivated by the fact that standard minimum mean square error noise compensation techniques fail to give accurate estimates of the clean speech spectrum if the noise spectral power significantly exceeds that of spe...
Vector Autoregressive Model for Missing Feature Reconstruction
This paper proposes a Vector Autoregressive (VAR) model as a new technique for missing feature reconstruction in ASR. We model the spectral features using multiple VAR models. A VAR model predicts missing features as a linear function of a block of feature frames. We also propose two schemes for VAR training and testing. Experiments on the AURORA-2 database have validated the modeling methodolo...
Journal title:
Volume, issue:
Pages: -
Publication date: 2011