Multi-Resolution Speech Spectrogram



Similar resources

Multi-resolution for speech analysis

To deal with artifacts in the observation measurements that result from conventional speech processing, we propose to extend the representation of the speech signal by taking a sequence of sets of observations instead of a simple sequence of observations. Each set of observations is computed from a temporal Multi-Resolution (MR) analysis. This method is designed to be adapted to any usual mode and...


A super-resolution spectrogram using coupled PLCA

The short-time Fourier transform (STFT) based spectrogram is commonly used to analyze the time-frequency content of a signal. Depending on window size, the STFT provides a trade-off between time and frequency resolutions. This paper presents a novel method that achieves high resolution simultaneously in both time and frequency. We extend Probabilistic Latent Component Analysis (PLCA) to jointly...


Robust speech recognition using the modulation spectrogram

The performance of present-day automatic speech recognition (ASR) systems is seriously compromised by levels of acoustic interference (such as additive noise and room reverberation) representative of real-world speaking conditions. Studies on the perception of speech by human listeners suggest that recognizer robustness might be improved by focusing on temporal structure in the speech signal th...


Speech - Nonspeech discrimination based on speech-relevant spectrogram modulations

In this work, we adopt an information-theoretic approach, the Information Bottleneck method, to extract the relevant modulation frequencies across both dimensions of a spectrogram for speech/non-speech discrimination (music, animal vocalizations, environmental noises). A compact representation is built for each sound ensemble, consisting of the maximally informative features. We demonstrate th...


Multi-Time Resolution Analysis of Speech

Neuroscience and Cognitive Science Program, Department of Linguistics, Department of Biology, Department of Electrical & Computer Engineering, University of Maryland, College Park, MD 20742; Silicon Speech, 46 Oxford Drive, Santa Venetia, CA 94903; Department of Electrical and Electronics Engineering, Sophia University, Tokyo, Japan; Now at: Equipe ‘Audition’, LPP CNRS (FRE 2929), Université Paris 5...



Journal

Journal title: International Journal of Computer Applications

Year: 2011

ISSN: 0975-8887

DOI: 10.5120/1937-2587