Multi-Resolution Speech Spectrogram
Similar resources
Multi-resolution for speech analysis
To deal with artifacts in the observation measurements produced by conventional speech processing, we propose to extend the representation of the speech signal by taking a sequence of sets of observations instead of a simple sequence of observations. Each set of observations is computed from a temporal Multi-Resolution (MR) analysis. This method is designed to be adapted to any usual mode and...
A super-resolution spectrogram using coupled PLCA
The short-time Fourier transform (STFT) based spectrogram is commonly used to analyze the time-frequency content of a signal. Depending on window size, the STFT provides a trade-off between time and frequency resolutions. This paper presents a novel method that achieves high resolution simultaneously in both time and frequency. We extend Probabilistic Latent Component Analysis (PLCA) to jointly...
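The window-size trade-off described in this abstract can be illustrated with a minimal sketch. It is not the coupled-PLCA method of the paper; it only computes plain STFT magnitudes at several window lengths using NumPy. The function names `stft_magnitude` and `resolutions`, the 16 kHz sample rate, the 1 kHz test tone, and the 50% hop are all assumptions chosen for illustration.

```python
import numpy as np

def stft_magnitude(signal, win_len, hop):
    """Magnitude STFT with a Hann window; shape is (frames, win_len // 2 + 1)."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(signal) - win_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + win_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1))

def resolutions(win_len, sample_rate):
    """Return (window duration in seconds, frequency-bin spacing in Hz)."""
    return win_len / sample_rate, sample_rate / win_len

# Assumed test signal: one second of a 1 kHz tone sampled at 16 kHz.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
signal = np.sin(2 * np.pi * 1000 * t)

# One spectrogram per window length, each with a hop of half the window.
multi_res = {n: stft_magnitude(signal, n, n // 2) for n in (128, 512, 2048)}

for n, spec in multi_res.items():
    duration, bin_spacing = resolutions(n, sample_rate)
    print(f"window={n:5d}  frames x bins={spec.shape}  "
          f"window duration={duration * 1000:.1f} ms  bin spacing={bin_spacing:.2f} Hz")
```

Short windows give many frames with coarse frequency bins, and long windows give few frames with fine bins; the product of window duration and bin spacing stays at 1, which is the trade-off the abstract refers to.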
Robust speech recognition using the modulation spectrogram
The performance of present-day automatic speech recognition (ASR) systems is seriously compromised by levels of acoustic interference (such as additive noise and room reverberation) representative of real-world speaking conditions. Studies on the perception of speech by human listeners suggest that recognizer robustness might be improved by focusing on temporal structure in the speech signal th...
Speech - Nonspeech discrimination based on speech-relevant spectrogram modulations
In this work, we adopt an information-theoretic approach, the Information Bottleneck method, to extract the relevant modulation frequencies across both dimensions of a spectrogram for speech / non-speech discrimination (music, animal vocalizations, environmental noises). A compact representation is built for each sound ensemble, consisting of the maximally informative features. We demonstrate th...
Multi-Time Resolution Analysis of Speech
Neuroscience and Cognitive Science Program, Department of Linguistics, Department of Biology, Department of Electrical & Computer Engineering, University of Maryland, College Park MD 20742; Silicon Speech, 46 Oxford Drive, Santa Venetia, CA 94903; Department of Electrical and Electronics Engineering, Sophia University, Tokyo, Japan; Now at: Equipe ‘Audition’, LPP CNRS (FRE 2929), Université Paris 5...
Journal
Journal title: International Journal of Computer Applications
Year: 2011
ISSN: 0975-8887
DOI: 10.5120/1937-2587