Audio coding using a psychoacoustic pre- and post-filter
نویسندگان
چکیده
A novel concept for perceptual audio coding is presented which is based on the combination of a preand post-filter, controlled by a psychoacoustic model, with a transform coding scheme. This paradigm allows modeling of the temporal and spectral shape of the masked threshold with a resolution independent of the used transform. By using frequency warping techniques the maximum possible detail for a given filter order can be made frequency-dependent and thus better adapted to the human auditory system. The filter coefficients are represented efficiently by LSF parameters which can be adaptively interpolated over time. First experiments with a system obtained by extending an existing transform codec showed that this approach can significantly improve the performance for speech signals, while the performance for other signals remained the same.
منابع مشابه
Perceptual Audio Coding Using a Time-Varying Linear Pre- and Post-Filter
Recently, a new concept for perceptual audio coding was presented, which is based on a prefilter in the encoder and a corresponding post-filter in the decoder, both controlled by a psychoacoustic model. It enables individual selection of spectral and temporal resolutions for irrelevancy reduction and redundancy reduction. This paper addresses problems related to the efficient transmission of th...
متن کاملGaussian mixture model based audio coding in a perceptual domain
Gaussian mixture model based vector quantization (GMM-VQ) is a powerful technique for structured vector quantization. This thesis describes the implementation and evaluation of a perceptual audio coder using GMM-VQ. It is combined with a perceptual transform using psychoacoustic preand post-filtering. The coder was tested for low rate mono audio coding (64 kbps) and the results were compared to...
متن کاملImproved audio coding using a psychoacoustic model based on a cochlear filter bank
Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the properties of masking. Most psychoacoustic models for coding applications use a uniform (equal bandwidth) spectral decomposition as a first step to approximate the freq...
متن کاملAn Improved Psychoacoustic Model for Audio Coding Based on Wavelet Packet
This paper describes a new design of a psychoacoustic model for audio coding following the model used in the standard MPEG-1 audio layer 3 using an appropriate wavelet packet decomposition of the speech/audio signal. The design of a psychoacoustic model is achieved by wavelet packet decomposition whose connections are selected in such a way that sub bands correspond to the best possible one to ...
متن کاملImproved Noise Weighting in CELP Coding of Speech - Applying the Vorbis Psychoacoustic Model To Speex
One key aspect of the CELP algorithm is that it shapes the coding noise using a simple, yet effective, weighting filter. In this paper, we improve the noise shaping of CELP using a more modern psychoacoustic model. This has the significant advantage of improving the quality of an existing codec without the need to change the bit-stream. More specifically, we improve the Speex CELP codec by usin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000