Audio coding using a psychoacoustic pre- and post-filter

نویسندگان

  • Bernd Edler
  • Gerald Schuller
چکیده

A novel concept for perceptual audio coding is presented which is based on the combination of a preand post-filter, controlled by a psychoacoustic model, with a transform coding scheme. This paradigm allows modeling of the temporal and spectral shape of the masked threshold with a resolution independent of the used transform. By using frequency warping techniques the maximum possible detail for a given filter order can be made frequency-dependent and thus better adapted to the human auditory system. The filter coefficients are represented efficiently by LSF parameters which can be adaptively interpolated over time. First experiments with a system obtained by extending an existing transform codec showed that this approach can significantly improve the performance for speech signals, while the performance for other signals remained the same.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Perceptual Audio Coding Using a Time-Varying Linear Pre- and Post-Filter

Recently, a new concept for perceptual audio coding was presented, which is based on a prefilter in the encoder and a corresponding post-filter in the decoder, both controlled by a psychoacoustic model. It enables individual selection of spectral and temporal resolutions for irrelevancy reduction and redundancy reduction. This paper addresses problems related to the efficient transmission of th...

متن کامل

Gaussian mixture model based audio coding in a perceptual domain

Gaussian mixture model based vector quantization (GMM-VQ) is a powerful technique for structured vector quantization. This thesis describes the implementation and evaluation of a perceptual audio coder using GMM-VQ. It is combined with a perceptual transform using psychoacoustic preand post-filtering. The coder was tested for low rate mono audio coding (64 kbps) and the results were compared to...

متن کامل

Improved audio coding using a psychoacoustic model based on a cochlear filter bank

Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the properties of masking. Most psychoacoustic models for coding applications use a uniform (equal bandwidth) spectral decomposition as a first step to approximate the freq...

متن کامل

An Improved Psychoacoustic Model for Audio Coding Based on Wavelet Packet

This paper describes a new design of a psychoacoustic model for audio coding following the model used in the standard MPEG-1 audio layer 3 using an appropriate wavelet packet decomposition of the speech/audio signal. The design of a psychoacoustic model is achieved by wavelet packet decomposition whose connections are selected in such a way that sub bands correspond to the best possible one to ...

متن کامل

Improved Noise Weighting in CELP Coding of Speech - Applying the Vorbis Psychoacoustic Model To Speex

One key aspect of the CELP algorithm is that it shapes the coding noise using a simple, yet effective, weighting filter. In this paper, we improve the noise shaping of CELP using a more modern psychoacoustic model. This has the significant advantage of improving the quality of an existing codec without the need to change the bit-stream. More specifically, we improve the Speex CELP codec by usin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000