Spectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform

نویسنده

  • László LOIS
چکیده

In this paper, the use of optimal KarhunenLoeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to encode efficiently the transform parameters after operating one or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Huffman variable length coding (VLC). The basic idea in developing these schemes is utilizing the strong correlation of LSF parameters to reduce the bit rate for a given level of fidelity. Since the use of global statistics for generating the coding scheme may not be appropriate, we propose several adaptive KL transform systems (AKL) to encode the LSF parameters. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the proposed KL transform coding systems introduce as good as or better performance for both SQ and VQ in the examined bit rates compared to other methods in the field of LSF coding. key words: speech coding, line spectral frequency, KarhunenLoeve transform, adaptive transform coding

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient distance measure for quantization of LSF and its Karhunen-Loeve transformed parameters

This paper presents a new distance measure that is based on the spectral sensitivity of the line spectrum frequency parameters (LSFs) and its Karhunen–Loeve (KL) transformed coefficients. It is shown that the proposed distance measure achieves better performance of vector quantization (VQ) compared to other methods in the field of LSF coding. In most cases, the percentage of outliers is reduced...

متن کامل

Variable length coding of transformed LSF coefficients

In this paper, the use of Karhunen-Loeve transform (KLT) and discrete cosine transform (DCT) is studied for encoding of the line spectrum frequency (LSF) parameters at variable bit rate (VBR). For VBR coding, scalar quantization (SQ) is used with Huffman coding. The basic idea in developing these schemes is using linear transform to exploit the strong intraframe and interframe correlation of LS...

متن کامل

Gaussian Mixture Model-based Quantization of Line Spectral Frequencies for Adaptive Multirate Speech Codec

In this paper, we investigate the use of a Gaussian Mixture Model (GMM)-based quantizer for quantization of the Line Spectral Frequencies (LSFs) in the Adaptive Multi-Rate (AMR) speech codec. We estimate the parametric GMM model of the probability density function (pdf) for the prediction error (residual) of mean-removed LSF parameters that are used in the AMR codec for speech spectral envelope...

متن کامل

Speech Enhancement with Signal Subspace Filter Based on Perceptual Post Filtering

A novel technique is presented to design the signal subspace speech enhancement based on perceptual post filtering. Firstly, by subspace filter the noisy speech is enhanced. The underlying principle is to decompose the vector space of the noisy signal into a signal plus noise subspace and a noise subspace. The decomposition can theoretically be performed by applying the Karhunen-Loeve transform...

متن کامل

Two stage transform vector quantization of LSFs for wideband speech coding

We investigate the use of a two stage transform vector quantizer (TSTVQ) for coding of line spectral frequency (LSF) parameters in wideband speech coding. The first stage quantizer of TSTVQ, provides better matching of source distribution and the second stage quantizer provides additional coding gain through using an individual cluster specific decorrelating transform and variance normalization...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999