Variable bit-rate sinusoidal transform coding using variable order spectral estimation

Authors

  • Ning Li
  • Derek J. Molyneux
  • Meau Shin Ho
  • Barry M. G. Cheetham
Abstract

Sinusoidal transform coding (STC) is known to be capable of producing good communication-quality speech at bit rates below 4 kb/s. Discrete all-pole (DAP) modelling is an alternative spectral estimation method that can be more accurate than the conventional linear prediction (LP) analysis normally used by STC. Achieving the highest possible speech quality at ever lower average bit rates in variable bit-rate coding schemes requires ways of varying the number of parameters according to the characteristics of each speech frame. This paper considers the advantage to be gained by varying the all-pole model order according to the discrete Itakura-Saito (IS) distance measure used in DAP. Compared with a fixed-order model, a significant reduction is achieved in the average number of parameters to be quantised, while the speech quality remains the same.
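The idea of choosing the model order from the discrete IS distance can be illustrated with a minimal Python sketch. It is not the authors' algorithm. It assumes the spectrum is sampled at a set of harmonic frequencies, and it swaps in an ordinary LP (autocorrelation) fit to those samples in place of the iterative DAP procedure. The function names, the threshold, and the rule "take the lowest order whose distance falls at or below the threshold" are all hypothetical choices made for illustration.

```python
import cmath
import math


def discrete_autocorrelation(power, omegas, max_lag):
    """R(k) = (1/N) * sum_m P(w_m) * cos(k * w_m) for k = 0..max_lag."""
    n = len(power)
    return [sum(p * math.cos(k * w) for p, w in zip(power, omegas)) / n
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Return (a, err) with A(z) = sum_k a[k] z^-k, a[0] = 1, err = model gain."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def model_power(a, err, omega):
    """All-pole power spectrum err / |A(e^{jw})|^2 at a single frequency."""
    resp = sum(c * cmath.exp(-1j * omega * k) for k, c in enumerate(a))
    return err / abs(resp) ** 2


def discrete_is_distance(power, model):
    """Mean of P/P_hat - ln(P/P_hat) - 1 over the sampled frequencies."""
    ratios = [p / q for p, q in zip(power, model)]
    return sum(x - math.log(x) - 1.0 for x in ratios) / len(ratios)


def select_order(power, omegas, min_order, max_order, threshold):
    """Hypothetical rule: lowest order with IS distance <= threshold.

    Falls back to max_order when no order in the range meets the threshold.
    The model is an LP fit to the sampled spectrum (a stand-in for DAP).
    """
    if min_order > max_order:
        raise ValueError("min_order must not exceed max_order")
    r = discrete_autocorrelation(power, omegas, max_order)
    dist = float("inf")
    for order in range(min_order, max_order + 1):
        a, err = levinson_durbin(r, order)
        model = [model_power(a, err, w) for w in omegas]
        dist = discrete_is_distance(power, model)
        if dist <= threshold:
            return order, dist
    return max_order, dist


if __name__ == "__main__":
    # Synthetic frame: 20 harmonics of a made-up two-pole envelope.
    omegas = [0.15 * m for m in range(1, 21)]
    power = [model_power([1.0, -1.2, 0.8], 1.0, w) for w in omegas]
    print(select_order(power, omegas, 1, 10, threshold=0.05))
```

In a variable-rate coder the selected order would determine how many spectral parameters are quantised for the frame, so a looser threshold trades envelope accuracy for fewer parameters. The threshold value used here is arbitrary.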


Similar resources

An improved residual-domain phase/amplitude model for sinusoidal coding of speech at very low bit rates: a variable rate scheme

An improved harmonic sinusoidal model is presented, where the underlying sine wave amplitudes and phases are efficiently represented using a combination of linear prediction, linear phase alignment, all-pass filtering, and spectral sampling in the residual domain. The analysis and synthesis systems are introduced and the derivation and encoding of each model parameter is discussed. Performance anal...


Spectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform

In this paper, the use of the optimal Karhunen-Loeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to efficiently encode the transform parameters after applying a one- or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Hu...


Phase modelling of speech excitation for low bit-rate sinusoidal transform coding

Sinusoidal transform coding (STC) techniques model speech as the sum of sine waves whose frequencies, amplitudes and phases are specified at regular intervals. To achieve a low bit-rate representation, only the spectral envelope is encoded and the phases are regenerated according to a minimum phase assumption. In this paper, the inaccuracy of the minimum phase model is demonstrated. It is shown...


Video Coding with R-D Constrained Hierarchical Variable Block Size (VBS) Motion Estimation

The variable block size (VBS) motion estimation technique allows larger blocks to be used when smaller blocks yield little gain, saving bit rate especially for areas containing more complex motion. However, employing the VBS motion estimation technique introduces a new optimization issue for motion-compensated transform coding, because an increase in bit rate allocation is neces...





Publication date: 2000