Variable bit-rate sinusoidal transform coding using variable order spectral estimation
نویسندگان
چکیده
Sinusoidal transform coding (STC) is known to be capable of producing good communication quality speech coded at bitrates below 4kb/s. Discrete all-pole modelling (DAP) is an alternative spectral estimation method which can be more accurate than the conventional linear prediction (LP) analysis normally used by STC. In the quest to achieve the highest possible speech quality at lower and lower average bit-rates in variable bit-rate coding schemes, more and more effort must be made to investigate ways of varying the number of parameters according to the characteristics of each speech frame. This paper considers the advantage to be gained by varying the all-pole model order according to the discrete Itakura-Saito (IS) distance measure used in DAP. A significant reduction is achieved in the average number of parameters to be quantised compared to the fixed order model while the speech quality remains the same.
منابع مشابه
An improved residual-domain phase/amplitude model for sinusoidal coding of speech at very low bit rates: a variable rate scheme
An improved harmonic sinusoidal model is presented, where the underlying sine wave amplitudes and phases are e ciently represented using a combination of linear prediction, linear phase alignment, all-pass ltering, and spectral sampling in the residual-domain. The analysis and synthesis systems are introduced and the derivation and encoding of each model parameter is discussed. Performance anal...
متن کاملSpectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform
In this paper, the use of optimal KarhunenLoeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to encode efficiently the transform parameters after operating one or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Hu...
متن کاملPhase modelling of speech excitation for low bit-rate sinusoidal transform coding
Sinusoidal transform coding (STC) techniques model speech as the sum of sine-waves whose frequencies, amplitudes and phases are specified at regular intervals. To achieve a low-bit rate representation, only the spectral envelope is encoded and the phases are regenerated according to a minimum phase assumption. In this paper, the inaccuracy of the minimum phase model is demonstrated. It is shown...
متن کاملVideo Coding with R-D Constrained Hierarchical Variable Block Size (VBS) Motion Estimation
The variable block size(VBS) motion estimation technique allows for larger blocks to be used when smaller blocks yield little gain, saving bit rates especially for areas containing more complex motion. However, the employment of the VBS motion estimation technique introduces a new optimization issue for the motion compensated transform coding, because an increase in bit rate allocation is neces...
متن کاملVideo Coding with R - D ConstrainedHierarchical Variable Block Size ( VBS )
The variable block size(VBS) motion estimation technique allows for larger blocks to be used when smaller blocks yield little gain, saving bit rates especially for areas containing more complex motion. However, the employment of the VBS motion estimation technique introduces a new optimization issue for the motion compensated transform coding, because an increase in bit rate allocation is neces...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000