A Perceptually Based Embedded Subband Speech Coder - Speech and Audio Processing, IEEE Transactions on

نویسندگان

Benjamim Tang

Albert Shen

Abeer Alwan

Gregory Pottie

چکیده

A new scheme for robust, high-quality, embedded speech coding based on subband decomposition and perceptually optimized bit allocation and prioritization is presented. An infinte impulse response (IIR) quadrature mirror filterbank (QMF) performs subband decomposition. A perceptual model, computed using subband spectral analysis, optimizes the coder’s perceptual quality. Dynamic bit allocation and prioritization is combined with embedded quantization resulting in little performance degradation relative to a nonembedded implementation. The coder output is scalable from high quality at higher bit rates to lower quality at lower bit rates, supporting a wide range of service and resource utilization. The lower bit-rate representation is obtained simply through truncation of the higher bit-rate representation. Since source-rate adaptation is performed through truncation of the encoded stream, interaction with the coder is not required, making the embedded coder ideally suited for rateadaptive communication systems. Performance for both speech and music was verified through subjective listening tests.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Joint filterbanks for echo cancellation and audio coding

In this paper, joint structures for audio coding and echo cancellation are investigated, utilizing standard audio coders. Two types of audio coders are considered, coders based on cosine modulated filterbanks and coders based on the modified discrete cosine transform (MDCT). For the first coder type, two methods for combining such a coder with a subband echo canceler are proposed. The two metho...

متن کامل

Variable-Rate CELP Based on Subband Flatness - Speech and Audio Processing, IEEE Transactions on

Code-excited linear prediction (CELP) is the predominant methodology for communications quality speech coding below 8 kbps, and several variable-rate CELP schemes have been discussed in the literature, including QCELP, the variable-rate wideband digital cellular mobile radio speech coding standard specified in IS-95. A key component of these speech coders is the detection and classification of ...

متن کامل

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

Perceptual audio coding using adaptive pre- and post-filters and lossless compression

This paper proposes a versatile perceptual audio coding method that achieves high compression ratios and is capable of low encoding/decoding delay. It accommodates a variety of source signals (including both music and speech) with different sampling rates. It is based on separating irrelevance and redundancy reductions into independent functional units. This contrasts traditional audio coding w...

متن کامل

An improved (Auto: I, LSP: T) constrained iterative speech enhancement for colored noise environments

In this correspondence we illustrate how the (Auto:I, LSP:T) constrained iterative speech enhancement algorithm can be extended to provide improved performance in colored noise environments. The modified algorithm, referred to here as noise adaptive (Auto:I, LSP:T), operates on subbanded signal components in which the terminating iteration is adjusted based on the a posteriori estimate of the s...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1997

A Perceptually Based Embedded Subband Speech Coder - Speech and Audio Processing, IEEE Transactions on

نویسندگان

چکیده

منابع مشابه

Joint filterbanks for echo cancellation and audio coding

Variable-Rate CELP Based on Subband Flatness - Speech and Audio Processing, IEEE Transactions on

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Perceptual audio coding using adaptive pre- and post-filters and lossless compression

An improved (Auto: I, LSP: T) constrained iterative speech enhancement for colored noise environments

عنوان ژورنال:

اشتراک گذاری