speech coding

A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion

2013

Kou Tanaka Tomoki Toda Graham Neubig Sakriani Sakti Satoshi Nakamura

We present a hybrid approach to improving naturalness of electrolaryngeal (EL) speech while minimizing degradation in intelligibility. An electrolarynx is a device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient laryngectomees can produce quite intelligible EL speech, it sounds very unnatural due to the mechanical excitation produ...

متن کامل

Uniform Speech Parameterization for Multi-Form Segment Synthesis

2011

Alexander Sorin Slava Shechtman Vincent Pollet

In multi-form segment synthesis speech is constructed by sequencing speech segments of different nature: model segments, i.e. mathematical abstractions of speech and template segments, i.e. speech waveform fragments. These multi-form segments can have shared, layered or alternate speech parameterization schemes. This paper introduces an advanced uniform speech parameterization scheme for statis...

متن کامل

Automatic acoustic segmentation for speech recognition on broadcast recordings

2007

Gang Peng Mei-Yuh Hwang Mari Ostendorf

This paper investigates the issue of automatic segmentation of speech recordings for broadcast news (BN) and broadcast conversation (BC) speech recognition. Our previous segmentation algorithm often exhibited high deletion errors, where some speech segments were misclassified as non-speech and thus were never passed on to the recognizer. In contrast with our previous segmentation models, which ...

متن کامل

Eliciting and Annotating Uncertainty in Spoken Language

2014

Heather Pon-Barry Stuart M. Shieber Nicholas Longenbaugh

A major challenge in the field of automatic recognition of emotion and affect in speech is the subjective nature of affect labels. The most common approach to acquiring affect labels is to ask a panel of listeners to rate a corpus of spoken utterances along one or more dimensions of interest. For applications ranging from educational technology to voice search to dictation, a speaker’s level of...

متن کامل

New compression and decompression of speech signals by a Neural Predictive Coding (NPC)

2001

J. L. ZARADER C. CHAVY

In this paper, we present a new method of speech compression and decompression based on a Neural Predictive Coding of speech signals. The NPC system is designed to predict the samples of a speech signal window from previous ones. In the coder/decoder that we proposed the transmitted data is computed from the prediction error of the NPC (difference between the sample and its corresponding predic...

متن کامل

Personal speech coding

1998

Wenhui Jin Wai-Yip Chan

In existing speech coding systems, all quantizer codebooks are designed to suit the statistical and perceptual characteristics of speech signals of a population of speakers. However, an individual’s speech signal does not exhibit, even over a long time, the entire range of characteristics of the population. With the advent of the personal communication systems, personal information might become...

متن کامل

Speech Coding and Recognition

2009

M. Satya Sai Ram P. Siddaiah Madhavi Latha

This paper investigates the performance of a speech recognizer in an interactive voice response system for various coded speech signals, coded by using a vector quantization technique namely Multi Switched Split Vector Quantization Technique. The process of recognizing the coded output can be used in Voice banking application. The recognition technique used for the recognition of the coded spee...

متن کامل

Speech Coding & Recognition

2009

M. Satya Sai Ram P. Siddaiah Madhavi Latha

This paper investigates the performance of a speech recognizer in an interactive voice response system for various coded speech signals, coded by using a vector quantization technique namely Multi Switched Split Vector Quantization Technique. The process of recognizing the coded output can be used in Voice banking application. The recognition technique used for the recognition of the coded spee...

متن کامل

Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition

Journal: International Journal of Engineering 2003

B. Moshiri, P. Eslambolchi, Reza HoseinNezhad,

In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...

متن کامل

Automatic Extraction of Drum Tracks from Polyphonic Music Signals

2002

Aymeric Zils François Pachet Olivier Delerue Fabien Gouyon

We propose an approach for extracting automatically time indexes of occurrences of percussive sounds in an audio signal taken from the Popular music repertoire. The scheme is able to detect percussive sounds unknown a priori in a selective fashion. It is based on an analysis by synthesis technique, whereby the sound searched for is gradually synthesized from the signal itself. The possibility t...

متن کامل