speech synthesis method

Evaluation of a speech synthesis method for nonlinear modeling of vocal folds vibration effect

1997

Hiroshi Ohmura Kazuyo Tanaka

In this paper, we present a new speech synthesis method for improving voice quality in parametric rule-based speech synthesis systems. We also describe the results of a preference test on speech wave reconstruction to con rm the performance of the proposed method. The method is based on the functional approximation of vocal tract resonance produced by nonlinear interaction between the glottis a...

متن کامل

Applications of computer generated expressive speech for communication disorders

2003

Jan P. H. van Santen Lois M. Black Gilead Cohen Alexander Kain Esther Klabbers Taniya Mishra Jacques de Villiers Xiaochuan Niu

This paper focuses on generation of expressive speech, specifically speech displaying vocal affect. Generating speech with vocal affect is important for diagnosis, research, and remediation for children with autism and developmental language disorders. However, because vocal affect involves many acoustic factors working together in complex ways, it is unlikely that we will be able to generate c...

متن کامل

Clustering Expressive Speech Styles in Audiobooks Using Glottal Source Parameters

2011

Éva Székely João P. Cabral Peter Cahill Julie Carson-Berndsen

A great challenge for text-to-speech synthesis is to produce expressive speech. The main problem is that it is difficult to synthesise high-quality speech using expressive corpora. With the increasing interest in audiobook corpora for speech synthesis, there is a demand to synthesise speech which is rich in prosody, emotions and voice styles. In this work, Self-Organising Feature Maps (SOFM) ar...

متن کامل

Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion

Journal: :EURASIP J. Audio, Speech and Music Processing 2017

Gia Nhu Nguyen Trung-Nghia Phung

Speech synthesis has been applied in many kinds of practical applications. Currently, state-of-the-art speech synthesis uses statistical methods based on hidden Markov model (HMM). Speech synthesized by statistical methods can be considered over-smooth caused by the averaging in statistical processing. In the literature, there have been many studies attempting to solve over-smoothness in speech...

متن کامل

Broadcast Technology

2012

NHK STRL 10 Speech synthesis is a very convenient means of conveying spoken information without the need for human labor. The most commonly used method of speech synthesis in program production compiles speech recorded during different tasks. Additionally, the textto-speech (TTS) synthesis, which can synthesize speech from any text, can be used to deal with dialogue spoken with unemotional or u...

متن کامل

Two-Band Excitation for HMM-Based Speech Synthesis

Journal: :IEICE Transactions 2007

Sang-Jin Kim Minsoo Hahn

© 2009 Seungho Han et al. 457 ABSTRACT⎯The optimum maximum voiced frequency (MVF) estimation-based two-band excitation for hidden Markov model-based speech synthesis is presented. An analysis-by-synthesis scheme is adopted for the MVF estimation which leads to the minimum spectral distortion of synthesized speech. Experimental results show that the proposed method significantly improves synthet...

متن کامل

A low bit rate speech coding method using a formant-articulatory parameter nomogram

2000

Hiroshi Ohmura Akira Sasou Kazuyo Tanaka

In this paper, we propose a new method for low bit rate speech coding using a nomogram that is a pair of codebooks representing the functional relationship between formant frequencies and articulatory parameters. Significant features of our approach are 1) using the codebooks derived theoretically from the computation using a stylized vocal tract model and 2) independent coding by separating fr...

متن کامل

Speech analysis/synthesis/conversion by using sequential processing

1999

Panuthat Boonpramuk Tetsuo Funada Noboru Kanedera

This paper presents a method for speech analysis/synthesis/ conversion by using sequential processing. The aims of this method are to improve the quality of synthesized speech and to convert the original speech into another speech of different characteristics. We apply the Kalman Filter for estimating the auto-regressive coefficients of ‘time varying AR model with unknown input (ARUI model)’, w...

متن کامل

Title Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models( Published Version ) Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models

2016

Liang Dong

The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov mode...

متن کامل

An automatic pitch-marking method using wavelet transform

2000

Masaharu Sakamoto Takashi Saitoh

This paper describes a new automatic pitch-marking method using wavelet transform. This method detects discontinuity in the speech waveform which occurs at the glottal closure instant (GCI). A time domain prosodic modification technique requires an appropriate determination of the synthesis pitch-marks. We evaluated the performance of the newly developed pitchmarking method by using our interna...

متن کامل