speech synthesis method

Automatic feature selection for acoustic-visual concatenative speech synthesis: towards a perceptual objective measure

2013

Utpala Musti Vincent Colotte Slim Ouni Caroline Lavecchia Brigitte Wrobel-Dautcourt Marie-Odile Berger

We present an iterative algorithm for automatic feature selection and weight tuning of target cost in the context of unit selection based audio-visual speech synthesis. We perform feature selection and weight tuning for a given unit-selection corpus to make the ranking given by the target cost function consistent with the ordering given by an objective dissimilarity measure. We explicitly perfo...

متن کامل

روشی مناسب جهت ساخت بتالاکتامهای بدون استخلاف اتم نیتروژن

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه شهید بهشتی - دانشکده علوم 1371

رضا چنارانی, محمد صادق خواجوی, ایرج قاضی,

we describe here a suitable approach for the synthesis of n-unsubstituted monocyclic b-lactams under mild reaction conditions by the annelation of imines with substituted acetylchlorides. in this method the reaily available phtalimidoacetyl chloride were allowed to react with - dibenzylideneiminotoluene (hydrobenzamide) in the presence of an equimolar amount of triethylamine in refluxing toluen...

15 صفحه اول

An enhanced ABS/OLA sinusoidal model for waveform synthesis in TTS

1999

Michael W. Macon Mark A. Clements

This paper describes a method for text-to-speech waveform synthesis based on the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal model. This model has been shown in previous work to be a useful framework for pitch and time-scale modi cation of both speech and music signals. This paper explores some extensions of the original ABS/OLA formulation that attempt to overcome speci c artifacts,...

متن کامل

Source-filter separation for articulation-to-speech synthesis

2004

Yoshinori Shiga Simon King

In this paper we examine a method for separating out the vocal-tract filter response from the voice source characteristic using a large articulatory database. The method realises such separation for voiced speech using an iterative approximation procedure under the assumption that the speech production process is a linear system composed of a voice source and a vocal-tract filter, and that each...

متن کامل

Spectral Study of the Vocal Tract in Vowel Synthesis: A Comparison between 1D and 3D Acoustic Analysis

Journal: :CoRR 2015

Negar M. Harandi Daniel Aalto Antti Hannukainen Jarmo Malinen Sidney S. Fels

A state-of-the-art 1D acoustic synthesizer has been previously developed, and coupled to speaker-specific biomechanical models of oropharynx in ArtiSynth. As expected, the formant frequencies of the synthesized vowel sounds were shown to be different from those of the recorded audio. Such discrepancy was hypothesized to be due to the simplified geometry of the vocal tract model as well as the o...

متن کامل

Automatic phonetic transcription of large speech corpora: a comparative study

2006

Christophe Van Bael Lou Boves Henk van den Heuvel Helmer Strik

This study investigates whether automatic transcription procedures can approximate manual phonetic transcriptions typically delivered with contemporary large speech corpora. We used ten automatic procedures to generate a broad phonetic transcription of well-prepared speech (read-aloud texts) and spontaneous speech (telephone dialogues). The resulting transcriptions were compared to manually ver...

متن کامل

Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition

2014

Miran Pobar Sanda Martinčić-Ipšić Ivo Ipšić

A well known problem in unit selection speech synthesis is designing the join and target function sub-costs and optimizing their corresponding weights so that they reflect the human listeners’ preferences. To achieve this we propose a procedure where an objective criterion for optimal speech unit selection is used. The objective criterion for tuning the cost function weights is based on automat...

متن کامل

A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models

Journal: :IEICE Transactions 2016

Shinnosuke Takamichi Tomoki Toda Graham Neubig Sakriani Sakti Satoshi Nakamura

This paper presents a novel statistical sample-based approach for Gaussian Mixture Model (GMM)-based Voice Conversion (VC). Although GMM-based VC has the promising flexibility of model adaptation, quality in converted speech is significantly worse than that of natural speech. This paper addresses the problem of inaccurate modeling, which is one of the main reasons causing the quality degradatio...

متن کامل

The Sound of Deception - What Makes a Speaker Credible?

2017

Anne Schröder Simon Stone Peter Birkholz

The detection of deception in human speech is a difficult task but can be performed above chance level by human listeners even when only audio data is provided. Still, it is highly contested, which speech features could be used to help identify lies. In this study, we examined a set of phonetic and paralinguistic cues and their influence on the credibility of speech using an analysis-by-synthes...

متن کامل

Improvement in corpus-based generation of F0 contours using generation process model for emotional speech synthesis

2004

Keikichi Hirose

In our fully automatic corpus-based method of generating fundamental frequency (F0) contours for emotional speech synthesis, an improvement was realized related to the process of corpus preparation. The method assumes the generation process model and predicts its command parameters using binary regression trees with inputs of linguistic information of the sentence to be synthesized. Because of ...

متن کامل