speaker transformation

نتایج جستجو برای: speaker transformation

تعداد نتایج: 242055 فیلتر نتایج به سال:

Hindi Speech Recognition and Online Speaker Adaptation

2011

Ganesh Sivaraman K Samudravijaya

Speaker Adaptation is a technique which is used to improve the recognition accuracy of Automatic Speech Recognition (ASR) systems. Here, we report a study of the impact of online speaker adaptation on the performance of a speaker independent, continuous speech recognition system for Hindi language. The speaker adaptation is performed using the Maximum Likelihood Linear Regression (MLLR) transfo...

متن کامل

Transformation of Vocal Characteristics: A Review of Literature

2012

Dong - Yan Huang Ee Ping Ong Susanto Rahardja Minghui Dong Haizhou Li

The transformation of vocal characteristics aims at modifying voice such that the intelligibility of aphonic voice is increased or the voice characteristics of a speaker (source speaker) to be perceived as if another speaker (target speaker) had uttered it. In this paper, the current state-of-the-art voice characteristics transformation methodology is reviewed. Special emphasis is placed on voi...

متن کامل

Incorporating MAP estimation and covariance transform for SVM based speaker recognition

2010

Cheung-Chi Leung Donglai Zhu Kong-Aik Lee Bin Ma Haizhou Li

In this paper, we apply Constrained Maximum a Posteriori Linear Regression (CMAPLR) transformation on Universal Background Model (UBM) when characterizing each speaker with a supervector. We incorporate the covariance transformation parameters into the supervector in addition to the mean transformation parameters. Maximum Likelihood Linear Regression (MLLR) covariance transformation is adopted....

متن کامل

Blind Stochastic Feature Transformation for Channel Robust Speaker Verification

Journal: :Journal of VLSI signal processing systems for signal, image and video technology 2006

متن کامل

Incorporating durational modification in voice transformation

2008

Arthur R. Toth Alan W. Black

Voice transformation is the process of using a small amount of speech data from a target speaker to build a transformation model that can be used to generate arbitrary speech that sounds like the target speaker. One common current technique is building Gausian Mixture Models to map spectral aspects from source to target speakers. This paper proposes the use of duration models to improve the tra...

متن کامل

A new multi-speaker formant synthesizer that applies voice conversion techniques

2001

Juana M. Gutiérrez-Arriola Juan Manuel Montero-Martínez José A. Vallejo Ricardo de Córdoba Rubén San-Segundo-Hernández José Manuel Pardo

We present a multi-speaker formant synthesizer based on parameter concatenation. The user can choose among three speakers, two males and one female. The synthesizer stores all the parameters for the basic speaker and linear transformation functions to synthesized the other two. The complete database for one speaker consists of 455 parameterized units (diphones, triphones,...) and the parameters...

متن کامل

Estimation of GMM in voice conver

2003

Helenca Duxans

Voice conversion consists in transforming a source speaker voice into a target speaker voice. There are many applications of voice conversion systems where the amount of training data from the source speaker and the target speaker is different. Usually, the amount of source data available is large, but it is desired to estimate the transformation with a small amount of target data. Systems base...

متن کامل

Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum

1997

Levent M. Arslan David Talkin

This paper presents a new scheme for developing a voice conversion system that modiies the utterance of a source speaker to sound like speech from a target speaker. We refer to the method as Speaker Transformation Algorithm using Segmen-tal Codebooks (STASC). Two new methods are described to perform the transformation of vocal tract and glottal excita-tion characteristics across speakers. In ad...

متن کامل

Similar Speaker Selection Technique Based on Distance Metric Learning with Perceptual Voice Quality Similarity

2012

Yusuke Ijima Mitsuaki Isogai Hideyuki Mizuno

This paper describes a similar speaker selection technique based on distance metric learning. Our aim is selection of a perceptually similar speaker using acoustic features from a multispeaker database. A novel point of the proposed technique is training a transform matrix using the perceptual voice quality similarity between many speakers obtained from a subjective evaluation to convert acoust...

متن کامل

Implications of glottal source for speaker and dialect identification

1999

Lisa Yanguas Thomas F. Quatieri

In this paper we explore the importance of speaker specific information carried in the glottal source. We time align utterances of two speakers speaking the same sentence from the TIMIT database of American English. We then extract the glottal flow derivative from each speaker and interchange them. Through time alignment and this glottal flow transformation, we can make a speaker of a northern ...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید