نتایج جستجو برای: like utterance

تعداد نتایج: 659663  

1995
A. Waibel B. Suhm P. Geutner T. Kemp T. Sloboda W. Ward M. Woszczyna

After locating and highlighting erroneous sections in the recognizer hypothesis, misrecognitions are corrected. The spoken hypothesis correction method uses N-Best lists for both the initial utterance and the respo-ken section. The N-Best list for the highlighted section of the initial utterance is rescored using scores from decoding the secondary utterance. Depending on the quality of the N-Be...

2000
Chia-Hsien LIN Hsiao-Chuan WANG

This paper presents a keyword spotting method based on searching a syllable lattice structure. The Mandarin syllables are represented in initial-final models. By one-stage dynamic programming, an utterance is converted into a sequence of topN-candidate syllables. It comes out a syllable lattice structure for this input utterance. A vocabulary of predefined keywords is represented as a set of sy...

Journal: :Journal of substance abuse treatment 2015
Sarah Peregrine Lord Doğan Can Michael Yi Rebeca Marin Christopher W Dunn Zac E Imel Panayiotis Georgiou Shrikanth Narayanan Mark Steyvers David C Atkins

The current paper presents novel methods for collecting MISC data and accurately assessing reliability of behavior codes at the level of the utterance. The MISC 2.1 was used to rate MI interviews from five randomized trials targeting alcohol and drug use. Sessions were coded at the utterance-level. Utterance-based coding reliability was estimated using three methods and compared to traditional ...

Journal: :Informatics (Basel) 2021

In this paper, we study the controllability of an Expressive TTS system trained on a dataset for continuous control. The is Blizzard 2013 based audiobooks read by female speaker containing great variability in styles and expressiveness. Controllability evaluated with both objective subjective experiment. assessment measure correlation between acoustic features dimensions latent space representi...

Journal: :Journal of speech, language, and hearing research : JSLHR 2015
Rachel M Theodore Katherine Demuth Stefanie Shattuck-Hufnagel

PURPOSE Prosodic and articulatory factors influence children's production of inflectional morphemes. For example, plural -s is produced more reliably in utterance-final compared to utterance-medial position (i.e., the positional effect), which has been attributed to the increased planning time in utterance-final position. In previous investigations of plural -s, utterance-medial plurals were fo...

2009
Sofia Strömbergsson

This paper describes a method of resynthesising utterance-initial voiceless plosives, given an original utterance by one speaker and a speech database of utterances by many other speakers. The system removes an initial voiceless plosive from an utterance and replaces it with another voiceless plosive selected from the speech database. (For example, if the original utterance was /tat/, the resyn...

2005
Hynek Bořil

In this paper, an algorithm performing automatic reconstruction of utterance boundaries time marks in speech database re-grabbed from DAT recorder is presented. Originally, the database was grabbed from DAT and, after down-sampling, processed at 16 kHz. Utterance boundaries were manually found, each utterance was stored to a separate file and orthographic and phonetic transcriptions were perfor...

Journal: :EURASIP J. Adv. Sig. Proc. 2011
Jianhua Tao Shifeng Pan Minghao Yang Ya Li Kaihui Mu Jianfeng Che

Emotion expressions sometimes are mixed with the utterance expression in spontaneous face-to-face communication, which makes difficulties for emotion recognition. This article introduces the methods of reducing the utterance influences in visual parameters for the audio-visual-based emotion recognition. The audio and visual channels are first combined under a Multistream Hidden Markov Model (MH...

2014
Norimasa Yoshida Peter Gorniak

As applications incorporating speech recognition technology become widely used, it is desireable to have such systems interact naturally with its users. For such natural interaction to occur, recognition systems must be able to accurately detect when a speaker has finished speaking. This research presents an analysis combining lower and higher level cues to perform the utterance endpointing tas...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید