like utterance

Lexical-functional Grammar: a Formal System for Grammatical Representation. Robust Speech Repair

1995

A. Waibel B. Suhm P. Geutner T. Kemp T. Sloboda W. Ward M. Woszczyna

After locating and highlighting erroneous sections in the recognizer hypothesis, misrecognitions are corrected. The spoken hypothesis correction method uses N-Best lists for both the initial utterance and the respoken section. The N-Best list for the highlighted section of the initial utterance is rescored using scores from decoding the secondary utterance. Depending on the quality of the N-Be...


Keyword Spotting by Searching the Syllable Lattices

2000

Chia-Hsien LIN Hsiao-Chuan WANG

This paper presents a keyword spotting method based on searching a syllable lattice structure. The Mandarin syllables are represented by initial-final models. Using one-stage dynamic programming, an utterance is converted into a sequence of top-N candidate syllables, which yields a syllable lattice structure for the input utterance. A vocabulary of predefined keywords is represented as a set of sy...


Advancing methods for reliably assessing motivational interviewing fidelity using the motivational interviewing skills code.

Journal: Journal of Substance Abuse Treatment 2015

Sarah Peregrine Lord Doğan Can Michael Yi Rebeca Marin Christopher W Dunn Zac E Imel Panayiotis Georgiou Shrikanth Narayanan Mark Steyvers David C Atkins

The current paper presents novel methods for collecting MISC data and accurately assessing reliability of behavior codes at the level of the utterance. The MISC 2.1 was used to rate MI interviews from five randomized trials targeting alcohol and drug use. Sessions were coded at the utterance-level. Utterance-based coding reliability was estimated using three methods and compared to traditional ...


Analysis and Assessment of Controllability of an Expressive Deep Learning-Based TTS System

Journal: Informatics (Basel) 2021

In this paper, we study the controllability of an expressive TTS system trained on a dataset for continuous control. The dataset is based on the Blizzard 2013 audiobooks, read by a female speaker and containing great variability in styles and expressiveness. Controllability is evaluated with both an objective and a subjective experiment. The assessment measures the correlation between acoustic features and the dimensions of the latent space representi...


Towards Context-adaptive Utterance Interpretation

2002

Robert Porzel Iryna Gurevych


Examination of the Locus of Positional Effects on Children's Production of Plural -s: Considerations From Local and Global Speech Planning.

Journal: Journal of Speech, Language, and Hearing Research (JSLHR) 2015

Rachel M Theodore Katherine Demuth Stefanie Shattuck-Hufnagel

PURPOSE: Prosodic and articulatory factors influence children's production of inflectional morphemes. For example, plural -s is produced more reliably in utterance-final compared to utterance-medial position (i.e., the positional effect), which has been attributed to the increased planning time in utterance-final position. In previous investigations of plural -s, utterance-medial plurals were fo...


Modified re-synthesis of initial voiceless plosives by concatenation of speech from different speakers

2009

Sofia Strömbergsson

This paper describes a method of resynthesising utterance-initial voiceless plosives, given an original utterance by one speaker and a speech database of utterances by many other speakers. The system removes an initial voiceless plosive from an utterance and replaces it with another voiceless plosive selected from the speech database. (For example, if the original utterance was /tat/, the resyn...


Automatic Reconstruction of Utterance Boundaries Time Marks in Speech Database Re-grabbed from DAT Recorder

2005

Hynek Bořil

In this paper, an algorithm performing automatic reconstruction of utterance boundaries time marks in speech database re-grabbed from DAT recorder is presented. Originally, the database was grabbed from DAT and, after down-sampling, processed at 16 kHz. Utterance boundaries were manually found, each utterance was stored to a separate file and orthographic and phonetic transcriptions were perfor...


Utterance independent bimodal emotion recognition in spontaneous communication

Journal: EURASIP J. Adv. Sig. Proc. 2011

Jianhua Tao Shifeng Pan Minghao Yang Ya Li Kaihui Mu Jianfeng Che

Emotion expressions are sometimes mixed with utterance expressions in spontaneous face-to-face communication, which makes emotion recognition difficult. This article introduces methods for reducing the influence of the utterance on visual parameters in audio-visual emotion recognition. The audio and visual channels are first combined under a Multistream Hidden Markov Model (MH...


Automatic Utterance Segmentation in Spontaneous Speech

2014

Norimasa Yoshida Peter Gorniak

As applications incorporating speech recognition technology become widely used, it is desirable to have such systems interact naturally with their users. For such natural interaction to occur, recognition systems must be able to accurately detect when a speaker has finished speaking. This research presents an analysis combining lower and higher level cues to perform the utterance endpointing tas...
