نتایج جستجو برای: like utterance
تعداد نتایج: 659663 فیلتر نتایج به سال:
After locating and highlighting erroneous sections in the recognizer hypothesis, misrecognitions are corrected. The spoken hypothesis correction method uses N-Best lists for both the initial utterance and the respo-ken section. The N-Best list for the highlighted section of the initial utterance is rescored using scores from decoding the secondary utterance. Depending on the quality of the N-Be...
This paper presents a keyword spotting method based on searching a syllable lattice structure. The Mandarin syllables are represented in initial-final models. By one-stage dynamic programming, an utterance is converted into a sequence of topN-candidate syllables. It comes out a syllable lattice structure for this input utterance. A vocabulary of predefined keywords is represented as a set of sy...
The current paper presents novel methods for collecting MISC data and accurately assessing reliability of behavior codes at the level of the utterance. The MISC 2.1 was used to rate MI interviews from five randomized trials targeting alcohol and drug use. Sessions were coded at the utterance-level. Utterance-based coding reliability was estimated using three methods and compared to traditional ...
In this paper, we study the controllability of an Expressive TTS system trained on a dataset for continuous control. The is Blizzard 2013 based audiobooks read by female speaker containing great variability in styles and expressiveness. Controllability evaluated with both objective subjective experiment. assessment measure correlation between acoustic features dimensions latent space representi...
PURPOSE Prosodic and articulatory factors influence children's production of inflectional morphemes. For example, plural -s is produced more reliably in utterance-final compared to utterance-medial position (i.e., the positional effect), which has been attributed to the increased planning time in utterance-final position. In previous investigations of plural -s, utterance-medial plurals were fo...
This paper describes a method of resynthesising utterance-initial voiceless plosives, given an original utterance by one speaker and a speech database of utterances by many other speakers. The system removes an initial voiceless plosive from an utterance and replaces it with another voiceless plosive selected from the speech database. (For example, if the original utterance was /tat/, the resyn...
In this paper, an algorithm performing automatic reconstruction of utterance boundaries time marks in speech database re-grabbed from DAT recorder is presented. Originally, the database was grabbed from DAT and, after down-sampling, processed at 16 kHz. Utterance boundaries were manually found, each utterance was stored to a separate file and orthographic and phonetic transcriptions were perfor...
Emotion expressions sometimes are mixed with the utterance expression in spontaneous face-to-face communication, which makes difficulties for emotion recognition. This article introduces the methods of reducing the utterance influences in visual parameters for the audio-visual-based emotion recognition. The audio and visual channels are first combined under a Multistream Hidden Markov Model (MH...
As applications incorporating speech recognition technology become widely used, it is desireable to have such systems interact naturally with its users. For such natural interaction to occur, recognition systems must be able to accurately detect when a speaker has finished speaking. This research presents an analysis combining lower and higher level cues to perform the utterance endpointing tas...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید