نتایج جستجو برای: utterance
تعداد نتایج: 6799 فیلتر نتایج به سال:
In this paper, we study the controllability of an Expressive TTS system trained on a dataset for continuous control. The is Blizzard 2013 based audiobooks read by female speaker containing great variability in styles and expressiveness. Controllability evaluated with both objective subjective experiment. assessment measure correlation between acoustic features dimensions latent space representi...
PURPOSE Prosodic and articulatory factors influence children's production of inflectional morphemes. For example, plural -s is produced more reliably in utterance-final compared to utterance-medial position (i.e., the positional effect), which has been attributed to the increased planning time in utterance-final position. In previous investigations of plural -s, utterance-medial plurals were fo...
This paper describes a method of resynthesising utterance-initial voiceless plosives, given an original utterance by one speaker and a speech database of utterances by many other speakers. The system removes an initial voiceless plosive from an utterance and replaces it with another voiceless plosive selected from the speech database. (For example, if the original utterance was /tat/, the resyn...
In this paper, an algorithm performing automatic reconstruction of utterance boundaries time marks in speech database re-grabbed from DAT recorder is presented. Originally, the database was grabbed from DAT and, after down-sampling, processed at 16 kHz. Utterance boundaries were manually found, each utterance was stored to a separate file and orthographic and phonetic transcriptions were perfor...
Emotion expressions sometimes are mixed with the utterance expression in spontaneous face-to-face communication, which makes difficulties for emotion recognition. This article introduces the methods of reducing the utterance influences in visual parameters for the audio-visual-based emotion recognition. The audio and visual channels are first combined under a Multistream Hidden Markov Model (MH...
As applications incorporating speech recognition technology become widely used, it is desireable to have such systems interact naturally with its users. For such natural interaction to occur, recognition systems must be able to accurately detect when a speaker has finished speaking. This research presents an analysis combining lower and higher level cues to perform the utterance endpointing tas...
The prevalent state of the art in spoken language understanding by spoken dialog systems is both modular and whole-utterance. It is modular in that incoming utterances are processed by independent components that handle different aspects, such as acoustics, syntax, semantics, and intention / goal recognition. It is whole-utterance in that each component completes its work for an entire utteranc...
The prevalent state of the art in spoken language understanding by spoken dialog systems is both modular and whole-utterance. It is modular in that incoming utterances are processed by independent components that handle different aspects, such as acoustics, syntax, semantics, and intention / goal recognition. It is whole-utterance in that each component completes its work for an entire utteranc...
8 speakers of American English produced utterances consisting of one to five disyllables ([bábe] or [pápe]). Vowel and stop closure intervals were defined by variations in supraglottal pressure, sensed through a thin tube inserted in the mouth. Closure was always longer for /p/ than /b/ in utterance-medial positions. In utterance-initial position, however, /b/ lengthened more than /p/ so that n...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید