Psychobiological Responses Reveal Audiovisual Noise Differentially Challenges Speech Recognition

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Audiovisual speech processing in visual speech noise

When the talker’s face (visual speech) can be seen, speech perception is both facilitated (for congruent visual speech) and interfered with (for incongruent visual speech). The current study investigated whether the degree of these visual speech effects was affected by the presence of an additional irrelevant talking face. In the experiment, auditory speech targets (vCv syllables) were presente...

متن کامل

Modeling of audiovisual speech perception in noise

We present three models of audiovisual speech perception at varying signal-to-noise ratios (SNR). The first model is Massaro’s Fuzzy Logical Model of Perception (FLMP) applied at each SNR. The second model imposes the constraint that the visual response probabilities are the same regardless of the SNR. Both models describe the data well. Root Mean Squared Error (RMSE) corrected for the numbers ...

متن کامل

Scale Based Features for Audiovisual Speech Recognition

This paper demonstrates the use of nonlinear image decomposition, in the form of a sieve, applied to the task of audiovisual speech recognition of a database of the letters A–Z for ten talkers. A scale based feature vector is formed directly from the grayscale pixels of an image containing the talkers mouth on a per frame basis. This is independent of image amplitude and position information an...

متن کامل

End-to-end Audiovisual Speech Recognition

Several end-to-end deep learning approaches have been recently presented which extract either audio or visual features from the input images or audio signals and perform speech recognition. However, research on end-to-end audiovisual models is very limited. In this work, we present an end-toend audiovisual model based on residual networks and Bidirectional Gated Recurrent Units (BGRUs). To the ...

متن کامل

Audiovisual Speech: Analysis, Synthesis, Perception, and Recognition

In many cases research in the fields of audiovisual speech analysis, synthesis, perception and (automatic) recognition is carried out separately with only limited account for the neighboring areas. But the author claims that these neighboring areas yield huge, currently idle potential to improve and better understand the field under investigation and that human speech as a phenomenon should be ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Ear & Hearing

سال: 2019

ISSN: 0196-0202

DOI: 10.1097/aud.0000000000000755