Using talker location to detect spurious utterances in desktop command and control

نویسنده

  • Devang Naik
چکیده

Hands-free desktop command and control speech recognition su ers from the critical drawback of improperly rejecting spurious conversation. This results in false acceptances of unintended speech commands that can inconvenience the user. A neural-network approach is proposed to detect spurious conversation by determining talker location. The approach is based on the premise that spoken utterances not directed towards the microphone source tend to be more reverberant and are likely to be spurious. The method estimates a con dence measure proportional to the amount of reverberation in the endpointed speech signal. The measure is obtained from a neural network that determines if the speech signal was directed to the microphone or was spoken otherwise. The proposed measure can be combined with the acoustic, linguistic and semantic information to improve upon decisions taken by conventional rejection modeling schemes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BotOnus: an online unsupervised method for Botnet detection

Botnets are recognized as one of the most dangerous threats to the Internet infrastructure. They are used for malicious activities such as launching distributed denial of service attacks, sending spam, and leaking personal information. Existing botnet detection methods produce a number of good ideas, but they are far from complete yet, since most of them cannot detect botnets in an early stage ...

متن کامل

Spatial and temporal modifications of multitalker speech can improve speech perception in older adults.

Speech perception in multitalker environments often requires listeners to divide attention among several concurrent talkers before focusing on one talker with pertinent information. Such attentionally demanding tasks are particularly difficult for older adults due both to age-related hearing loss (presbacusis) and general declines in attentional processing and associated cognitive abilities. Th...

متن کامل

The Location of Memory A Bakhtinian Reading of Bahram Beyzaie’s The Crow

This study is concerned with the use of specific tools from Mikhail Bakhtin’s comprehensive literary work in order to investigate the notion of time/space in Bahram Beyzaie’s 1979 movie, The Crow. Employing the Bakhtinian notion of chronotope in the analysis of the movie as a cinematic text proves helpful in developing the notion of anachronotopicity, which is then utilized to investigate the w...

متن کامل

Masquerade Detection Using a Taxonomy-Based Multinomial Modeling Approach in UNIX Systems

This paper presents one-class Hellinger distance-based and one-class SVM modeling techniques that use a set of features to reveal user intent. The specific objective is to model user command profiles and detect deviations indicating a masquerade attack. The approach aims to model user intent, rather than only modeling sequences of user issued commands. We hypothesize that each individual user w...

متن کامل

Talker-Specific Generalization of Pragmatic Inferences based on Under- and Over-Informative Prenominal Adjective Use

According to Grice's (1975) Maxim of Quantity, rational talkers formulate their utterances to be as economical as possible while conveying all necessary information. Naturally produced referential expressions, however, often contain more or less information than what is predicted to be optimal given a rational speaker model. How do listeners cope with these variations in the linguistic input? W...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997