Event Selection from Phone Posteriorgrams Using Matched Filters

Authors

  • Keith Kintzley
  • Aren Jansen
  • Hynek Hermansky
Abstract

In this paper, we address the issue of how to select a minimal set of phonetic events from a phone posteriorgram while minimizing the loss of information. We derive phone posteriorgrams from two sources, Gaussian mixture models and sparse multilayer perceptrons, and apply phone-specific matched filters to the posteriorgrams to yield a smaller set of phonetic events. We introduce a mutual-information-based performance measure to compare phonetic event selection techniques and demonstrate that events extracted using matched filters can reduce input data while significantly improving the performance of an event-based keyword spotting system.
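As a rough illustration of the idea of applying a matched filter to a single phone's posterior trajectory, the following Python sketch correlates the trajectory with a template and keeps only local maxima above a threshold as events. The abstract does not give the filter shapes, the threshold, or the peak-picking rule, so the function name `matched_filter_events`, the Gaussian template, and the threshold of 0.5 are all assumptions made for this example, not the authors' method.

```python
import numpy as np

def matched_filter_events(posterior, template, threshold=0.5):
    """Hypothetical helper: filter one phone's posterior trajectory with a
    template and return (event_frames, filtered_trajectory)."""
    x = np.asarray(posterior, dtype=float)
    h = np.asarray(template, dtype=float)
    # Assumption: scale the template to unit sum so the filtered output
    # stays on roughly the same 0..1 scale as the posteriors.
    h = h / h.sum()
    # Matched filtering is correlation of the signal with the template.
    y = np.correlate(x, h, mode="same")
    # Keep frames that are local maxima of the filtered output and exceed
    # the (assumed) threshold; the first and last frames are never selected.
    mid = y[1:-1]
    is_peak = (mid > y[:-2]) & (mid >= y[2:]) & (mid > threshold)
    return np.flatnonzero(is_peak) + 1, y

# Toy trajectory: two occurrences of one phone, centred at frames 15 and 40.
frames = np.arange(60)
toy_posterior = (0.9 * np.exp(-((frames - 15) ** 2) / 8.0)
                 + 0.9 * np.exp(-((frames - 40) ** 2) / 8.0))
# Assumed template: a 9-frame Gaussian bump of similar width.
toy_template = np.exp(-(np.arange(-4, 5) ** 2) / 8.0)

events, filtered = matched_filter_events(toy_posterior, toy_template)
print("event frames:", events.tolist())
print("frames above threshold before filtering:", int((toy_posterior > 0.5).sum()))
```

In this toy case each phone occurrence spans several frames with a high posterior, while the filtered trajectory yields a single event per occurrence, which is the kind of data reduction the abstract describes.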


Similar resources

IIIT-H SWS 2013: Gaussian Posteriorgrams of Bottle-Neck Features for Query-by-Example Spoken Term Detection

This paper describes the experiments conducted for spoken web search (SWS) at MediaEval 2013 evaluations. A conventional approach is to train a multi-layer perceptron using high resource languages and then use it in the low resource scenario. However, phone posteriorgrams have been found to under-perform when the language they were trained on differs from the target language. In this paper, we ...


SpeeD @ MediaEval 2015: Multilingual Phone Recognition Approach to Query by Example STD

In this paper, we attempt to solve the Spoken Term Detection (STD) problem for under-resourced languages by a phone recognition approach within the Automatic Speech Recognition (ASR) paradigm, with multilingual acoustic models from six languages (Albanian, Czech, English, Hungarian, Romanian and Russian). The Power Normalized Cepstral Coefficients (PNCC) features are used for improved robustnes...


Customer Event Rate Estimation Using Particle Filters

Estimating the rate at which events happen has been studied under various guises and in different settings. We are interested in the specific case of consumer-initiated events or transactions (credit/debit card transactions, mobile phone calls, online purchases, etc.), and the modeling of such behavior, in order to estimate the rate at which such transactions are made. In this paper, we detail a...


Phonotactic Language Identification for Singing

In the past decades, many successful approaches for language identification have been published. However, almost none of these approaches were developed with singing in mind. Singing has a lot of characteristics that differ from speech, such as a wider variance of fundamental frequencies and phoneme durations, vibrato, pronunciation differences, and different semantic content. We present a new ...


Personality Types as Correlate of Specific Phone Usages and Smart Phone Addiction among Students in the University of Ilorin, Kwara State, Nigeria

Past researchers have examined the impact of personality types on technology usage among students from varying angles. However, due to a dearth of studies on personality types and smart phone addiction among students in the Nigerian environment, this study investigated the relationship between personality types, smart phone addiction and specific phone usages among university students. A sample s...




Publication date: 2011