Event Selection from Phone Posteriorgrams Using Matched Filters
نویسندگان
چکیده
In this paper we address the issue of how to select a minimal set of phonetic events from a phone posteriorgram while minimizing the loss of information. We derive phone posteriorgrams from two sources, Gaussian mixture models and sparse multilayer perceptrons, and apply phone-specific matched filters to the posteriorgrams to yield a smaller set of phonetic events. We introduce a mutual information based performance measure to compare phonetic event selection techniques and demonstrate that events extracted using matched filters can reduce input data while significantly improving performance of an event-based keyword spotting system.
منابع مشابه
IIIT-H SWS 2013: Gaussian Posteriorgrams of Bottle-Neck Features for Query-by-Example Spoken Term Detection
This paper describes the experiments conducted for spoken web search (SWS) at MediaEval 2013 evaluations. A conventional approach is to train a multi-layer perceptron using high resource languages and then use it in the low resource scenario. However, phone posteriorgrams have been found to under-perform when the language they were trained on differs from the target language. In this paper, we ...
متن کاملSpeeD @ MediaEval 2015: Multilingual Phone Recognition Approach to Query by Example STD
In this paper, we attempt to solve the Spoken Term Detection (STD) problem for under-resourced languages by a phone recognition approach within the Automatic Speech Recognition (ASR) paradigm, with multilingual acoustic models from six languages (Albanian, Czech, English, Hungarian, Romanian and Russian). The Power Normalized Cepstral Coefficients (PNCC) features are used for improved robustnes...
متن کاملCustomer Event Rate Estimation Using Particle Filters
Estimating the rate at which events happen has been studied under various guises and in different settings. We are interested in the specific case of consumerinitiated events or transactions (credit/debit card transactions, mobile phone calls, online purchases, etc.), and the modeling of such behavior, in order to estimate the rate at which such transactions are made. In this paper, we detail a...
متن کاملPhonotactic Language Identification for Singing
In the past decades, many successful approaches for language identification have been published. However, almost none of these approaches were developed with singing in mind. Singing has a lot of characteristics that differ from speech, such as a wider variance of fundamental frequencies and phoneme durations, vibrato, pronunciation differences, and different semantic content. We present a new ...
متن کاملPersonality Types as Correlate of Specific Phone Usages and Smart Phone Addiction among Students in the University of Ilorin, Kwara State, Nigeria
Past researchers have examined the impact of personality types on technology usage among students from varying angles. However, due to dearth of studies on personality types and smart phone addiction among students in the Nigerian environment, this study investigated the relationship between personality types, smart phone addiction and specific phone usages among University students. A sample s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011