frequency cepstral coefficient

An unsupervised acoustic fall detection system using source separation for sound interference suppression

Journal: :Signal Processing 2015

Muhammad Salman Khan Miao Yu Pengming Feng Liang Wang Jonathon A. Chambers

We present a novel unsupervised fall detection system that employs the collected acoustic signals (footstep sound signals) from an elderly person's normal activities to construct a data description model to distinguish falls from non-falls. The measured acoustic signals are initially processed with a source separation (SS) technique to remove the possible interferences from other background sou...

متن کامل

Comparative study of GMM, DTW, and ANN on Thai speaker identification system

2000

Chularat Tanprasert Varin Achariyakulporn

This paper proposes a new investigation on Gaussian mixture model (GMM) by comparing it with some preliminary experiments on multilayered perceptron network (MLP) with backpropagation learning algorithm (BKP) and dynamic time warping (DTW) techniques on Thai text-dependent speaker identification system. Three major identification engines are conducted on 50 speakers with isolated digits 0-9. Tr...

متن کامل

Disease Detection Using Analysis of Voice Parameters

2015

This paper investigates the adaptation of automatic speech recognition to disease detection by analyzing the voice parameters. The analysis of the voice allows the identification of the diseases which affect the vocal apparatus and currently is carried out from an expert doctor through methods based on the auditory analysis. This paper presents a novel method to keep track of patient’s patholog...

متن کامل

Short- and long-term dynamic features for robust speech recognition

2008

Takashi Fukuda Osamu Ichikawa Masafumi Nishimura

The short-term temporal information in speech is widely used for automatic speech recognition (ASR) systems in the form of dynamic features. Long-term temporal information has also been focused on recently and is used to complement traditional short-term features (typically from 25 to 100 ms). There are several approaches to represent long-term temporal information in ASR systems. However, thos...

متن کامل

Subband temporal modulation spectrum normalization for automatic speech recognition in reverberant environments

2009

Xugang Lu Masashi Unoki Satoshi Nakamura

Speech recognition in reverberant environments is still a challenge problem. In this paper, we first investigated the reverberation effect on subband temporal envelopes by using the modulation transfer function (MTF). Based on the investigation, we proposed an algorithm which normalizes the subband temporal modulation spectrum (TMS) to reduce the diffusion effect of the reverberation. During th...

متن کامل

Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition

Journal: :CoRR 2013

Md. Ali Hossain Md. Mijanur Rahman Uzzal Kumar Prodhan Md. Farukuzzaman Khan

This paper is concerned with the development of Back-propagation Neural Network for Bangla Speech Recognition. In this paper, ten bangla digits were recorded from ten speakers and have been recognized. The features of these speech digits were extracted by the method of Mel Frequency Cepstral Coefficient (MFCC) analysis. The mfcc features of five speakers were used to train the network with Back...

متن کامل

Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort

2013

Cemal Hanilçi Tomi Kinnunen Padmanabhan Rajan Jouni Pohjalainen Paavo Alku Figen Ertas

We study the problem of vocal effort mismatch in speaker verification. Changes in speaker’s vocal effort induce changes in fundamental frequency (F0) and formant structure which introduce unwanted intra-speaker variations to features. We compare seven alternative spectrum estimators in the context of melfrequency cepstral coefficient (MFCC) extraction for speaker verification. The compared vari...

متن کامل

An Analysis of the Performance Evaluation of Syllable Based Tamil Speech Recognition System

2017

A. Akila

Automatic Speech Recognition has been a goal of research for many decades. Many research works have been developed successfully for automatic speech recognition (ASR) of English language. ASR for European languages has not reached their height as ASR in English language. In this work, an implementation of Tamil based automatic speech Recognition System is developed. The ASR has many phases to p...

متن کامل

Acoustic Scene Classification Using Spatial Features

2017

Marc C. Green Damian Murphy

Due to various factors, the vast majority of the research in the field of Acoustic Scene Classification has used monaural or binaural datasets. This paper introduces EigenScape a new dataset of 4th-order Ambisonic acoustic scene recordings and presents preliminary analysis of this dataset. The data is classified using a standard Mel-Frequency Cepstral Coefficient Gaussian Mixture Model system, ...

متن کامل

Speaker recognition using pattern recognition neural network and feedforward neural network

2017

Neha Chauhan

Neha Chauhan Birla Institute of Technology, Mesra, Ranchi Abstract— Speaker Recognition is the computing task of validating a user’s claimed identity using speech characteristics. Main objective of speech recognition system is to communication with a device through our voice. Mel frequency Cepstral Coefficient (MFCC) features are combined with pitch and root mean square values and tested for im...

متن کامل