نتایج جستجو برای: mel frequency cel cepstrum mfcc

تعداد نتایج: 490625  

Journal: :JTRM 2022

Jatuh merupakan masalah kesehatan utama di seluruh dunia, terutama dalam dunia karena pasien jatuh terparah yang terus terjadi. Kebanyakan dari tempat tidur tidak disaksikan. Hal ini diperparah dengan berbagai bisa diakibatkan oleh jatuh. Tetap lantai setelah dapat menyebabkan trauma, cedera serius, dan bahkan kematian. Oleh itu, diperlukan sistem pendeteksi agar orang segera diberikan pertolon...

2003
Li Tan Montri Karnjanadecha

This paper describes the principle of MFCC feature extraction and the knowledge of human auditory masking effect in order to introduce a modified-MFCC feature extraction that can improve the robustness of speech recognition systems.

1999
Algimantas Rudzionis Vytautas Rudzionis

Speaker independent discrimination of four confusable consonants in the strictly fixed context of six vowels is considered. The consonants are depicted by features of consonant’s stationary part and changing rate of features (delta features) in transition from consonant to the following vowel. The mel frequency cepstrum (MFCC), linear prediction cepstrum (LPCC), recursive filter (F12) features ...

2007
Jón Guðnason

Voice source analysis and modelling has played a key role in important speech applications such as speech recognition, speech synthesis and speaker recognition. This work presents a robust algorithm for glottal closure detection and a novel set of voice source features for speaker recognition. In the rst part of the dissertation the DYPSA algorithm is developed for detecting glottal closure ins...

Journal: :Computer Speech & Language 2015
Jimmy Ludeña-Choez Ascensión Gallardo-Antolín

In this paper, we propose a new front-end for Acoustic Event Classification tasks (AEC). First, we study the spectral characteristics of different acoustic events in comparison with the structure of speech spectra. Second, from the findings of this study, we propose a new parameterization for AEC, which is an extension of the conventional Mel Frequency Cepstrum Coefficients (MFCC) and is based ...

2012
Clifford Loh Ting Yuan Dzati Athiar Ramli

Physiological research reported that certain frog species contain antimicrobial substances which is potentially and beneficial in overcoming certain health problem. As a result, there is an imperative need for an automated frog species identification to assist people in physiological research in detecting and localizing certain frog species. This project aims to develop a frog sound identificat...

2004
Hyoung-Gook Kim Thomas Sikora

In this paper, we present a classification and retrieval technique targeted for retrieval of home video abstract using dimension-reduced, decorrelated spectral features of audio content. The feature extraction based on MPEG-7 descriptors consists of three main stages: Normalized Audio Spectrum Envelope (NASE), basis decomposition algorithm and basis projection, obtained by multiplying the NASE ...

Journal: :J. Information Security 2012
Alfredo Maesa Fabio Garzia Michele Scarpiniti Roberto Cusani

The aim of this paper is to show the accuracy and time results of a text independent automatic speaker recognition (ASR) system, based on Mel-Frequency Cepstrum Coefficients (MFCC) and Gaussian Mixture Models (GMM), in order to develop a security control access gate. 450 speakers were randomly extracted from the Voxforge.org audio database, their utterances have been improved using spectral sub...

2013
Nidhi Srivastava

The most common mode of communication between humans is speech. As this is the most preferred way, humans would like to use speech to interact with machines also. That is why, automatic speech recognition has gained a lot of popularity. Many approaches for speech recognition exist like Dynamic Time Warping (DTW), Hidden Markov Model (HMM). This paper shows how Neural Network (NN) can be used fo...

2010
Jia Min Karen Kua Tharmarajah Thiruvaran Mohaddeseh Nosratighods Eliathamby Ambikairajah Julien Epps

Most conventional features used in speaker recognition are based on spectral envelope characterizations such as Mel-scale filterbank cepstrum coefficients (MFCC), Linear Prediction Cepstrum Coefficient (LPCC) and Perceptual Linear Prediction (PLP). The MFCC’s success has seen it become a de facto standard feature for speaker recognition. Alternative features, that convey information other than ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید