speech problem

The Un-normalized Graph p-Laplacian based Semi-supervised Learning Method and Speech Recognition Problem

2017

Loc Tran Linh Tran

Speech recognition is the classical problem in pattern recognition research field. However, just a few graph based machine learning methods have been applied to this classical problem. In this paper, we propose the un-normalized graph p-Laplacian semi-supervised learning methods and these methods will be applied to the speech network constructed from the MFCC speech dataset to predict the label...

متن کامل

neural classifier ensemble using error-correcting output codes: access control application

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه شاهد - دانشکده فنی و مهندسی 1387

نیما حاتمی, saeed seyedtabaii,

abstract biometric access control is an automatic system that intelligently provides the access of special actions to predefined individuals. it may use one or more unique features of humans, like fingerprint, iris, gesture, 2d and 3d face images. 2d face image is one of the important features with useful and reliable information for recognition of individuals and systems based on this ...

بررسی برخی ویژگی های آکوستیک گفتار نوزاد مدار در مادران فارسی زبان

ژورنال: مجله علمی پژوهان 2014

سلطانی, مجید , صیحه ای, الهام, باغبان, کوثر , جوادی پور, شیوا , مرادی, نگین ,

Introduction: When adults talk to another person, linguistic characteristics of the listener will also be considered. A clear example of speech changes depending on the listener is maternal or infant directed speech. Infant directed speech is more slowly with longer sentences and pauses at the end of the utterance. Undoubtedly the most distinctive feature of this style of speech is acoustic c...

متن کامل

Speech intelligibility after repair of cleft lip and palate

Journal: Medical Journal of Islamic Republic of Iran 2017

Azadeh Safaiean, Elham Asleshirin, Mehran Hiradfar, Mona Ebrahimipour, Nahid Jalilevand,

    Background: Intelligibility refers to understandability of speech; and lack of it can negatively affect children’s overall communication effectiveness. Children with repaired cleft lip and/or cleft palate (CL/P) may experience poor speech intelligibility. This study aimed at evaluating speech intelligibility in children with repaired CL/P who had not been referred to sp...

متن کامل

The characterization of the relative information content by spectral features for the objective intelligibility assessment of nonlinearly processed speech

2010

Anton Schlesinger Marinus M. Boone

The objective intelligibility assessment of nonlinearly enhanced speech is a widely experienced problem. Nonlinear speech enhancement processors operate primarily on the low-level and transient components of speech. As these sections contain important acoustic cues as well as context-constitutive information, they dominate speech intelligibility. For that reason, shorttime intelligibility measu...

متن کامل

Computational Auditory Scene Analysis Exploiting Speech-recognition Knowledge

1997

Dan Ellis

The field of computational auditory scene analysis (CASA) strives to build computer models of the human ability to interpret sound mixtures as the combination of distinct sources. A major obstacle to this enterprise is defining and incorporating the kind of high level knowledge of real-world signal structure exploited by listeners. Speech recognition, while typically ignoring the problem of non...

متن کامل

Blind One-microphone Speech Separation: A Spectral Learning Approach

2004

Francis R. Bach Michael I. Jordan

We present an algorithm to perform blind, one-microphone speech separation. Our algorithm separates mixtures of speech without modeling individual speakers. Instead, we formulate the problem of speech separation as a problem in segmenting the spectrogram of the signal into two or more disjoint sets. We build feature sets for our segmenter using classical cues from speech psychophysics. We then ...

متن کامل

Speech Enhancement using Laplacian Mixture Model under Signal Presence Uncertainty

Journal: International Journal of Engineering 2014

JAVAD HADDADNIA, ZEINAB MOHAMMADPOORY,

In this paper an estimator for speech enhancement based on Laplacian Mixture Model has been proposed. The proposed method, estimates the complex DFT coefficients of clean speech from noisy speech using the MMSE estimator, when the clean speech DFT coefficients are supposed mixture of Laplacians and the DFT coefficients of noise are assumed zero-mean Gaussian distribution. Furthermore, the MMS...

متن کامل

A recursive algorithm for the forced alignment of very long audio segments

1998

Pedro J. Moreno Christopher F. Joerg Jean-Manuel Van Thong Oren Glickman

In this paper we address the problem of aligning very long (often more than one hour) audio files to their corresponding textual transcripts in an effective manner. We present an efficient recursive technique to solve this problem that works well even on noisy speech signals. The key idea of this algorithm is to turn the forced alignment problem into a recursive speech recognition problem with ...

متن کامل

Speech Compression

Journal: :Information 2016

Jerry D. Gibson

Speech compression is a key technology underlying digital cellular communications, VoIP, voicemail, and voice response systems. We trace the evolution of speech coding based on the linear prediction model, highlight the key milestones in speech coding, and outline the structures of the most important speech coding standards. Current challenges, future research directions, fundamental limits on ...

متن کامل