Influence of Specific Voip Transmission Conditions on Speaker Recognition Problem
نویسنده
چکیده
The paper presents the problem of signal degradation in packet-based voice transmission and its influence on the voice recognition correctness. The Internet is evolving into universal communication network which carries all types of traffic including data, video and voice. Among them the Internet telephony, namely VoIP is going to be an application of a great importance and that is why it is so important to assess how specific conditions and distortions of the Internet transmission (speech coding and most of all packet loss and delay) can influence speaker recognition problem. The Gaussian Mixture Models classification, the feature extraction, the Internet speech transmission standards and the signal degradation methodology applied in the tested system were overviewed. The experiments carried out for two most commonly applied encoders (G.711 and G.723) and three network conditions (poor, average and with no packet loss) revealed a minor significance of the packet loss problem in the tested text-independent system.
منابع مشابه
Speaker recognition from encrypted VoIP communications
Most of the voice over IP (VoIP) traffic is encrypted prior to its transmission over the Internet. This makes the identity tracing of perpetrators during forensic investigations a challenging task since conventional speaker recognition techniques are limited to unencrypted speech communications. In this paper, we propose techniques for speaker identification and verification from encrypted VoIP...
متن کاملReviewing Speaker Recognition Influence in Strengthening VOIP Caller Authentication
This paper focuses on authentication issues and user ID detection depending on the voice signature of the incoming Voice over IP (VoIP) calls. Firstly, a complete speaker recognition system based on LPC feature extraction and decision based on Artificial Neural Network was described and built using MATLAB package. Then, MATLAB package was used to develop a simplified prototype version of the mo...
متن کاملA Bayesian network approach combining pitch and spectral envelope features to reduce channel mismatch in speaker verification and forensic speaker recognition
The aim of this paper is to reduce the effect of mismatch in recording conditions due to the transmission channel and recording device, using conditional dependencies of prosodic and spectral envelope features. The developed system is based on a Bayesian network framework which combines statistical models of the pitch and spectral envelope features. This approach is applied to forensic automati...
متن کاملForensic Automatic Speaker Recognition Using Bayesian Interpretation and Statistical Compensation for Mismatched Conditions
Nowadays, state-of-the-art automatic speaker recognition systems show very good performance in discriminating between voices of speakers under controlled recording conditions. However, the conditions in which recordings are made in investigative activities (e.g., anonymous calls and wire-tapping) cannot be controlled and pose a challenge to automatic speaker recognition. Differences in the phon...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006