Chinese Speech Enhancement and Adaptive Recognition Technology for Complex Language Environments

نویسندگان

چکیده

The development of intelligent technology has also made rapid progress in relevant speech fields. In order to increase the application scenarios recognition systems, research improved traditional Speech enhancement algorithm, namely Ideal Binary Mask (IBM) and combined it with unimproved IBM algorithm propose an adaptive algorithm. Based on this built a new system, system uses FIR filter realize pre-emphasis processing Berouti spectral subtraction preprocess speech. model is using deep learning network model. results showed that had highest score Perceptual Evaluation Quality (PESQ) at 3.5596, followed by Ratio (IRM) 3.3429. improvement was feasible when noise intensity coefficient greater than 0.008. When 0.08, average 2.1079, 1.9418. proposed higher performance complex environments compared original system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement strategy for speech recognition microcontroller under noisy environments

Industrial automation with speech control functions is generally installed with a speech recognition sensor which is used as an interface for users to articulate speech commands. However, recognition errors are likely to be produced when background noise surrounds the command spoken into the speech recognition microcontrollers. In this paper, a speech enhancement strategy is proposed to develop...

متن کامل

Language-adaptive persian speech recognition

Development of robust spoken language technology ideally relies on the availability of large amounts of data preferably in the target domain and language. However, more often than not, speech developers need to cope with very little or no data, typically obtained from a different target domain. This paper focuses on developing techniques towards addressing this challenge. Specifically we consid...

متن کامل

Language-independent and language-adaptive acoustic modeling for speech recognition

With the distribution of speech technology products all over the world, the portability to new target languages becomes a practical concern. As a consequence our research focuses on the question of how to port LVCSR systems in a fast and efficient way. More specifically we want to estimate acoustic models for a new target language using speech data from varied source languages, but only limited...

متن کامل

Language independent and language adaptive large vocabulary speech recognition

This paper describes the design of a multilingual speech recognizer using an LVCSR dictation database which has been collected under the project GlobalPhone. This project at the University of Karlsruhe investigates LVCSR systems in 15 languages of the world, namely Arabic, Chinese, Croatian, English, French, German, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Swedish, Tamil, and Tu...

متن کامل

Language Adaptive Multilingual CTC Speech Recognition

Recently, it has been demonstrated that speech recognition systems are able to achieve human parity. While much research is done for resource-rich languages like English, there exists a long tail of languages for which no speech recognition systems do yet exist. The major obstacle in building systems for new languages is the lack of available resources. In the past, several methods have been pr...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Asian and Low-Resource Language Information Processing

سال: 2023

ISSN: ['2375-4699', '2375-4702']

DOI: https://doi.org/10.1145/3608950