Chinese Speech Enhancement and Adaptive Recognition Technology for Complex Language Environments
نویسندگان
چکیده
The development of intelligent technology has also made rapid progress in relevant speech fields. In order to increase the application scenarios recognition systems, research improved traditional Speech enhancement algorithm, namely Ideal Binary Mask (IBM) and combined it with unimproved IBM algorithm propose an adaptive algorithm. Based on this built a new system, system uses FIR filter realize pre-emphasis processing Berouti spectral subtraction preprocess speech. model is using deep learning network model. results showed that had highest score Perceptual Evaluation Quality (PESQ) at 3.5596, followed by Ratio (IRM) 3.3429. improvement was feasible when noise intensity coefficient greater than 0.008. When 0.08, average 2.1079, 1.9418. proposed higher performance complex environments compared original system.
منابع مشابه
Speech enhancement strategy for speech recognition microcontroller under noisy environments
Industrial automation with speech control functions is generally installed with a speech recognition sensor which is used as an interface for users to articulate speech commands. However, recognition errors are likely to be produced when background noise surrounds the command spoken into the speech recognition microcontrollers. In this paper, a speech enhancement strategy is proposed to develop...
متن کاملLanguage-adaptive persian speech recognition
Development of robust spoken language technology ideally relies on the availability of large amounts of data preferably in the target domain and language. However, more often than not, speech developers need to cope with very little or no data, typically obtained from a different target domain. This paper focuses on developing techniques towards addressing this challenge. Specifically we consid...
متن کاملLanguage-independent and language-adaptive acoustic modeling for speech recognition
With the distribution of speech technology products all over the world, the portability to new target languages becomes a practical concern. As a consequence our research focuses on the question of how to port LVCSR systems in a fast and efficient way. More specifically we want to estimate acoustic models for a new target language using speech data from varied source languages, but only limited...
متن کاملLanguage independent and language adaptive large vocabulary speech recognition
This paper describes the design of a multilingual speech recognizer using an LVCSR dictation database which has been collected under the project GlobalPhone. This project at the University of Karlsruhe investigates LVCSR systems in 15 languages of the world, namely Arabic, Chinese, Croatian, English, French, German, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Swedish, Tamil, and Tu...
متن کاملLanguage Adaptive Multilingual CTC Speech Recognition
Recently, it has been demonstrated that speech recognition systems are able to achieve human parity. While much research is done for resource-rich languages like English, there exists a long tail of languages for which no speech recognition systems do yet exist. The major obstacle in building systems for new languages is the lack of available resources. In the past, several methods have been pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Asian and Low-Resource Language Information Processing
سال: 2023
ISSN: ['2375-4699', '2375-4702']
DOI: https://doi.org/10.1145/3608950