SpeechDat-Car Fixed Platform
نویسندگان
چکیده
SpeechDat-Car aims to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. Two types of recordings compose the database. The first type consist of wideband audio signals recorded directly in the car while the second type is composed by GSM signals transmitted from the car and recorded simultaneously in a far-end. Therefore, two recording platforms were used, a 'mobile' recording platform installed inside the car and a 'fixed' recording platform located at the far-end fixed side of the GSM communications system. This paper describes the fixed platform software developed by the Universitat Politècnica de Catalunya (ADA-K). This software is able to work with standard inexpensive PC cards for ISDN lines.. The telephone server presented in this paper to automate the recording of speech databases was developed by the authors in the framework of the SpeechDat-Car EC project LE4-8334 [Moreno (2000)]. Automatic speech recognition (ASR) appears to be a particularly well-adapted technology for providing voice-based interfaces (based on hands-free mode) that will enable new in-car applications to develop while taking care of safety aspects. However, the car environment is known to be particularly noisy (street noise, car engine noise, vibration noises, bubble noise, etc...). To obtain an optimal performance for speech recognition, it is necessary to train the system on large corpora of speech data recorded in context (i.e. directly in the car). The European project SpeechDat-Car 1 aims at providing a set of uniform, coherent databases for nine European languages and for American English. SpeechDat-Car continues the success of the SpeechDat project in developing large-scale speech resources for a wide range of languages and for in-car applications (voice dialling, car accessories control, etc.). It will produce resources for participation of external partners to the original consortium is also possible. Siemens is an 'external' partner. It is also important to note that SpeechDat-Car commits itself to a strict validation protocol to ensure optimal quality and exchangeability of the databases. 1 SpeechDat-Car started in April 1998 in the 4th EC framework under project code LE4-8334 with a 30 months' project duration. 5HFRUGLQJJSODWIRUPV Two types of recordings compose the database. The first type consist of wideband audio signals recorded directly in the car and the second type is composed by a GSM signal transmitted from the car and recorded simultaneously in a far-end. Two recording platforms were used, a 'mobile' recording platform …
منابع مشابه
The speechdat-car multilingual speech databases for in-car applications: some first validation results
The main objective of SpeechDat-Car is to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. SpeechDat-Car started in April 1998 in the 4th EC framework under project code LE4-8334. The duration of the project is 30 months. Equivalent and similar resources for nine languages will be created: Danish, English, ...
متن کاملFirst experiences of the German speechdat-car database collection in mobile environments
In SpeechDat-Car, speech databases for speech driven devices and services for mobile environments are collected for nine European languages. The German SpeechDat-Car installation was the first fully equipped platform within the project. It has served as a testbed for the recording software for the entire project, and as an opportunity to perform technical and organizational feasibility tests fo...
متن کاملThe u.s. speechdat-car data collection
The SpeechDat-Car data collection effort is an ambitious effort to collect data from multiple languages in an in-car setting. This paper describes the U.S. data collection effort. We discuss problems we had implementing the collection procedure; and changes we made to improve the procedure. This paper should benefit future in-car data collections.
متن کاملSPEECHDAT-CAR. A Large Speech Database for Automotive Environments
The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...
متن کاملSpeechdat-car: Speech Databases for Voice Driven Teleservices and Control of In-car Applications
The SpeechDat-Car project included in the 4 framework of the European Community's Language Engineering Programme, started in April 1998 with a duration of 30 months. It is a common initiative of car manufacturers, telephone communications operators, companies active in voice operated services and Universities that aims at collecting a set of speech databases in nine different languages to suppo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000