Automat Parsing of Audio Recordings. Testing Children with Dyslalia. Theoretical Background
نویسندگان
چکیده
In this paper we present our researches regarding automat parsing of audio recordings. These recordings are obtained from children with dyslalia and are necessary for an accurate identification of speech problems. We develop the ADM algorithm and we analyze the complexity of this solution. The main objectives of this task are: recording 120 children (60 with correct pronunciation and 60 with dyslalia) we must permit different audio environments during recording (some phonemes will be used for training a real time recognition system) the cost of recording devices and the children’s impact must be minimized after recording is necessary to split the stream into phonemes the speech therapist’s voice must be ignored We utilize a digital voice recorder in High Quality mode and with VCVA (Variable Control Voice Actuator) activated. The record format is IMA-ADPCM, 16KHz and 4bits (16 bits PCM). A microphone was placed at 10 cm from mouth in order to minimize environment noise. A software set of classes (C#) was created for handling audio stream (read, conversion between different format, write). We also propose an original solution for placing markers in audio stream. These markers are needed for a correct parsing af full recoding.
منابع مشابه
Software Package with Exercises for Therapy of Children with Dyslalia
In this paper we present a consistent set of exercises for children with dyslalia (dyslalia is a speech disorder that affect pronunciation of one ore many sounds). The achievement has gone from “Therapeutic Guide" made available by the team of researchers led by Professor Mrs. Iolanda TOBOLCEA from the "Alexandru Ioan Cuza" University of Iasi. The specifications of the "Therapeutic Guide" have ...
متن کاملPrevalence of dyslalias in 8 to 16 year-old students with anterior open bite in the municipality of Envigado, Colombia
BACKGROUND Anterior open bite AOB is the most common malocclusion associated with speech disorders and the literature has shown that problems of occlusion involve all oral functions. AOB not only produce aesthetic and occlusal problems for the patient and modifies the union of the lips, tongue, teeth, palate, palatal rugae and oropharynx, and thus affecting the ability to communicate well with ...
متن کاملAcoustical Characterization of Gunshots
This paper addresses several practical and theoretical issues encountered in the analysis of gunshot audio recordings. Gunshot recordings have the potential for both tactical detection and forensic evaluation. Such recordings can provide information about speed and trajectory of the projectile, the estimated location of the shooter, and in some cases the type of firearm and ammunition used. How...
متن کاملAutomatic Recognition of Dyslalia Affecting Pre-Scholars
This article describes the recognition part of a system that will be used for personalized therapy of dyslalia affecting pre scholars. Dyslalia is a speech disorder that affect pronunciation of one ore many sounds. The full system targets interdisciplinary research (computer science, psychology, electronics) having as main objective the development of methods, models, algorithms, System on Chip...
متن کاملDoes the recording medium influence phonetic transcription of cleft palate speech?
BACKGROUND In recent years, analyses of cleft palate speech based on phonetic transcriptions have become common. However, the results vary considerably among different studies. It cannot be excluded that differences in assessment methodology, including the recording medium, influence the results. AIMS To compare phonetic transcriptions from audio and audio/video recordings of cleft palate spe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1406.4879 شماره
صفحات -
تاریخ انتشار 2014