نتایج جستجو برای: audio input flooding

تعداد نتایج: 298389  

2015
Tom Sercu Christian Puhrsch

One of the major challenges in speech recogntion or any other field, that concerns itself with structured predictions, is the alginment of two different sequences. The training data for training an RNN is a set of utterances, consisting of audio recorded via a regular microphone and the transcription of the spoken words. This transcription may either be in terms of phonemes or characters. It is...

Journal: :journal of english language studies 0
dennis moradkhan assistant professor of applied linguistics, islamic azad university north tehran branch, iran behnaz jalayer ma in tefl, islamic azad university north tehran branch, iran

this study sought to investigate the impact of teaching speech acts and role relations through authentic audio-taped and video-taped materials on the level of iranian efl learners’ pragmatic competence. to this end, 55 intermediate learners were ed and randomly assigned to two experimental groups: the audio-taped group and the video-taped group. during the treatment, the audio-taped group recei...

2005
Hariharan Subramanian

Audio signal classification system analyzes the input audio signal and creates a label that describes the signal at the output. These are used to characterize both music and speech signals. The categorization can be done on the basis of pitch, music content, music tempo and rhythm. The signal classifier analyzes the content of the audio format thereby extracting information about the content fr...

Journal: :IEEE Trans. Speech and Audio Processing 2003
Christopher J. C. Burges John C. Platt Soumya Jana

Mapping audio data to feature vectors for the classification, retrieval or identification tasks presents four principal challenges. The dimensionality of the input must be significantly reduced; the resulting features must be robust to likely distortions of the input; the features must be informative for the task at hand; and the feature extraction operation must be computationally efficient. I...

2007
Ben P. Milner Ibrahim Almajai

The aim of this paper is to use visual speech information to create Wiener filters for audio speech enhancement. Wiener filters require estimates of both clean speech statistics and noisy speech statistics. Noisy speech statistics are obtained from the noisy input audio while obtaining clean speech statistics is more difficult and is a major problem in the creation of Wiener filters for speech ...

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه تربیت مدرس - دانشکده ادبیات و علوم انسانی 1390

مواد سمعی بصری غیر آموزشی و مشکلات شنیداری/ ادراکی زبان آموزان ایرانی استفاده از مواد سمعی بصری غیر آموزشی در کلاسهای آموزش زبان انگلیسی همیشه مسئله ای مورد مناقشه بین معلملان و اساتید زبان بوده است. بخش عمده این مخالفت ها به عدم وجود یک بررسی جامع از مولفه های زبانی و غیر زبانی موجود در این مواد بر می گردد. فقدان یک تحقیق کامل در زمینه ویژگی های غالب در این مواد و مشکلاتی که زبان آموزان با آ...

2007
Lorna M. Brown John Williamson

Shake2Talk is a new system for mobile messaging, using gestural input and non-speech audio and tactile output. Users can compose audio-tactile messages through simple gesture interactions and then send them to other users, enabling a range of different types of communication. This paper presents an overview of the Shake2Talk demonstration.

1998
Scott N. Levine

The purpose of this paper is to demonstrate a low bitrate audio coding algorithm that allows modi cations in the compressed domain. The input audio is segregated into three di erent representations: sinusoids, transients, and noise. Each representation can be individually quantized, and then easily be time-scaled and/or

Journal: :CoRR 2017
Keunwoo Choi George Fazekas Kyunghyun Cho Mark B. Sandler

Deep neural networks (DNN) have been successfully applied for music classification tasks including music tagging. In this paper, we investigate the effect of audio preprocessing on music tagging with neural networks. We perform comprehensive experiments involving audio preprocessing using different time-frequency representations, logarithmic magnitude compression, frequency weighting and scalin...

2013
Greg Surges Shlomo Dubnov

A system is described which uses the Audio Oracle algorithm for music analysis and machine improvisation. Some improvements on previous Factor Oracle-based systems are presented, including automatic model calibration based on measures fromMusic Information Dynamics, facilities for compositional structuring and automation, and an audio-based query mode which uses the input signal to influence th...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید