نتایج جستجو برای: audio signal
تعداد نتایج: 477437 فیلتر نتایج به سال:
During the fusion of audio and video information for speech recognition, the estimation of the reliability of the noise affected audio channel is crucial to get meaningful recognition results. In this paper we compare two types of reliability measures. One is the use of the statistics of the phoneme a-posteriori probabilities and the other is the analysis of the audio signal itself. We implemen...
In order to protect the digital audio and video products copyright in the network, an improved audio blind watermarking algorithm scheme based on discrete wavelet transform (DWT) and singular value decomposition (SVD) is proposed. In the algorithm, an original audio is split as blocks and each block is decomposed on discrete wavelet transform for two degree, then first quarter audio approximate...
Orthogonal information present in the video signal associated with the audio helps in improving the accuracy of a speech recognition system. Audio-visual speech recognition involves extraction of both the audio as well as visual features from the input signal. Extraction of visual parameters is done by the recognition of speech dependent features from the video sequence. This paper uses geometr...
The current study examines the temporal parameters associated with cross-modal integration of auditory-visual information for sentential material (Harvard/IEEE sentences). The speech signal was filtered into 1/3-octave channels, all of which were discarded (in the primary experiment) save for a low-frequency (298-375 Hz) and a high-frequency (4762-6000 Hz) band. The intelligibility of this audi...
This paper describes a method for estimating the amplitude characteristics of poles common to multiple room transfer functions from musical audio signals received by multiple microphones. Knowledge of these pole characteristics would make it easier to manipulate audio equalizers, since they correspond to the room resonance. It has been proven that an estimate of the poles can be calculated prec...
The study of impulse response technology allows audio engineers to attempt to recreate the tonal characteristics of a certain device. The purpose of this project was to recreate a sonic representation of analog microphone pre amplifiers and acoustic spaces through the use of impulse response technology. By creating these impulses, the student will attain the sonic characteristics of professiona...
Automatic speech recognition (ASR) enables very intuitive human-machine interaction. However, signal degradations due to reverberation or noise reduce the accuracy of audio-based recognition. The introduction of a second signal stream that is not affected by degradations in the audio domain (e.g., a video stream) increases the robustness of ASR against degradations in the original domain. Here,...
This paper presents the technique of embedding data in an audio signal by inserting low power tones and its robustness to noise and cropping of embedded speech samples. Experiments on the embedding procedure applied to cover audio utterances from noise-free TIMIT database and a noisy database demonstrate the feasibility of the technique in terms of imperceptible embedding, high data rate and ac...
We introduce a method for audio-integrity verification based on a combination of watermarking and fingerprinting. An audio fingerprint is a perceptual digest that holds content information of a recording and allows one to identify it from other recordings. Integrity verification is performed by embedding the fingerprint into the audio signal itself by means of a watermark. The original fingerpr...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید