نتایج جستجو برای: audio signal

تعداد نتایج: 477437  

2012
Shuhua Zhang Laurent Girin

In two previous papers, we proposed an audio Informed Source Separation (ISS) system which can achieve the separation of I > 2 musical sources from linear instantaneous stationary stereo (2-channel) mixtures, based on audio signal’s natural sparsity, pre-mix source signals analysis, and side-information embedding (within the mix signal). In the present paper and for the first time, we apply thi...

2011
Shuhua Zhang Laurent Girin

In two previous papers, we proposed an audio Informed Source Separation (ISS) system which can achieve the separation of I > 2 musical sources from linear instantaneous stationary stereo (2-channel) mixtures, based on audio signal’s natural sparsity, pre-mix source signals analysis, and side-information embedding (within the mix signal). In the present paper and for the first time, we apply thi...

2015
Tom Sercu Christian Puhrsch

One of the major challenges in speech recogntion or any other field, that concerns itself with structured predictions, is the alginment of two different sequences. The training data for training an RNN is a set of utterances, consisting of audio recorded via a regular microphone and the transcription of the spoken words. This transcription may either be in terms of phonemes or characters. It is...

Journal: :CoRR 2015
K. V. Vijay Girish T. V. Ananthapadmanabha A. G. Ramakrishnan

A dictionary learning based audio source classification algorithm is proposed to classify a sample audio signal as one amongst a finite set of different audio sources. Cosine similarity measure is used to select the atoms during dictionary learning. Based on three objective measures proposed, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights, a fram...

2014
Jagadeesh B. Kanade

Audio compression is the lossy compression technique of converting audio signal into an efficiently encoded bitstream that can be decoded to produce a close approximation of the original signal. For the purpose of improving the coding this work attempts to combine psychoacoustic model for perceptual evaluation of audio quality (PEAQ) in BS.1387 with perceptual audio coder. The implementation of...

2007
José Mario De Martino Fábio Violaro

In this paper we evaluate the effectiveness in conveying speech information of a speech synchronized facial animation system based on context-dependent visemes. The evaluation procedure is based on an oral speech intelligibility test conducted with, and without, supplementary visual information provided by a real and a virtual speaker. Three situations (audio-only, audio+video and audio+animati...

2006
Manuel Briand David Virette Nadine Martin

Low bit rate parametric coding of multichannel audio is mainly based on Binaural Cue Coding (BCC). Another multichannel audio processing method called upmix can also be used to deliver multichannel audio, typically 5.1 signals, at low data rates. More precisely, we focus on existing upmix method based on Principal Component Analysis (PCA). This PCA-based upmix method aims at blindly create a re...

2008
O. FAROOQ

In this paper, we propose the use of ‘chirp coding’ for embedding a watermark in audio data without generating any perceptual degradation of audio quality. A binary sequence (the watermark) is derived using energy based features from the audio signal and chirp coding used to embed the watermark in audio data. The chirp coding technique is such that the same watermark can be derived from the ori...

2011
Ren Gang Gregory Bocko Justin Lundberg Stephen Roessner Dave Headlam Mark F. Bocko

In this paper we propose a real-time signal processing framework for musical audio that 1) aligns the audio with an existing music score or creates a musical score by automated music transcription algorithms; and 2) obtains the expressive feature descriptors of music performance by comparing the score with the audio. Real-time audio segmentation algorithms are implemented to identify the onset ...

Journal: :IEEE transactions on neural networks 2009
Gianluca Monaci Pierre Vandergheynst Friedrich T. Sommer

A novel model is presented to learn bimodally informative structures from audio-visual signals. The signal is represented as a sparse sum of audio-visual kernels. Each kernel is a bimodal function consisting of synchronous snippets of an audio waveform and a spatio-temporal visual basis function. To represent an audio-visual signal, the kernels can be positioned independently and arbitrarily in...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید