audio input flooding

نتایج جستجو برای: audio input flooding

تعداد نتایج: 298389 فیلتر نتایج به سال:

Cqt-based Convolutional Neural Networks for Audio Scene Classification

2016

Thomas Lidy Alexander Schindler

In this paper, we propose a parallel Convolutional Neural Network architecture for the task of classifying acoustic scenes and urban sound scapes. A popular choice for input to a Convolutional Neural Network in audio classification problems are Mel-transformed spectrograms. We, however, show in this paper that a ConstantQ-transformed input improves results. Furthermore, we evaluated critical pa...

متن کامل

Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation

Journal: :CoRR 2017

Emad M. Grais Hagen Wierstorf Dominic Ward Mark D. Plumbley

In deep neural networks with convolutional layers, each layer typically has fixed-size/single-resolution receptive field (RF). Convolutional layers with a large RF capture global information from the input features, while layers with small RF size capture local details with high resolution from the input features. In this work, we introduce novel deep multi-resolution fully convolutional neural...

متن کامل

Content Based Audio Retrieval Based on Hidden Markov Models Speech and Audio Processing and Recognition Final Project

2001

DAN ELLIS MANUEL REYES

This project consists in the implementation of a system that retrieves the five most similar audio files from an audio database when an audio file is presented as the input. I concentrated on indoor and outdoor environmental audio files. Audio is a very important kind of media that includes speech, music and various kinds of environmental noise. With the recent public access to different audio ...

متن کامل

Radio drum gesture detection system using only sticks, antenna and computer with audio interface

2006

Ben Nevile Peter F. Driessen W. Andrew Schloss

We present the results of our ongoing project researching a tighter coupling between computer and performer. The audio-input radio drum is presented, a simplification of the original apparatus that provides superior latency and resolution. Different demodulation schemes for the amplitude modulated input signals are discussed. Techniques to analyze gesture data are outlined, including eestimatio...

متن کامل

Evaluating Content Extraction from Audio Sources

1999

Lynette Hirschman John Burger David Palmer Patricia Robinson

This paper discusses evaluation of content extraction from audio sources. The most straightforward approach is to adapt existing methods for written sources to handle audio input. A transcription then becomes the representation of the audio source in written form; it must capture the word stream, but also other information that aids in decoding the overall structure and content of the audio sou...

متن کامل

Impact of Audio Auxiliary Input upon Incidental Vocabulary Acquisition in Foreign Journal Study

Journal: :Scholars International Journal of Linguistics and Literature 2020

متن کامل

Aided Electrophysiology Using Direct Audio Input: Effects of Amplification and Absolute Signal Level

Journal: :American Journal of Audiology 2016

متن کامل

Speed Sonic across the Span: a Platform Audio Game

2008

Michael Oren Chris Harding Terri Bonebright

We describe the sound design and initial user study of an audio game created for gamers with visual impairments. Despite the wild popularity of platform games such as Super Mario [1] and the development of many audio games over the past decade, the platform genre has so far been all but ignored by audio game designers. To fill this gap and to add to the limited entertainment choices visually im...

متن کامل

Extending voice-driven synthesis to audio mosaicing

2008

Jordi Janer Maarten de Boer

This paper presents a system for controlling audio mosaicing with a voice signal, which can be interpreted as a further step in voice-driven sound synthesis. Compared to voice-driven instrumental synthesis, it increases the variety in the synthesized timbre. Also, it provides a more direct interface for audio mosaicing applications, where the performer voice controls rhythmic, tonal and timbre ...

متن کامل

Visual Speech Enhancement

2017

Aviv Gabbay Asaph Shamir Shmuel Peleg

When video is shot in noisy environment, the voice of a speaker seen in the video can be enhanced using the visible mouth movements, reducing background noise. While most existing methods use audio-only inputs, improved performance is obtained with our visual speech enhancement, based on an audio-visual neural network. We add to the training data videos with synthetic background noise taken fro...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید