audio system

Development of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study

Journal: CIT 2008

Janez Zibert, Bostjan Vesnicer, France Mihelic

A system for speaker tracking in broadcast-news audio data is presented, and the impact of the system's main components on the overall speaker-tracking performance is evaluated. The process of speaker tracking in continuous audio streams involves several processing tasks and is therefore treated as a multistage process. The main building blocks of such a system include the components for au...

Structural and Semantic Modeling of Audio for Content-Based Querying and Browsing

2006

Mustafa Sert, Buyurman Baykal, Adnan Yazici

A typical content-based audio management system deals with three aspects, namely audio segmentation and classification, audio analysis, and content-based retrieval of audio. In this paper, we integrate the three aspects of content-based audio management into a single framework and propose an efficient method for flexible querying and browsing of auditory data. More specifically, we utilize two r...

CS 224N Final Project: Speech Summarization for Rapid Playback

2008

Charles DuHadway

Searching audio streams is currently a tedious task. There exist no tools that allow a user to quickly scan through a long video lecture or audio book using only audio cues. In this paper I propose and build a simple system that alleviates the burden of searching through a large amount of audio data. This system uses concepts from other text and speech summarization work, but focuses on the prob...

Text Independent Human Voice Ranking System for Audio Search Engines Using Wavelet Features

2015

A. Jose Albin, N. M. Nandhitha

The performance of conventional text-based audio search engines can be improved with feature-based search engines. In this paper, a text-independent audio ranking system for audio engines with an audio signal as the query is proposed. Discrete Wavelet Transform (DWT) is used for feature extraction. Ranking is obtained using three different distance metrics, namely Euclidean distance, Manhattan distance and M...

A Hybrid ANN/HMM Audio-Visual Speech Recognition System

2001

Martin Heckmann

In this paper we present a system for audio-visual speech recognition based on a hybrid Artificial Neural Network/Hidden Markov Model (ANN/HMM) approach. To set up the system it was necessary to record a new audio-visual database. We will describe the recording and labeling of the database. The fusion of audio and video data is a key aspect of the paper. Three conditions, when only the audio or ...

A hybrid ANN/HMM audio-visual speech recognition system

2001

Martin Heckmann, Frédéric Berthommier, Kristian Kroschel

In this paper we present a system for audio-visual speech recognition based on a hybrid Artificial Neural Network/Hidden Markov Model (ANN/HMM) approach. To set up the system it was necessary to record a new audio-visual database. We will describe the recording and labeling of the database. The fusion of audio and video data is a key aspect of the paper. Three conditions, when only the audio or ...

Towards Automatic Music Structural Analysis: Identifying Characteristic Within-Song Excerpts in Popular Music

2005

Bee Suan Ong, Xavier Serra

Automatic audio content analysis is a general research area in which algorithms are developed to allow computer systems to understand the content of digital audio signals for further exploitation. The main focus therein is on practical applications for audio file management, like automatic labeling, efficient browsing, or the retrieval of relevant files with little effort from a big datab...

Computational Creativity

2010

Mary Lou Maher, Tony Veale, Rob Saunders, Oliver Bown

Soundscape composition is the creative practice of processing and combining sound recordings to evoke auditory associations and memories within a listener. We present Audio Metaphor, a system for creating novel soundscape compositions. Audio Metaphor processes natural language queries derived from Twitter for retrieving semantically linked sound recordings from online user-contributed audio dat...

Hear&There: An Augmented Reality System of Linked Audio

2000

Joseph Michael Rozier, Judith Donath, Arthur C. Smith

In this thesis, I designed a system for augmenting a space with linked audio. Using this system, individuals can associate audio clips with a location in real-world space. When an individual using the system passes through this augmented space, he or she can hear the audio clips that have been left by traveling through the associated locations. Furthermore, audio clips in the environment can be...

Musical Audio Synthesis Using Autoencoding Neural Nets

2014

Andy M. Sarroff, Michael A. Casey

With an optimal network topology and tuning of hyperparameters, artificial neural networks (ANNs) may be trained to learn a mapping from low-level audio features to one or more higher-level representations. Such artificial neural networks are commonly used in classification and regression settings to perform arbitrary tasks. In this work we suggest repurposing autoencoding neural networks as mu...
