Computational Music Audio Scene Analysis
Similar resources
Musical Sound Separation Based on Binary Time-Frequency Masking
The problem of overlapping harmonics is particularly acute in musical sound separation and has not been addressed adequately. We propose a monaural system based on binary time-frequency masking with an emphasis on robust decisions in time-frequency regions where harmonics from different sources overlap. Our computational auditory scene analysis system exploits the observation that sounds from t...
Improved monaural speech segregation based on computational auditory scene analysis
Considerable effort in Computational Auditory Scene Analysis (CASA) has gone into segregating target speech from monaural mixtures. Based on the principles of CASA, this article proposes an improved algorithm for monaural speech segregation. To extract the energy feature more accurately, the proposed algorithm improves the threshold selection for response energy in the initial segmentation stage. Since...
Singing Voice Separation Using Spectro-Temporal Modulation Features
This paper proposes an auditory-perception-inspired singing voice separation algorithm for monaural music recordings. Under the framework of computational auditory scene analysis (CASA), the music recordings are first transformed into auditory spectrograms. After extracting the spectral-temporal modulation contents of the time-frequency (T-F) units through a two-stage auditory model, we de...
Automatic Music Transcription and Audio Source Separation
In this article, we give an overview of a range of approaches to the analysis and separation of musical audio. In particular, we consider the problems of automatic music transcription and audio source separation, which are of particular interest to our group. Monophonic music transcription, where a single note is present at one time, can be tackled using an autocorrelation-based method. For p...
Audio scene analysis and scene change detection in the MPEG compressed domain
The use of audio to retrieve and index the associated video is a relatively new approach. This paper focuses on MPEG video. For indexing and retrieval, one needs to segment the audio stream associated with the video in terms of gender, speech, music, and speaker. This is called “audio scene” analysis. The paper discusses techniques for such analysis in the MPEG audio compressed domain.
Publication date: 2013