Semantic Rank Reduction of Music Audio

نویسنده

  • Brian Whitman
چکیده

Audio understanding and classification tasks are often aided by a reduced dimensionality representation of the source observations. For example, a supervised learning system trained to detect the genre or artist of a piece of music performs better if the input nodes are statistically de-correlated, either to prevent overfitting in the learning process or to ‘anchor’ similar observations to cluster centroids in the observation space. We provide an alternate approach that decomposes audio observations of music into semantically significant dimensions where each resultant dimension corresponds to the perceived meaning of the audio, and only the most significant meanings (those which are most effective in describing music audio) are kept. We show a fundamentally unsupervised method to automatically obtain this decomposition and compare its performance in a music understanding task against statistical de-correlation approaches such as PCA and non-negative matrix factorization (NMF).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Music classification by low-rank semantic mappings

A challenging open question in music classification is which music representation (i.e., audio features) and which machine learning algorithm is appropriate for a specific music classification task. To address this challenge, given a number of audio feature vectors for each training music recording that capture the different aspects of music (i.e., timbre, harmony, etc.), the goal is to find a ...

متن کامل

Music Warehouses: Challenges for the Next Generation of Music Search Engines

Music Information Retrieval has received increasing attention from both the industrial and the research communities in recent years. Many audio extraction techniques providing content-based music information have been developed, sparking the need for intelligent storage and retrieval facilities. This paper proposes to satisfy this need by extending technology from business-oriented data warehou...

متن کامل

Large-Scale Music Annotation and Retrieval: Learning to Rank in Joint Semantic Spaces

Music prediction tasks range from predicting tags given a song or clip of audio, predicting the name of the artist, or predicting related songs given a song, clip, artist name or tag. That is, we are interested in every semantic relationship between the different musical concepts in our database. In realistically sized databases, the number of songs is measured in the hundreds of thousands or m...

متن کامل

A Music Recommendation System Based on Semantic Audio Segments Similarity

In this paper we propose a novel approach for contentbased music recommendation. The main innovation of the proposed technique consists of a similarity function that, instead of considering entire songs or their thumbnail representations, analyzes audio similarities between semantic segments from different audio tracks. The rationale of our idea is that a song similarity and recommendation tech...

متن کامل

Music Listening in the Future: Augmented Music-Understanding Interfaces and Crowd Music Listening

In the future, music listening can be more active, more immersive, richer, and deeper by using automatic music-understanding technologies (semantic audio analysis). In the first half of this invited talk, four Augmented Music-Understanding Interfaces that facilitate deeper understanding of music are introduced. In our interfaces, visualization of music content and music touch-up (customization)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003