Semantic Rank Reduction of Music Audio
نویسنده
چکیده
Audio understanding and classification tasks are often aided by a reduced dimensionality representation of the source observations. For example, a supervised learning system trained to detect the genre or artist of a piece of music performs better if the input nodes are statistically de-correlated, either to prevent overfitting in the learning process or to ‘anchor’ similar observations to cluster centroids in the observation space. We provide an alternate approach that decomposes audio observations of music into semantically significant dimensions where each resultant dimension corresponds to the perceived meaning of the audio, and only the most significant meanings (those which are most effective in describing music audio) are kept. We show a fundamentally unsupervised method to automatically obtain this decomposition and compare its performance in a music understanding task against statistical de-correlation approaches such as PCA and non-negative matrix factorization (NMF).
منابع مشابه
Music classification by low-rank semantic mappings
A challenging open question in music classification is which music representation (i.e., audio features) and which machine learning algorithm is appropriate for a specific music classification task. To address this challenge, given a number of audio feature vectors for each training music recording that capture the different aspects of music (i.e., timbre, harmony, etc.), the goal is to find a ...
متن کاملMusic Warehouses: Challenges for the Next Generation of Music Search Engines
Music Information Retrieval has received increasing attention from both the industrial and the research communities in recent years. Many audio extraction techniques providing content-based music information have been developed, sparking the need for intelligent storage and retrieval facilities. This paper proposes to satisfy this need by extending technology from business-oriented data warehou...
متن کاملLarge-Scale Music Annotation and Retrieval: Learning to Rank in Joint Semantic Spaces
Music prediction tasks range from predicting tags given a song or clip of audio, predicting the name of the artist, or predicting related songs given a song, clip, artist name or tag. That is, we are interested in every semantic relationship between the different musical concepts in our database. In realistically sized databases, the number of songs is measured in the hundreds of thousands or m...
متن کاملA Music Recommendation System Based on Semantic Audio Segments Similarity
In this paper we propose a novel approach for contentbased music recommendation. The main innovation of the proposed technique consists of a similarity function that, instead of considering entire songs or their thumbnail representations, analyzes audio similarities between semantic segments from different audio tracks. The rationale of our idea is that a song similarity and recommendation tech...
متن کاملMusic Listening in the Future: Augmented Music-Understanding Interfaces and Crowd Music Listening
In the future, music listening can be more active, more immersive, richer, and deeper by using automatic music-understanding technologies (semantic audio analysis). In the first half of this invited talk, four Augmented Music-Understanding Interfaces that facilitate deeper understanding of music are introduced. In our interfaces, visualization of music content and music touch-up (customization)...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003