High-Level Audio Features: Distributed Extraction and Similarity Search

نویسندگان

François Deliège

Bee Yong Chua

Torben Bach Pedersen

چکیده

Today, automatic extraction of high-level audio features suffers from two main scalability issues. First, the extraction algorithms are very demanding in terms of memory and computation resources. Second, copyright laws prevent the audio files to be shared among computers, limiting the use of existing distributed computation frameworks and reducing the transparency of the methods evaluation process. The iSound Music Warehouse (iSoundMW), presented in this paper, is a framework to collect and query high-level audio features. It performs the feature extraction in a two-step process that allows distributed computations while respecting copyright laws. Using public computers, the extraction can be performed on large scale music collections. However, to be truly valuable, data management tools to search among the extracted features are needed. The iSoundMW enables similarity search among the collected high-level features and demonstrates its flexibility and efficiency by using a weighted combination of high-level features and constraints while showing good search performance results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection

Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...

متن کامل

Zhejiang University at TRECVID 2006

We participated in the high-level feature extraction and interactive-search task for TRECVID 2006. Interaction and integration of multi-modality media types such as visual, audio and textual data in video are the essence of video content analysis. Although any uni-modality type partially expresses limited semantics less or more, video semantics are fully manifested only by interaction and integ...

متن کامل

University of Central Florida at TRECVID 2007 Semantic Video Classification and Automatic Search

In this paper, we describe our approaches and experiments in semantic video classification (high-level features extraction) and fully automatic topic search tasks of TRECVID 2007. We designed a unified high-level features extraction framework. Two types of discriminative low level features, Spatial Pyramid Edge/Color Histograms and Bag of Visterms, are extracted from the key-frames of the shots...

متن کامل

Unsupervised learning of low-level audio features for music similarity estimation

While there is an enormous amount of music data available, the field of music analysis almost exclusively uses manually designed features. In this work we learn features from music data in a completely unsupervised way and evaluate them on a musical genre classification task. We achieve results very close to state-of-the-art performance which relies on highly hand-tuned feature extractors.

متن کامل

PicSOM Experiments in TRECVID 2008

Our experiments in TRECVID 2008 include participation in the high-level feature extraction, automatic search, video summarization, and video copy detection tasks, using a common system framework. In the high-level feature extraction task, we extended our last year’s experiments, which were based on SOM-based semantic concept modeling followed by a post-processing stage utilizing the concepts’ t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

High-Level Audio Features: Distributed Extraction and Similarity Search

نویسندگان

چکیده

منابع مشابه

A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection

Zhejiang University at TRECVID 2006

University of Central Florida at TRECVID 2007 Semantic Video Classification and Automatic Search

Unsupervised learning of low-level audio features for music similarity estimation

PicSOM Experiments in TRECVID 2008

عنوان ژورنال:

اشتراک گذاری