TokyoTechCanon at TRECVID 2012

نویسندگان

  • Nakamasa Inoue
  • Yusuke Kamishima
  • Kotaro Mori
  • Koichi Shinoda
چکیده

We aim at developing a high-performance semantic indexing system using Gaussian-mixture-model (GMM) supervectors and tree-structured GMMs [1, 2, 3]. GMM supervectors corresponding to six types of audio and visual features are extracted from video shots. Tree-structured GMMs reduce the computational cost of maximum a posteriori (MAP) adaptation for estimating GMM parameters while keeping accuracy at high levels. This year, we introduce two new low-level features of HOG-Dense and LBP-Dense and video-clip scores. HOG-Dense and LBP-Dense are extracted from up to 100 frames per shot by using dense sampling. The video-clip score is defined as the maximum value of shot scores among all the shots in a video clip and is used for re-ranking video shots. Our best result was 32.10% in terms of Mean InfAP, which was ranked first over all semantic indexing runs in the full task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TokyoTechCanon at TRECVID 2013

We aim at developing a high-performance system using Gaussian-mixture-model (GMM) supervectors and tree-structured GMMs [6, 7, 8] for the semantic indexing task [1, 2, 3, 4]. GMM supervectors corresponding to six types of audio and visual features are extracted from video shots. Tree-structured GMMs reduce the computational cost of maximum a posteriori (MAP) adaptation for estimating GMM parame...

متن کامل

TRECVid 2012 Experiments at Dublin City University

Following previous participations in TRECVid, this year, the DCU-IAD team participated in four tasks of TRECVid 2012: Instance Search (INS), Interactive Known-Item Search (KIS), Multimedia Event Detection (MED) and Multimedia Event Recounting (MER).

متن کامل

Event detection: BJTU-SED at Trecvid 2012

In trecvid 2012, our team takes part in 2 event detection competition including embrace and pointing. We build two systems to recognize these events separately. For embracing, we use a probability accumulated method. For pointing, we use texture and silhouette. Different from the former works, the two systems are interactive systems and feedback strategy is used in the detection of events. In t...

متن کامل

ARTEMIS-UBIMEDIA at TRECVid 2011: Instance Search

This paper describes the approach proposed by ARTEMISUBIMEDIA team at TRECVID 2011, Instance Search (INS) task. The method is based on a semi-global image representation relying on an over-segmentation of the keyframes. An aggregation mechanism was then applied in order to group a set of sub-regions into an object similar to the query, under a global similarity criterion.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012