Harmonium Models for Video Classification

Authors

  • Jun Yang
  • Rong Yan
  • Yan Liu
  • Eric P. Xing

Abstract

Accurate and efficient video classification demands the fusion of multimodal information and the use of intermediate representations. Combining the two ideas into one framework, we propose a series of probabilistic models for video representation and classification using intermediate semantic representations derived from multimodal features of video. Building on a class of bipartite undirected graphical models named harmoniums, we propose the dual-wing harmonium (DWH) model, which represents video shots as latent semantic topics derived by jointly modeling the transcript keywords and color-histogram features of the data. Our family-of-harmonium (FoH) and hierarchical harmonium (HH) models extend DWH by introducing variables that represent the category labels of the data, which allows data representation and classification to be performed in the same model. Our models are among the few attempts to use undirected graphical models for representing and classifying video data. Experiments on a benchmark video collection show different semantic interpretations of video data under our models, as well as superior classification performance in comparison with several directed models. © 2008 Wiley Periodicals, Inc. Statistical Analy Data Mining 1: 23–37, 2008

Similar resources

Harmonium Models for Video Classification

Accurate and efficient video classification demands the fusion of multi-modal information and the use of intermediate representations. Combining the two ideas into one framework, we propose a series of probabilistic models for video representation and classification using intermediate semantic representations derived from multi-modal features of video. Based on a class of bipartite undirected g...

Harmonium Models for Semantic Video Representation and Classification

Accurate and efficient video classification demands the fusion of multimodal information and the use of intermediate representations. Combining the two ideas into the same framework, we propose a probabilistic approach for video classification using intermediate semantic representations derived from the multi-modal features. Based on a class of bipartite undirected graphical models named harmon...

Mining Associated Text and Images with Dual-Wing Harmoniums

We propose a multi-wing harmonium model for mining multimedia data that extends and improves on earlier models based on two-layer random fields, which capture bidirectional dependencies between hidden topic aspects and observed inputs. This model can be viewed as an undirected counterpart of the two-layer directed models such as LDA for similar tasks, but bears significant difference in inferen...

Automatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique

The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...

A Bayesian Framework for Learning Shared and Individual Subspaces from Multiple Data Sources

Journal title:
  • Statistical Analysis and Data Mining

Volume 1

Pages 23–37

Publication date: 2008