Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning

نویسندگان

چکیده

One of the key factors enabling machine learning models to comprehend and solve real-world tasks is leverage multimodal data. Unfortunately, annotation data challenging expensive. Recently, self-supervised methods that combine vision language were proposed learn representations without annotation. However, these often choose ignore presence high levels noise thus yield sub-optimal results. In this work, we show problem estimation for can be reduced a density task. Using estimation, propose building block representation based strictly on inherent correlation between different modalities. We demonstrate how our broadly integrated achieves comparable results state-of-the-art performance five benchmark datasets two tasks: Video Question Answering Text-To-Video Retrieval. Furthermore, provide theoretical probabilistic error bound substantiating empirical analyze failure cases. Code: https://github.com/elad-amrani/ssml.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

channel estimation for mimo-ofdm systems

تخمین دقیق مشخصات کانال در سیستم های مخابراتی یک امر مهم محسوب می گردد. این امر به ویژه در کانال های بیسیم با ‏خاصیت فرکانس گزینی و زمان گزینی شدید، چالش بزرگی است. مقالات متعدد پر از روش های مبتکرانه ای برای طراحی و آنالیز ‏الگوریتم های تخمین کانال است که بیشتر آنها از روش های خاصی استفاده می کنند که یا دارای عملکرد خوب با پیچیدگی ‏محاسباتی بالا هستند و یا با عملکرد نه چندان خوب پیچیدگی پایینی...

Self-Supervised Monocular Image Depth Learning and Confidence Estimation

Convolutional Neural Networks (CNNs) need large amounts of data with ground truth annotation, which is a challenging problem that has limited the development and fast deployment of CNNs for many computer vision tasks. We propose a novel framework for depth estimation from monocular images with corresponding confidence in a selfsupervised manner. A fully differential patch-based cost function is...

متن کامل

Semi-Supervised Least-Squares Conditional Density Estimation

---Conditional density estimation is an useful alternative to regression to learn an input-output relationship under multi-modality, asymmetry, and heteroscedasticity. The supervised learning method called least-squares conditional density estimation (LSCDE) is the state-of-the-art method that directly estimates the conditional density using a linear model. In this paper, we extend the supervis...

متن کامل

Decision Forests: A Unified Framework for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning

This review presents a unified, efficient model of random decision forests which can be applied to a number of machine learning, computer vision, and medical image analysis tasks. Our model extends existing forest-based techniques as it unifies classification, regression, density estimation, manifold learning, semisupervised learning, and active learning under the same decision forest framework...

متن کامل

Using Supervised Learning to Improve Monte Carlo Integral Estimation

Monte Carlo (MC) techniques are often used to estimate integrals of a multivariate function using randomly generated samples of the function. In light of the increasing interest in uncertainty quantification and robust design applications in aerospace engineering, the calculation of expected values of such functions (e.g. performance measures) becomes important. However, MC techniques often su ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i8.16822