Minimum Description Length Principle in Supervised Learning with Application to Lasso

نویسندگان

Masanori Kawakita

Jun'ichi Takeuchi

چکیده

The minimum description length (MDL) principle in supervised learning is studied. One of the most important theories for the MDL principle is Barron and Cover’s theory (BC theory), which gives a mathematical justification of the MDL principle. The original BC theory, however, can be applied to supervised learning only approximately and limitedly. Though Barron et al. recently succeeded in removing a similar approximation in case of unsupervised learning, their idea cannot be essentially applied to supervised learning in general. To overcome this issue, an extension of BC theory to supervised learning is proposed. The derived risk bound has several advantages inherited from the original BC theory. First, the risk bound holds for finite sample size. Second, it requires remarkably few assumptions. Third, the risk bound has a form of redundancy of the two-stage code for the MDL procedure. Hence, the proposed extension gives a mathematical justification of the MDL principle to supervised learning like the original BC theory. As an important example of application, new risk and (probabilistic) regret bounds of lasso with random design are derived. The derived risk bound holds for any finite sample size n and feature number p even if n ≪ p without boundedness of features in contrast to the past work. Behavior of the regret bound is investigated by numerical simulations. We believe that this is the first extension of BC theory to general supervised learning with random design without approximation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pruning Fuzzy ARTMAP using the Minimum Description Length Principle in Learning from Clinical Databases

Fuzzy ARTMAP is one of the families of the neural network architectures bused on ART(Adaptive Resonance Theory) in which supervised learning can be curried out. However, it usually tends to create more categories than are actually needed. This often causes the so culled overfitting problem, namely the performunce of the networks in test set is not monotonically increasing with the additional tr...

متن کامل

Barron and Cover's Theory in Supervised Learning and its Application to Lasso

We study Barron and Cover’s theory (BC theory) in supervised learning. The original BC theory can be applied to supervised learning only approximately and limitedly. Though Barron & Luo (2008) and Chatterjee & Barron (2014a) succeeded in removing the approximation, their idea cannot be essentially applied to supervised learning in general. By solving this issue, we propose an extension of BC th...

متن کامل

Transfer Learning Using the Minimum Description Length Principle with a Decision Tree Application

Transfer learning is about how learning from one domain or a collection of domains can be applied to another. It is learning from similarities and parallels, from experience. This paper is about a distribution free, data driven, extendable framework for transfer learning, based on the minimum description length principle. We define transfer learning in terms of a specific framework, where we ha...

متن کامل

Bayesian Models to Assess Risk of Corruption of Federal Management Units

This paper presents a data mining project that generated Bayesian models to assess risk of corruption of federal management units. With thousands of extracted features related to corruptibility, the data were processed using techniques like correlation analysis and variance per class. We also compared two different discretization methods: Minimum Description Length Principle (MDLP) and Class-At...

متن کامل

Generalization vs. Discrimination in Learning

concept learning in animalsAnimal perceptual learningArtificial learning and Machine LearningCategorical learningClassification learningConcept formationConcept learningConcept learning of machinesLearning algorithms ReferencesFlach, P. (forthcoming). Machine Learning. The Art and Science of Algorithms that Make Sense of Data. CambridgeUniversity Press. M...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1607.02914 شماره

صفحات -

تاریخ انتشار 2016

Minimum Description Length Principle in Supervised Learning with Application to Lasso

نویسندگان

چکیده

منابع مشابه

Pruning Fuzzy ARTMAP using the Minimum Description Length Principle in Learning from Clinical Databases

Barron and Cover's Theory in Supervised Learning and its Application to Lasso

Transfer Learning Using the Minimum Description Length Principle with a Decision Tree Application

Bayesian Models to Assess Risk of Corruption of Federal Management Units

Generalization vs. Discrimination in Learning

عنوان ژورنال:

اشتراک گذاری