A Theoretical Analysis on Structured Learning with Noisy Data and its Applications

نویسنده

  • Yu Mo
چکیده

Abstract: Performances of supervised machine learning will be affected by noises of labeled data badly, which has been well studied by existing theories about learning with noisy data. However these theories only focus on two-class classification problems. In this paper we studied the relation between noise examples and their effects on structured learning. Firstly, we found that noises of labeled data will enlarge in structured learning problems, leading to a higher noise rate in training procedure than on labeled data. Existed theories did not consider the enlargement of noises in structured learning, thus underestimate the complexities of learning problems. Starting from the observation of enlargement of noises, this paper proposed a new theory on learning from noise data with structured predictions. Based on the theory, the concept of “effective size of training data” is proposed to describe the qualities of noisy training data sets in practice. We also analyze the situations when structured learning models will go back to lower order ones in applications. Experimental results are given to confirm the correctness of our theories as well as their practical values on cross-lingual projection and co-training.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sparse Structured Principal Component Analysis and Model Learning for Classification and Quality Detection of Rice Grains

In scientific and commercial fields associated with modern agriculture, the categorization of different rice types and determination of its quality is very important. Various image processing algorithms are applied in recent years to detect different agricultural products. The problem of rice classification and quality detection in this paper is presented based on model learning concepts includ...

متن کامل

Identifying the Dimensions and Components of Education based on Reverse Learning in the Elementary School

Purpose: The learning style of students is one of the important factors that help teachers prepare the conditions for students to learn. As a result, the purpose of this research was to identifying the dimensions and components of education based on reverse learning in the elementary school. Methodology: This study in terms of purpose was applied and in terms of implementation method was descr...

متن کامل

A study of viewpoints of English language instructors to motivate Lerner to learning English through curricular; representation of a Model

One of the problems of students’ entrance from secondary education to  university is lack of English language skills  and incentive to improve their learning. This research aims to identify the ways to strengthen English language skills with an emphasis on undergraduate students' motivation. This research is qualitative approach and Grounded theory strategy. The study population has been consis...

متن کامل

Factors Affecting Learning of Anatomy: Students' Viewpoints

Introduction: Anatomy is of outmost importance for medical students because of its application in medical practice. Students are a main source of information for quality and effectiveness of education and their views can be used to improve teaching and learning. This study was performed to determine factors affecting anatomy learning from the opinion of medical students of Semnan University of ...

متن کامل

Finding and Leveraging Structure in Learning Problems

The problem of learning from noisy and high dimensional data is an important challenge that has received much attention in the modern machine learning and statistics literature. These problems arise in numerous applications: large scale collaborative filtering, learning gene regulatory networks and genome wide association studies to name a few. This thesis focuses on understanding the statistic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013