Semi-supervised Learning for Sentiment Classification

نویسنده

  • Bishan Yang
چکیده

With the growing need of identifying opinions and sentiments automatically from online text data, sentiment classification tasks have received considerable attention recently. One can treat sentiment classification as a text classification problem, however, it is very time-consuming and somewhat impractical to acquire enough labeled data to train a good sentiment classifier. This paper investigates a semisupervised learning method for sentiment classification which can take advantage of large amounts of unlabeled data. Specifically, we learn some structural information from both labeled and unlabeled data to form a good feature mapping for the sentiment classification tasks, and then bootstrap the learner to improve the classification performance. We present an empirical study on two different sentiment classification tasks which indicates the proposed method can make good used of unlabeled data and improve the classification performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Active Deep Networks for Semi-Supervised Sentiment Classification

This paper presents a novel semisupervised learning algorithm called Active Deep Networks (ADN), to address the semi-supervised sentiment classification problem with active learning. First, we propose the semi-supervised learning method of ADN. ADN is constructed by Restricted Boltzmann Machines (RBM) with unsupervised learning using labeled data and abundant of unlabeled data. Then the constru...

متن کامل

Sentiment Analysis by Augmenting Expectation Maximisation with Lexical Knowledge

Sentiment analysis of documents aims to characterise the positive or negative sentiment expressed in documents. It has been formulated as a supervised classification problem, which requires large numbers of labelled documents. Semi-supervised sentiment classification using limited documents or words labelled with sentiment-polarities are approaches to reducing labelling cost for effective learn...

متن کامل

Semi-Supervised Learning for Imbalanced Sentiment Classification

Various semi-supervised learning methods have been proposed recently to solve the long-standing shortage problem of manually labeled data in sentiment classification. However, most existing studies assume the balance between negative and positive samples in both the labeled and unlabeled data, which may not be true in reality. In this paper, we investigate a more common case of semi-supervised ...

متن کامل

Semi-Stacking for Semi-supervised Sentiment Classification

In this paper, we address semi-supervised sentiment learning via semi-stacking, which integrates two or more semi-supervised learning algorithms from an ensemble learning perspective. Specifically, we apply metalearning to predict the unlabeled data given the outputs from the member algorithms and propose N-fold cross validation to guarantee a suitable size of the data for training the meta-cla...

متن کامل

Sentiment Classification in Under-Resourced Languages Using Graph-Based Semi-Supervised Learning Methods

In sentiment classification, conventional supervised approaches heavily rely on a large amount of linguistic resources, which are costly to obtain for under-resourced languages. To overcome this scarce resource problem, there exist several methods that exploit graph-based semisupervised learning (SSL). However, fundamental issues such as controlling label propagation, choosing the initial seeds...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010