Research Proposal: Instance-Based Weighting Scheme in Ensemble Methods
ثبت نشده
چکیده
Recent research in classification problems has mostly concentrated on ensemble methods that construct a set of base classifiers instead of a single classifier. An unlabeled instance is then classified by taking a vote of the base classifiers’ predictions of its class label. Ensemble methods like Bagging and AdaBoost have been shown to outperform the individual base classifiers when the base inducer that produces the base classifiers is unstable. An inducer is said to be unstable if a slight change in the training examples results in a very different classifier being constructed. Further, the idea of ensemble methods also gives rise to the use of error correcting output code technique in enhancing accuracy in multi-class classification problems. In ensemble methods, the vote of each base classifier either receives the same weight (e.g. Bagging and Arcing) or is weighted according to the estimated error rate (e.g. AdaBoost). Based on the recent work of the PI, we propose an instance-based approach of assigning weights to the base classifiers that may enhance the performance of ensemble methods. Instead of assigning a weight to a base classifier that is fixed for all instances, we attempt to assign a weight that is based upon how well we think the base classifier is going to predict on the label of the test instance. Intuitively, given an unlabeled instance x, if there is some indication that the best classifier’s prediction is correct then we should increase its weight. Otherwise, we should reduce its weight. In our preliminary investigation (see Section C.3), on nine UC Irvine datasets where AdaBoost did not perform well, with accuracy between 70% to 80%, we manage to improve the prediction accuracy on all datasets with an average improvement of more than 1%. This suggests that such instance-based weighting scheme does enhance the accuracy of ensemble methods. The implementation is currently still quite crude. We believe it can be further fine-tuned to achieve an even higher accuracy. Section C.3 proposes some ideas that will be explored in this project.
منابع مشابه
A Novel Ensemble Approach for Anomaly Detection in Wireless Sensor Networks Using Time-overlapped Sliding Windows
One of the most important issues concerning the sensor data in the Wireless Sensor Networks (WSNs) is the unexpected data which are acquired from the sensors. Today, there are numerous approaches for detecting anomalies in the WSNs, most of which are based on machine learning methods. In this research, we present a heuristic method based on the concept of “ensemble of classifiers” of data minin...
متن کاملDiversity-Based Weighting Schemes for Clustering Ensembles
Clustering ensembles has been recently recognized as an emerging approach to provide more robust solutions to the data clustering problem. Current methods of clustering ensembles typically fall into instance-based, cluster-based, or hybrid approaches; however, most of such methods fail in discriminating among the various clusterings that participate to the ensemble. In this paper, we address th...
متن کاملFeature Ensemble Plus Sample Selection: Domain Adaptation for Sentiment Classification (Extended Abstract)
The domain adaptation problem arises often in the field of sentiment classification. There are two distinct needs in domain adaptation, namely labeling adaptation and instance adaptation. Most of current research focuses on the former one, while neglects the latter one. In this work, we propose a joint approach, named feature ensemble plus sample selection (SS-FE), which takes both types of ada...
متن کاملWeighted Ensemble Clustering for Increasing the Accuracy of the Final Clustering
Clustering algorithms are highly dependent on different factors such as the number of clusters, the specific clustering algorithm, and the used distance measure. Inspired from ensemble classification, one approach to reduce the effect of these factors on the final clustering is ensemble clustering. Since weighting the base classifiers has been a successful idea in ensemble classification, in th...
متن کاملFault Detection in Ring Based Smart LVDC Microgrid Using Ensemble of Decision Tree
In modern infrastructure, the demand for DC power-based appliances is rapidly increasing, and this phenomenon has created a positive impact on the acceptance of the DC microgrid. However, due to numerous issues such as the absence of zero crossing, bidirectional behaviour of sources, and different magnitudes of fault current during grid connected and islanded modes of operation, protecting DC m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002