Information Fusion based on Multiplicity of Data Preprocessing Boosts the AdaBoost Classifier
نویسنده
چکیده
The paper considers the possibility of boosting efficiency of the AdaBoost algorithm by introducing diversity in the information extraction procedures and subsequent information fusion. A given raw data set is transformed by two well known methods known as vector autoscaling and dimensional autoscaling. These methods transform data by mean-centering and variancenormalization with respect to the measurement variables and the measured samples, respectively. The feature (information) extraction is implemented by the principal component analysis. Final data representation is obtained by two alternate fusion strategies. In one, feature extraction is done immediately after the scaling transformations and the two feature sets are fused by a simple procedure of concatenation. In the other, data vectors obtained by the two scaling transformations are first fused by simple concatenation and then the feature extraction is done. The data representation is thus by fused feature vectors of dimensionality twice that of the original data. The AdaBoost algorithm by Freund and Schapire is implemented by using a simple threshold base classifier. The classification efficiency of these procedures is compared with that obtained by using single preprocessor based AdaBoosting. Four benchmark data sets are used for validation. The analysis demonstrates that the performance of the AdaBoost algorithm is enhanced further by multiple 170 Divya Somvanshi and R.D.S. Yadava preprocessor based fusion of data space, particularly if the original variables are of same kind.
منابع مشابه
Facial Expression Recognition Based on Anatomical Structure of Human Face
Automatic analysis of human facial expressions is one of the challenging problems in machine vision systems. It has many applications in human-computer interactions such as, social signal processing, social robots, deceit detection, interactive video and behavior monitoring. In this paper, we develop a new method for automatic facial expression recognition based on facial muscle anatomy and hum...
متن کاملCombining Adaboost with Preprocessing Algorithms for Extracting Fuzzy Rules from Low Quality Data in Possibly Imbalanced Problems
An extension of the Adaboost algorithm for obtaining fuzzy rule-based systems from low quality data is combined with preprocessing algorithms for equalizing imbalanced datasets. With the help of synthetic and real-world problems, it is shown that the performance of the Adaboost algorithm is degraded in presence of a moderate uncertainty in either the input or the output values. It is also estab...
متن کاملA Novel Fusion Method for Semantic Concept Classification in Video
Semantic concept classification is a critical task for content-based video retrieval. Traditional methods of machine learning focus on increasing the accuracy of classifiers or models, and face the problems of inducing new data errors and algorithm complexity. Recent researches show that fusion strategies of ensemble learning have appeared promising for improving the classification performance,...
متن کاملTree-based Ensemble Classifiers for High-dimensional Data
Building a classification model from thousands of available predictor variables with a relatively small sample size presents challenges for most traditional classification algorithms. When the number of samples is much smaller than the number of predictors, there can be a multiplicity of good classification models. An ensemble classifier combines multiple single classifiers to improve classific...
متن کاملIntegration of Support Vector Machine and Bayesian Neural Network for Data Mining and Classification
Several combinations of the preprocessing algorithms, feature selection techniques and classifiers can be applied to the data classification tasks. This study introduces a new accurate classifier, the proposed classifier consist from four components: Signal-toNoise as a feature selection technique, support vector machine, Bayesian neural network and AdaBoost as an ensemble algorithm. To verify ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011