Combining Binary Classifiers for a Multiclass Problem with Differential Privacy
نویسندگان
چکیده
Multiclass classification problem is often solved by combing binary classifiers into ensembles. While this is required for inherently binary classifiers, such as SVM, it also provides performance advantages for other classifiers. In this paper, we address the problem of combining binary classifiers into ensembles in the differentially private data publishing framework, where the data privacy is achieved by anonymization. The main idea of this paper is to counter the inevitable loss of data quality due to anonymization of the data by building an ensemble of binary classifiers, and then to use an error-correcting approach to obtain a class decision from this ensemble. We describe the proposed algorithm and present the results of extensive experimentation on synthetic and UC Irvine data. We find that while building ensembles after anonymization leads to no change in classifier accuracy, preparing the data for ensembles prior to anonymization improves accuracy in most of the cases.
منابع مشابه
A General Procedure for Combining Binary Classifiers and Its Performance Analysis
A general procedure for combining binary classifiers for multiclass classification problems with one-against-one decomposition policy is presented in this paper. Two existing schemes, namely the min-max combination and the most-winning combination, may be regarded as its two special cases. We show that the accuracy of the combination procedure will increase and time complexity will decrease as ...
متن کاملRecursive Ant Colony Based ECOC: An Ensemble Learning Technique for Classifying Data
Error correcting output code (ECOC) is one of the widely used classifier ensemble technique .That technique provide solution for the various multiclass classification problem by dividing multiclass problem into binary class classification problem. In this paper, a new enhanced heuristic coding method, based on ECOC, RACS-ECOC is proposed. To generate strong classifiers for the multiclass classi...
متن کاملA comparison of methods for multiclass support vector machines
Support vector machines (SVMs) were originally designed for binary classification. How to effectively extend it for multiclass classification is still an ongoing research issue. Several methods have been proposed where typically we construct a multiclass classifier by combining several binary classifiers. Some authors also proposed methods that consider all classes at once. As it is computation...
متن کاملConvex Optimization for Binary Classifier Aggregation in Multiclass Problems
Multiclass problems are often decomposed into multiple binary problems that are solved by individual binary classifiers whose results are integrated into a final answer. Various methods, including all-pairs (APs), one-versus-all (OVA), and error correcting output code (ECOC), have been studied, to decompose multiclass problems into binary problems. However, little study has been made to optimal...
متن کاملReducing Multiclass to Binary: A Unifying Approach for Margin Classifiers
We present a unifying framework for studying the solution of multiclass categorization problems by reducing them to multiple binary problems that are then solved using a margin-based binary learning algorithm. The proposed framework unifies some of the most popular approaches in which each class is compared against all others, or in which all pairs of classes are compared to each other, or in w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Trans. Data Privacy
دوره 7 شماره
صفحات -
تاریخ انتشار 2014