Fuzzy-rough Information Gain Ratio Approach to Filter-wrapper Feature Selection

نویسندگان

چکیده مقاله:

Feature selection for various applications has been carried out for many years in many different research areas. However, there is a trade-off between finding feature subsets with minimum length and increasing the classification accuracy. In this paper, a filter-wrapper feature selection approach based on fuzzy-rough gain ratio is proposed to tackle this problem. As a search strategy, a modified Ant Colony Optimization (ACO) algorithm is applied on filter phase. ACO has been approved to be a suitable solution in many difficult problems with graph search space such as feature selection. Choosing minimal data reductions among the subsets of features with first and second maximum accuracies is the main contribution of this work. To verify the efficiency of our approach, experiments are performed on 10 well-known UCI data sets. Analysis of the experimental results demonstrates that the proposed approach is able to satisfy two conflicting constraints of feature selection, increasing the classification accuracy as well as decreasing the length of the reduced subsets of features.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diagnosis of the disease using an ant colony gene selection method based on information gain ratio using fuzzy rough sets

With the advancement of metagenome data mining science has become focused on microarrays. Microarrays are datasets with a large number of genes that are usually irrelevant to the output class; hence, the process of gene selection or feature selection is essential. So, it follows that you can remove redundant genes and increase the speed and accuracy of classification. After applying the gene se...

متن کامل

A hybrid filter-based feature selection method via hesitant fuzzy and rough sets concepts

High dimensional microarray datasets are difficult to classify since they have many features with small number ofinstances and imbalanced distribution of classes. This paper proposes a filter-based feature selection method to improvethe classification performance of microarray datasets by selecting the significant features. Combining the concepts ofrough sets, weighted rough set, fuzzy rough se...

متن کامل

A hybrid wrapper / filter approach for feature subset selection

This work presents a hybrid wrapper/filter algorithm for feature subset selection that can use a combination of several quality criteria measures to rank the set of features of a dataset. These ranked features are used to prune the search space of subsets of possible features such that the number of times the wrapper executes the learning algorithm for a dataset with M features is reduced to O(...

متن کامل

An Effective Feature Selection Approach Using the Hybrid Filter Wrapper

Feature selection is an important data preprocessing technique and has been widely studied in data mining, machine learning and granular computing. In this paper, we introduced an effective feature selection method using the hybrid approaches, that is, use the mutual information to select the candidate feature set, then, obtain the super-reduct space from the candidate feature set by a wrapper ...

متن کامل

On fuzzy-rough sets approach to feature selection

In this paper, we have shown that the fuzzy-rough set attribute reduction algorithm [Jenson, R., Shen, Q., 2002. Fuzzy-rough sets for descriptive dimensionality reduction. In: Proceedings of IEEE International Conference on Fuzzy Systems, FUZZ-IEEE'02, May 12-17, pp. 29-34] is not convergent on many real datasets due to its poorly designed termination criteria; and the computational complexity ...

متن کامل

A hybrid filter/wrapper approach of feature selection using information theory

We focus on a hybrid approach of feature selection. We begin our analysis with a $lter model, exploiting the geometrical information contained in the minimum spanning tree (MST) built on the learning set. This model exploits a statistical test of relative certainty gain, used in a forward selection algorithm. In the second part of the paper, we show that the MST can be replaced by the 1 nearest...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}


عنوان ژورنال

دوره 30  شماره 9

صفحات  1326- 1333

تاریخ انتشار 2017-09-01

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023