Feature Selection On Boolean Symbolic Objects
نویسنده
چکیده
With the boom in IT technology, the data sets used in application are more and more larger and are described by a huge number of attributes, therefore, the feature selection become an important discipline in Knowledge discovery and data mining, allowing the experts to select the most relevant features to improve the quality of their studies and to reduce the time processing of their algorithm. In addition to that, the data used by the applications become richer. They are now represented by a set of complex and structured objects, instead of simple numerical matrixes. The purpose of our algorithm is to do feature selection on rich data, called Boolean Symbolic Objects (BSOs). These objects are described by multivalued features. The BSOs are considered as higher level units which can model complex data, such as cluster of individuals, aggregated data or taxonomies. In this paper we will introduce a new feature selection criterion for BSOs, and we will explain how we improved its complexity.
منابع مشابه
Flexible Matching of Boolean Symbolic Objects
Matching is the process of comparing two or more structures to discover their likenesses or differences. It is a common operation performed in symbolic classification, pattern recognition, data mining and expert systems. The definition of a matching operator for Boolean symbolic objects is important for the development of symbolic data analysis techniques. In this paper we give the definition o...
متن کاملBoolean Reasoning for Feature Extraction Problems
We recall several applications of Boolean reasoning for feature extraction and we propose an approach based on Boolean reasoning for new feature extraction from data tables with symbolic (nominal, qualitative) attributes. New features are of the form a 2 V , where V Va and Va is the set of values of attribute a. We emphasize that Boolean reasoning is also a good framework for complexity analysi...
متن کاملFusion of Feature Selection with Symbolic Approach for Dimensionality Reduction
In this paper, a fusion of two methods for dimensionality reduction is proposed. First method is the selection of features using FQ measure method followed by another method based on symbolic approach is proposed. The symbolic method is based on the transformation of features into symbolic data using the property of collinearity and variance based cumulative sum of features. In this proposed ap...
متن کاملA feature selection algorithm with Fuzzy information
This paper deals with the problem of feature selection. Almuallim and Dieterich [1] developed the FOCUS algorithm which performs optimal feature selection on boolean domains. In a previous paper an extension of FOCUS is developed to deal with discrete and continuous features. In this paper we present an extension to work with fuzzy features, which is verified on two well known problem with quit...
متن کاملSymbolic Approaches to Feature Interaction Detection
Feature interaction refers to situations where a combination of different services behaves differently than expected from the single services’ behaviors. For example, consider a situation where user A has subscribed to the service Originating Call Screening (OCS) and does not want calls to user C to be put through, and user B has activated the service Call Forwarding (CF) to user C. In this sit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1405.0647 شماره
صفحات -
تاریخ انتشار 2013