Ranking Classi cation Algorithms with Dataset Selection: Using Accuracy and Time Results

نویسندگان

  • Carlos Soares
  • Pavel B. Brazdil
چکیده

Given that wide variety of available classiication algorithms exists, the selection of the right algorithm to use on a new problem is an important issue. In this paper we present zooming, that analyzes a given dataset and selects relevant (similar) datasets used in the past. This process is based on the \distance" calculated on the basis of several dataset characteristics. The accuracy and time results associated with the selected datasets are then processed to generate an advice in the form of a ranking, indicating which algorithms should be applied in which order. Here we propose the adjusted ratio of ratios ranking method. The generalization power of this ranking method is analyzed and the experimental results indicate that zooming leads to better results on average. The work presented can be seen as a rst step towards a system to provide advice on the utility of diierent solution strategies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ranking Classi cation Algorithms Based on Relevant Performance Information

Given the wide variety of available classiication algorithms and the volume of data today's organizations need to analyze, the selection of the right algorithm to use on a new problem is an important issue. In this paper we present zooming, a technique that, for a given dataset, selects relevant past performance information. The selection process is based on the distance between the dataset at ...

متن کامل

Attribute bagging: improving accuracy of classifier ensembles by using random feature subsets

We present attribute bagging (AB), a technique for improving the accuracy and stability of classi#er ensembles induced using random subsets of features. AB is a wrapper method that can be used with any learning algorithm. It establishes an appropriate attribute subset size and then randomly selects subsets of features, creating projections of the training set on which the ensemble classi#ers ar...

متن کامل

Feature selection using Fuzzy Entropy measures with Yu ' s Similarity measure

In this study, feature selection in classi cation based problems is highlighted. The role of feature selection methods is to select important features by discarding redundant and irrelevant features in the data set, we investigated this case by using fuzzy entropy measures. We developed fuzzy entropy based feature selection method using Yu's similarity and test this using similarity classi er. ...

متن کامل

Feature Selection: Evaluation, Application, and Small Sample Performance

A large number of algorithms have been proposed for feature subset selection. Our experimental results show that the sequential forward oating selection (SFFS) algorithm, proposed by Pudil et al., dominates the other algorithms tested. We study the problem of choosing an optimal feature set for land use classi cation based on SAR satellite images using four di erent texture models. Pooling feat...

متن کامل

Developing a Filter-Wrapper Feature Selection Method and its Application in Dimension Reduction of Gen Expression

Nowadays, increasing the volume of data and the number of attributes in the dataset has reduced the accuracy of the learning algorithm and the computational complexity. A dimensionality reduction method is a feature selection method, which is done through filtering and wrapping. The wrapper methods are more accurate than filter ones but perform faster and have a less computational burden. With ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007