A Replicator Dynamics Approach to Collective Feature Engineering in Random Forests

نویسندگان

  • Khaled Fawagreh
  • Mohamed Medhat Gaber
  • Eyad Elyan
چکیده

It has been demonstrated how random subspaces can be used to create a Diversified Random Forest, which in turn can lead to better performance in terms of predictive accuracy. Motivated by the fact that each subsforest is built using a set of features that can overlap with those sets of features in other subforests, we hypothesise that using Replicator Dynamics can perform a collective feature engineering, by allowing subforests with better performance to grow and those with lower performance to shrink. In this paper, we propose a new method to further improve the performance of Diversified Random Forest using Replicator Dynamics which has been used extensively in evolutionary game dynamics. A thorough experimental study on 15 real datasets showed favourable results, demonstrating the potential of the proposed method. Some experiments reported a boost in predictive accuracy of over 10% consistently, evidencing the effectiveness of the iterative feature engineering achieved through the Replicator Dynamics procedure.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sparse Covariance Matrix Adaptation Techniques for Evolution Strategies

Knowledge Discovery and Data Mining A Replicator Dynamics Approach to Collective Feature Engineering in Random Forests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Khaled Fawgreh, Mohamed Medhat Gaber and Eyad Elyan A Directed Acyclic Graph Based Approach to Multi-Class Ensemble Classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 Esra’a Alshda...

متن کامل

Fault Locating in High Voltage Transmission Lines Based on Harmonic Components of One-end Voltage Using Random Forests

In this paper, an approach is proposed for accurate locating of single phase faults in transmission lines using voltage signals measured at one-end. In this method, harmonic components of the voltage signals are extracted through Discrete Fourier Transform (DFT) and are normalized by a transformation. The proposed fault locator, which is designed based on Random Forests (RF) algorithm, is train...

متن کامل

A MODEL FOR EVOLUTIONARY DYNAMICS OF WORDS IN A LANGUAGE

Human language, over its evolutionary history, has emerged as one of the fundamental defining characteristic of the modern man. However, this milestone evolutionary process through natural selection has not left any ’linguistic fossils’ that may enable us to trace back the actual course of development of language and its establishment in human societies. Lacking analytical tools to fathom the cr...

متن کامل

Coupled replicator equations for the dynamics of learning in multiagent systems.

Starting with a group of reinforcement-learning agents we derive coupled replicator equations that describe the dynamics of collective learning in multiagent systems. We show that, although agents model their environment in a self-interested way without sharing knowledge, a game dynamics emerges naturally through environment-mediated interactions. An application to rock-scissors-paper game inte...

متن کامل

VHR Semantic Labeling by Random Forest Classification and Fusion of Spectral and Spatial Features on Google Earth Engine

Semantic labeling is an active field in remote sensing applications. Although handling high detailed objects in Very High Resolution (VHR) optical image and VHR Digital Surface Model (DSM) is a challenging task, it can improve the accuracy of semantic labeling methods. In this paper, a semantic labeling method is proposed by fusion of optical and normalized DSM data. Spectral and spatial featur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015