Combining TD-learning with Cascade-correlation Networks

Authors

François Rivest

Doina Precup

Abstract

Using neural networks to represent value functions in reinforcement learning algorithms often involves a lot of work in hand-crafting the network structure, and tuning the learning parameters. In this paper, we explore the potential of using constructive neural networks in reinforcement learning. Constructive neural network methods are appealing because they can build the network structure based on the data that needs to be represented. To our knowledge, such algorithms have not been used in reinforcement learning. A major issue is that constructive algorithms often work in batch mode, while many reinforcement learning algorithms work on-line. We use a cache to accumulate data, then use a variant of cascade correlation to update the value function. Preliminary results on the game of Tic-Tac-Toe show the potential of this new algorithm, compared to using static feed-forward neural networks trained with backpropagation.
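The cache-then-batch idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the class name `CachedTDLearner`, its parameters, and the use of a linear model fitted by batch gradient descent are all assumptions made for illustration. The linear model merely stands in for the cascade-correlation variant, whose details the abstract does not give. The sketch only shows how on-line TD(0) targets might be accumulated in a cache and handed to a batch learner once the cache is full.

```python
class CachedTDLearner:
    """Hypothetical sketch: cache TD(0) targets, then refit a value function in batch.

    The batch learner here is a plain linear model (an assumption); the paper
    uses a variant of cascade correlation instead.
    """

    def __init__(self, n_features, cache_size=2, gamma=0.9, lr=0.5, epochs=100):
        self.w = [0.0] * n_features      # weights of the stand-in value function
        self.cache_size = cache_size     # number of samples that triggers a batch fit
        self.gamma = gamma               # discount factor
        self.lr = lr                     # batch gradient-descent step size
        self.epochs = epochs             # passes over the cache per batch fit
        self.cache = []                  # list of (state_features, td_target)
        self.batch_updates = 0

    def value(self, s):
        """Linear value estimate V(s) = w . s."""
        return sum(wi * si for wi, si in zip(self.w, s))

    def observe(self, s, reward, s_next=None):
        """Store one on-line transition; s_next=None marks a terminal step."""
        if s_next is None:
            target = reward
        else:
            target = reward + self.gamma * self.value(s_next)
        self.cache.append((list(s), target))
        if len(self.cache) >= self.cache_size:
            self.fit_batch()

    def fit_batch(self):
        """Fit the cached (state, target) pairs in batch mode, then empty the cache."""
        n = len(self.cache)
        for _ in range(self.epochs):
            grad = [0.0] * len(self.w)
            for s, target in self.cache:
                err = self.value(s) - target
                for i, si in enumerate(s):
                    grad[i] += err * si / n
            self.w = [wi - self.lr * gi for wi, gi in zip(self.w, grad)]
        self.cache.clear()
        self.batch_updates += 1


# Usage: a two-state chain A -> B -> terminal, with reward 1 on the last step.
A, B = [1.0, 0.0], [0.0, 1.0]
learner = CachedTDLearner(n_features=2)
for _ in range(2):
    learner.observe(A, 0.0, B)
    learner.observe(B, 1.0)
print(learner.value(A), learner.value(B))
```

Computing the targets when each transition is observed, rather than when the batch is fitted, keeps the on-line part cheap and gives the batch learner a fixed supervised data set, which is the setting constructive methods usually expect.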

Similar resources

Application of combined genetic algorithms with cascade correlation to diagnosis of delayed gastric emptying from electrogastrograms.

The current standard method (radioscintigraphy) for the diagnosis of delayed gastric emptying (GE) of a solid meal involves radiation exposure and considerable expense. Based on combining genetic algorithms with the cascade correlation learning architecture, a neural network approach is proposed for the diagnosis of delayed GE from electrogastrograms (EGGs). EGGs were measured by placing surfac...

TD(λ) Networks: Temporal-Difference Networks with Eligibility Traces

Temporal-difference (TD) networks have been introduced as a formalism for expressing and learning grounded world knowledge in a predictive form (Sutton & Tanner, 2005). Like conventional TD(0) methods, the learning algorithm for TD networks uses 1-step backups to train prediction units about future events. In conventional TD learning, the TD(λ) algorithm is often used to do more general multi-s...

Variations on the Cascade-Correlation Learning Architecture for Fast Convergence in Robot Control

Most applications of Neural Networks in Control Systems use a version of the BackPropagation algorithm for training. Learning in these networks is generally a slow and very time consuming process. Cascade-Correlation is a supervised learning algorithm that automatically determines the size and topology of the network and is quicker than back-propagation in learning for several benchmarks. We pr...

Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)

Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...

Learning Aggregate Functions with Neural Networks Using a Cascade-Correlation Approach

In various application domains, data can be represented as bags of vectors. Learning functions over such bags is a challenging problem. In this paper, a neural network approach, based on cascade-correlation networks, is proposed to handle this kind of data. By defining special aggregation units that are integrated in the network, a general framework to learn functions over bags is obtained. Res...

Publication date: 2003
