keywords reinforcement learning

نتایج جستجو برای: keywords reinforcement learning

تعداد نتایج: 2453256 فیلتر نتایج به سال:

Multi-Agent Learning Methods in an Uncertain Environment

2002

ShuHua Liu YanTao

Learning in multi-agent environments constitutes a research and application area whose importance is broadly acknowledged in artificial intelligence. There is a rapidly growing body of literature on multi-agent learning. In this paper, the multi-agent learning methods in an uncertain environment are addressed. The presented methods are not exhaustive, but they highlight the major methods used b...

متن کامل

hierarchical functional concepts for knowledge transfer among reinforcement learning agents

Journal: :iranian journal of fuzzy systems 2015

a. mousavi m. nili ahmadabadi h. vosoughpour b. n. araabi n. zaare

this article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for reinforcement learning agents. these definitions are used as a tool of knowledge transfer among agents. the agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. in other words, the agents are assumed t...

متن کامل

Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)

2017

Koffka Khan

Koffka Khan Department of Computing and Information Technology The University of the West Indies, Trinidad and Tobago, W.I Email: [email protected] Wayne Goodridge Department of Computing and Information Technology The University of the West Indies, Trinidad and Tobago, W.I Email: [email protected] -------------------------------------------------------------------ABSTRACT----...

متن کامل

Multiple-Goal Reinforcement Learning with Modular Sarsa(O)

2003

Nathan Sprague Dana Ballard

We present a new algorithm, GM-Sarsa(O), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processes. According to our formulation different sub-goals are modeled as MDPs that are coupled by the requirement that they share actions. Existing reinforcement learning algorithms address similar problem formulations by fir...

متن کامل

Basis Function Adaptation in Temporal Difference Reinforcement Learning

Journal: :Annals OR 2005

Ishai Menache Shie Mannor Nahum Shimkin

We examine methods for on-line optimization of the basis function for temporal difference Reinforcement Learning algorithms. We concentrate on architectures with a linear parameterization of the value function. Our methods optimize the weights of the network while simultaneously adapting the parameters of the basis functions in order to decrease the Bellman approximation error. A gradient-based...

متن کامل

Is Active Learning via Internet Technologies Possible?

Journal: :IJOPCD 2017

Victor C. X. Wang Leslie Hitch

This study addresses the question of whether or not active learning can be taught online. There are many definitions of learning: It is the process and the sum total of acquiring knowledge, skills, attitudes, values, beliefs, and emotions. There is, however, a nuanced definition of active online learning, defined as methods by which learners actively participate in the learning process (e.g., o...

متن کامل

Acquiring Body Representation for Reinforcement Learning Based on Slow Feature Analysis

2011

Akihiko Nishikawa Masaki Ogino Minoru Asada

The center of spatio-temporal representation for own body and its surrounding space is supposed at the parietal cortex in human brains, but the mechanism how the brain computes them is still not clearly understood though its hierarchical representation is expected. One of such hierarchical models, this paper propose a method which integrates multimodal information based on the Slow Feature Anal...

متن کامل

Intl . Joint Conf . on Neural Networks IJCNN ’ 98 , Anchorage

1998

Eduardo Sanchez

|Blackjack or twenty-one is a card game where the player attempts to beat the dealer, by obtaining a sum of card values that is equal to or less than 21 so that his total is higher than the dealer's. The probabilistic nature of the game makes it an interesting testbed problem for learning algorithms, though the problem of learning a good playing strategy is not obvious. Learning with a teacher ...

متن کامل

Reinforcement Learning based Routing Protocols in WSNs: A Survey

2013

Anju Arya Amita Malik Ritu Garg

Advances in the technology along with reduction in processor size, its memory, and wireless antenna size has facilitated the construction of low cost, low powered and multifunctional Sensor nodes which in turn led to high demand for development of Wireless Sensor Networks. A lot of research work has been done regarding the development of routing protocols for WSNs. This paper provides a brief o...

متن کامل

A Computer Model of Language Acquisition – Syllable Learning – Based on Hebbian Cell Assemblies and Reinforcement Learning

2009

Sepideh Fazeli Fariba Bahrami

Investigating language acquisition is one of the most challenging problems in the area of studying language. Syllable learning as a level of language acquisition has a considerable significance since it plays an important role in language acquisition. Because of impossibility of studying language acquisition directly with children, especially in its developmental phases, computer models will be...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید