q learning

Use of Q-learning approaches for practical medium access control in wireless sensor networks

Journal: :Eng. Appl. of AI 2016

Selahattin Kosunalp Yi Chu Paul D. Mitchell David Grace Tim Clarke

This paper studies the potential of a novel approach to ensure more efficient and intelligent assignment of capacity through medium access control (MAC) in practical wireless sensor networks. Q-Learning is employed as an intelligent transmission strategy. We review the existing MAC protocols in the context of Q-learning. A recently-proposed, ALOHA and Q-Learning based MAC scheme, ALOHA-Q, is co...

متن کامل

Learning from Demonstrations for Real World Reinforcement Learning

Journal: :CoRR 2017

Todd Hester Matej Vecerik Olivier Pietquin Marc Lanctot Tom Schaul Bilal Piot Andrew Sendonaris Gabriel Dulac-Arnold Ian Osband John Agapiou Joel Z. Leibo Audrunas Gruslys

Deep reinforcement learning (RL) has achieved several high profile successes in difficult decision-making problems. However, these algorithms typically require a huge amount of data before they reach reasonable performance. In fact, their performance during learning can be extremely poor. This may be acceptable for a simulator, but it severely limits the applicability of deep RL to many real-wo...

متن کامل

Online adaptive policies for ensemble classifiers

Journal: :Neurocomputing 2005

Christos Dimitrakakis Samy Bengio

Ensemble algorithms can improve the performance of a given learning algorithm through the combination of multiple base classifiers into an ensemble. In this paper we attempt to train and combine the base classifiers using an adaptive policy. This policy is learnt through a Q-learning inspired technique. Its effectiveness for an essentially supervised task is demonstrated by experimental results...

متن کامل

Cooperative Q-learning: the knowledge sharing issue

Journal: :Advanced Robotics 2001

Majid Nili Ahmadabadi Masoud Asadpour Eiji Nakano

A group of cooperative and homogeneous Q-learning agents can cooperate to learn faster and gainmore knowledge. In order to do so, each learner agent must be able to evaluate the expertness and the intelligence level of the other agents, and to assess the knowledge and the information it gets from them. In addition, the learner needs a suitable method to properly combine its own knowledge and wh...

متن کامل

Apprentissage par renforcement dans le cadre des processus décisionnels de Markov factorisés observables dans le désordre. Étude expérimentale du Q-Learning parallèle appliqué aux problèmes du labyrinthe et du New York Driving

Journal: :Revue d'Intelligence Artificielle 2006

Guillaume J. Laurent Emmanuel Piat

RÉSUMÉ. Cet article présente les résultats expérimentaux obtenus avec une architecture originale permettant un apprentissage générique dans le cadre de processus décisionnels de Markov factorisés observables dans le désordre (PDMFOD). L’article décrit tout d’abord le cadre formel des PDMFOD puis le fonctionnement de l’algorithme, notamment le principe de parallélisation et l’attribution dynamiq...

متن کامل

Q-learning with heuristic exploration in Simulated Car Racing

2013

Daniel Karavolos

متن کامل

Qlass: an Enhancement of Q-learning to Generate State Space Adaptively

2007

Hajime Murao Shinzo Kitamura

In this paper, we propose Q-learning with adaptive state segmentation (QLASS). QLASS provides an e cient method to construct state space suitable for Q-learning to accomplish the task in a continuous sensor space. In QLASS, the robot starts with single state covering whole sensor space. The sensor space is segmented incrementally based on sensor vectors and reinforcement signals. The segmented ...

متن کامل

Naive Augmenting Q-Learning to Process Feature-Based Representations of States

2014

Janis Zuters

Temporal difference algorithms perform well on discrete and small problems. This paper proposes a modification of the Q-learning algorithm towards natural ability to receive a feature list instead of an already identified state in the input. Complete observability is still assumed. The algorithm, Naive Augmenting Q-Learning, has been designed through building a hierarchical structure of input f...

متن کامل

Deep Q-Learning With Q-Matrix Transfer Learning for Novel Fire Evacuation Environment

Journal: :IEEE transactions on systems, man, and cybernetics 2021

Deep reinforcement learning (RL) is achieving significant success in various applications like control, robotics, games, resource management, and scheduling. However, the important problem of emergency evacuation, which clearly could benefit from RL, has been largely unaddressed. Indeed, evacuation a complex task that difficult to solve with RL. An situation highly dynamic, lot changing variabl...

متن کامل

Supervised Q-walk for Learning Vector Representation of Nodes in Networks

Journal: :CoRR 2017

Naimish Agarwal Gora Chand Nandi

Automatic feature learning algorithms are at the forefront of modern day machine learning research. We present a novel algorithm, supervised Q-walk, which applies Q-learning to generate random walks on graphs such that the walks prove to be useful for learning node features suitable for tackling with the node classification problem. We present another novel algorithm, k-hops neighborhood based ...

متن کامل