Policy search in continuous action domains: An overview
Similar resources
Policy Search in Continuous Action Domains: an Overview
Continuous action policy search, the search for efficient policies in continuous control tasks, is currently the focus of intensive research driven both by the recent success of deep reinforcement learning algorithms and by the emergence of competitors based on evolutionary algorithms. In this paper, we present a broad survey of policy search methods, incorporating into a common big picture the...
Near-Optimal Search in Continuous Domains
We investigate search problems in continuous state and action spaces with no uncertainty. Actions have costs and can only be taken at discrete time steps (unlike the case with continuous control). Given an admissible heuristic function and a starting state, the objective is to find a minimum-cost plan that reaches a goal state. As the continuous domain does not allow the tight optimality result...
Batch Policy Iteration Algorithms for Continuous Domains
This paper establishes the link between an adaptation of the policy iteration method for Markov decision processes with continuous state and action spaces and the policy gradient method when the differentiation of the mean value is directly done over the policy without parameterization. This approach allows deriving sound and practical batch Reinforcement Learning algorithms for continuous stat...
Learning Integrated Symbolic and Continuous Action Models for Continuous Domains
Long-living autonomous agents must be able to learn to perform competently in novel environments. One important aspect of competence is the ability to plan, which entails the ability to learn models of the agent’s own actions and their effects on the environment. In this paper we describe an approach to learn action models of environments with continuous-valued spatial states and realistic phys...
Metamodels in action: An overview
This paper strives for demonstrating “metamodels in action” which means showing concrete applications of this concept. Based on a literature survey we develop a taxonomy that helps classifying existing application scenarios concerning the dimensions of domain, design, and integration and briefly describe some of the existing work we came across. Furthermore, we provide an insight into the area ...
Journal
Journal title: Neural Networks
Year: 2019
ISSN: 0893-6080
DOI: 10.1016/j.neunet.2019.01.011