Parameter-Free Approximation Method for Controlling Discrete Event Simulation by Reinforcement Learning
نویسندگان
چکیده
منابع مشابه
A Discrete Parameter Stochastic Approximation Algorithm for Simulation Optimization
We develop in this paper a two-timescale simultaneous perturbation stochastic approximation algorithm for simulation based parameter optimization over discrete sets. This algorithm is applicable in cases where the cost to be optimized is in itself the longrun average of certain cost functions whose noisy estimates are obtained via simulation. We present the convergence analysis of our algorithm...
متن کاملEfficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case
We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose here the assumption that set of possible parameters is finite, and consider the discounted return. We propose an on-line algorithm for learning in such parameterized models, dubbed the Parameter Elimination (PEL) algor...
متن کاملA Third Order Discrete Event Method for Continuous System Simulation
This paper introduces a new numerical method for integration of ordinary differential equations. Following the idea of quantization based integration, i.e., replacing the time discretization by state quantization, this new method performs a third order approximation allowing to achieve better accuracy than their first and second order predecessors. It is shown that the new algorithm satisfies t...
متن کاملWeb-based Environment for Learning Discrete Event Simulation
This paper describes a web-based environment for learning discrete simulation. The main goal of the paper is to foster the process of e-learning simulation by providing students and teachers with effective and comprehensive tools for creating, storing and executing discrete system simulation models. For these purposes the FONWEBGPSS application was developed and integrated into the e-learning s...
متن کاملReinforcement Learning Policy Approximation by Behavior Trees
Traditionally a Reinforcement Learning (RL) policy is stored in a lookup table. From such a table it is difficult to observe the behavioral logic or manually adjust this logic post-learning is difficult. This paper shows how behavioral logic of a RL controller is presented in an insightful manner and can be adjusted using the Behavior Tree (BT) framework. It shows a method to approximate an RL ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Simulation notes Europe
سال: 2023
ISSN: ['2305-9974', '2306-0271']
DOI: https://doi.org/10.11128/sne.33.sn.10635