Parameter-Free Approximation Method for Controlling Discrete Event Simulation by Reinforcement Learning

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Discrete Parameter Stochastic Approximation Algorithm for Simulation Optimization

We develop in this paper a two-timescale simultaneous perturbation stochastic approximation algorithm for simulation based parameter optimization over discrete sets. This algorithm is applicable in cases where the cost to be optimized is in itself the longrun average of certain cost functions whose noisy estimates are obtained via simulation. We present the convergence analysis of our algorithm...

متن کامل

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose here the assumption that set of possible parameters is finite, and consider the discounted return. We propose an on-line algorithm for learning in such parameterized models, dubbed the Parameter Elimination (PEL) algor...

متن کامل

A Third Order Discrete Event Method for Continuous System Simulation

This paper introduces a new numerical method for integration of ordinary differential equations. Following the idea of quantization based integration, i.e., replacing the time discretization by state quantization, this new method performs a third order approximation allowing to achieve better accuracy than their first and second order predecessors. It is shown that the new algorithm satisfies t...

متن کامل

Web-based Environment for Learning Discrete Event Simulation

This paper describes a web-based environment for learning discrete simulation. The main goal of the paper is to foster the process of e-learning simulation by providing students and teachers with effective and comprehensive tools for creating, storing and executing discrete system simulation models. For these purposes the FONWEBGPSS application was developed and integrated into the e-learning s...

متن کامل

Reinforcement Learning Policy Approximation by Behavior Trees

Traditionally a Reinforcement Learning (RL) policy is stored in a lookup table. From such a table it is difficult to observe the behavioral logic or manually adjust this logic post-learning is difficult. This paper shows how behavioral logic of a RL controller is presented in an insightful manner and can be adjusted using the Behavior Tree (BT) framework. It shows a method to approximate an RL ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Simulation notes Europe

سال: 2023

ISSN: ['2305-9974', '2306-0271']

DOI: https://doi.org/10.11128/sne.33.sn.10635