q policy

نتایج جستجو برای: q policy

تعداد نتایج: 381585 فیلتر نتایج به سال:

An Absorbing Markov Chain Approach to GI/M/1 Queues with Generalized Vacations

2005

Kyung Chul Chae Sang Min Lee

1 Q I − − ) ( y , where the j th entry of the row vector y is the probability that the system state seen by the first arrival during a busy period is j and 1 Q I − − ) ( is the fundamental matrix associated with the standard GI/M/1 queue. In this paper, we present the entries of 1 Q I − − ) ( explicitly. Also, we illustrate how to find y by examples such as the N -policy GI/M/1 queue with or wi...

متن کامل

Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments

Journal: :Proceedings of the AAAI Conference on Artificial Intelligence 2020

متن کامل

Strategies of Policy Advocacy Organizations and Their Theoretical Affinities: Evidence from Q-Methodology

Journal: :Policy Studies Journal 2016

متن کامل

An Empirical Comparison of Neural Architectures for Reinforcement Learning in Partially Observable Environments

Journal: :CoRR 2015

Denis Steckelmacher Peter Vrancx

This paper explores the performance of fitted neural Q iteration for reinforcement learning in several partially observable environments, using three recurrent neural network architectures: Long ShortTerm Memory [7], Gated Recurrent Unit [3] and MUT1, a recurrent neural architecture evolved from a pool of several thousands candidate architectures [8]. A variant of fitted Q iteration, based on A...

متن کامل

Deep Reinforcement Learning with Regularized Convolutional Neural Fitted Q Iteration

2016

Cosmo Harrigan

We review the deep reinforcement learning setting, in which an agent receiving high-dimensional input from an environment learns a control policy without supervision using multilayer neural networks. We then extend the Neural Fitted Q Iteration value-based reinforcement learning algorithm (Riedmiller et al) by introducing a novel variation which we call Regularized Convolutional Neural Fitted Q...

متن کامل

A novel policy iteration based deterministic Q-learning for discrete-time nonlinear systems

Journal: :Science China Information Sciences 2015

متن کامل

Belief Structures, Common Policy Space and Health Care Reform: A Q Methodology Study

Journal: :Psychology 2011

متن کامل

Off-Policy Actor-Critic

Journal: :CoRR 2012

Thomas Degris Martha White Richard S. Sutton

This paper presents the first actor-critic algorithm for off-policy reinforcement learning. Our algorithm is online and incremental, and its per-time-step complexity scales linearly with the number of learned weights. Previous work on actor-critic algorithms is limited to the on-policy setting and does not take advantage of the recent advances in offpolicy gradient temporal-difference learning....

متن کامل

Determining the Optimal Order Quantity with Compound Erlang Demand under (T,Q) Policy

Journal: :Mathematical Problems in Engineering 2018

متن کامل

Revealing stakeholders’ perspectives on educational language policy in higher education through Q-methodology

Journal: :Current Issues in Language Planning 2020

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید