نتایج جستجو برای: reward penalty scheme

تعداد نتایج: 265788  

Journal: :Drones 2023

This paper presents a study on quadrotor unmanned aerial vehicle (UAV) fault-tolerant control scheme. According to the attitude model and safety of aircraft under uncertainty inertial matrix, state constraint by reinforcement learning is designed ensure safety. Even if boundary crossed, it can be pulled back means penalty function with learning. Meanwhile, in order inhibit oscillation caused im...

Journal: :IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society 2002
M. Agache B. John Oommen

The fastest learning automata (LA) algorithms currently available fall in the family of estimator algorithms introduced by Thathachar and Sastry (1986). The pioneering work of these authors was the pursuit algorithm, which pursues only the current estimated optimal action. If this action is not the one with the minimum penalty probability, this algorithm pursues a wrong action. In this paper, w...

Journal: :Journal of Abnormal Child Psychology 2009
Marjolein Luman Steffen J. P. van Noesel Alky Papanikolau Janneke Van Oostenbruggen-Scheffer Diane Veugelers Joseph A. Sergeant Jaap Oosterlaan

This study compared children with ADHD-only, ADHD+ODD and normal controls (age 8-12) on three key neurocognitive functions: response inhibition, reinforcement sensitivity, and temporal information processing. The goal was twofold: (a) to investigate neurocognitive impairments in children with ADHD-only and children with ADHD+ODD, and (b) to test whether ADHD+ODD is a more severe from of ADHD in...

2000
Mariana Agache John Oommen

A Learning Automaton is an automaton that interacts with a random environment, having as its goal the task of learning the optimal action based on its acquired experience. Many learning automata have been proposed, with the class of Estimator Algorithms being among the fastest ones. Thathachar and Sastry [24], through the Pursuit Algorithm, introduced the concept of learning algorithms. Their a...

2018
Makoto Suzuki Toyohiro Hamaguchi Atsuhiko Matsunaga

Objective The difference between positive and negative outcomes is important in trial-and-error decision-making processes and affects corticospinal excitability. This study investigated corticospinal excitability during the performance of trial-and-error decision-making tasks with varying competing behavioral outcomes. Methods Each trial began with one of five colored circles presented as a c...

2002
Theerayod Wiangtong Peter Y. K. Cheung Wayne Luk

This paper presents tabu search (TS) method with intensification strategy for hardware-software partitioning. The algorithm operates on functional blocks for designs represented as directed acyclic graphs (DAG), with the objective of minimising processing time under various hardware area constraints. Results are compared to two other heuristic search algorithms: genetic algorithm (GA) and simul...

2011
Ali Nasir Ella M. Atkins Ilya V. Kolmanovsky

We present two approaches for conflict resolution between two fault detection schemes, detecting the same fault, via optimization with bounded adjustment of detection thresholds. In our first method, we assume initially that there is no conflict and optimize the thresholds of both schemes with respect to a partial cost function that penalizes false alarms and missed detections. Then we continuo...

1998
Sheng-Tzong Cheng Chi-Ming Chen Ing-Ray Chen

An admission control algorithm for a multimedia server is responsible for determining if a new request can be accepted without violating the QoS requirements of the existing requests in the system. Most admission control algorithms treat every request uniformly and hence optimize the system performance by maximizing the number of admitted and served requests. In practice, requests might have di...

Journal: :IOP conference series 2023

Abstract The Australian government aims to achieve net-zero carbon emissions by 2050. Therefore, introducing a market-oriented trading scheme offer financial reward (or penalty) those who emit below beyond) the allowed limits is expected. Under such scheme, cement industry forced reduce its energy consumption and emissions. Limestone calcined clay (LC3) has been extensively studied regarded as ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید