reward penalty scheme

نتایج جستجو برای: reward penalty scheme

تعداد نتایج: 265788 فیلتر نتایج به سال:

Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints

Journal: :Drones 2023

This paper presents a study on quadrotor unmanned aerial vehicle (UAV) fault-tolerant control scheme. According to the attitude model and safety of aircraft under uncertainty inertial matrix, state constraint by reinforcement learning is designed ensure safety. Even if boundary crossed, it can be pulled back means penalty function with learning. Meanwhile, in order inhibit oscillation caused im...

متن کامل

Generalized pursuit learning schemes: new families of continuous and discretized learning automata

Journal: :IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society 2002

M. Agache B. John Oommen

The fastest learning automata (LA) algorithms currently available fall in the family of estimator algorithms introduced by Thathachar and Sastry (1986). The pioneering work of these authors was the pursuit algorithm, which pursues only the current estimated optimal action. If this action is not the one with the minimum penalty probability, this algorithm pursues a wrong action. In this paper, w...

متن کامل

Inhibition, Reinforcement Sensitivity and Temporal Information Processing in ADHD and ADHD+ODD: Evidence of a Separate Entity?

Journal: :Journal of Abnormal Child Psychology 2009

Marjolein Luman Steffen J. P. van Noesel Alky Papanikolau Janneke Van Oostenbruggen-Scheffer Diane Veugelers Joseph A. Sergeant Jaap Oosterlaan

This study compared children with ADHD-only, ADHD+ODD and normal controls (age 8-12) on three key neurocognitive functions: response inhibition, reinforcement sensitivity, and temporal information processing. The goal was twofold: (a) to investigate neurocognitive impairments in children with ADHD-only and children with ADHD+ODD, and (b) to test whether ADHD+ODD is a more severe from of ADHD in...

متن کامل

Continuous and Discretized Generalized Pursuit Learning Schemes

2000

Mariana Agache John Oommen

A Learning Automaton is an automaton that interacts with a random environment, having as its goal the task of learning the optimal action based on its acquired experience. Many learning automata have been proposed, with the class of Estimator Algorithms being among the fastest ones. Thathachar and Sastry [24], through the Pursuit Algorithm, introduced the concept of learning algorithms. Their a...

متن کامل

The Influence of Reward and Penalty on Households’ Recycling Intention

Journal: :APCBEE Procedia 2014

متن کامل

Nonequivalent modulation of corticospinal excitability by positive and negative outcomes

2018

Makoto Suzuki Toyohiro Hamaguchi Atsuhiko Matsunaga

Objective The difference between positive and negative outcomes is important in trial-and-error decision-making processes and affects corticospinal excitability. This study investigated corticospinal excitability during the performance of trial-and-error decision-making tasks with varying competing behavioral outcomes. Methods Each trial began with one of five colored circles presented as a c...

متن کامل

Tabu Search with Intensification Strategy for Functional Partitioning in Hardware-Software Codesign

2002

Theerayod Wiangtong Peter Y. K. Cheung Wayne Luk

This paper presents tabu search (TS) method with intensification strategy for hardware-software partitioning. The algorithm operates on functional blocks for designs represented as directed acyclic graphs (DAG), with the objective of minimising processing time under various hardware area constraints. Results are compared to two other heuristic search algorithms: genetic algorithm (GA) and simul...

متن کامل

Conflict Resolution Algorithms for Fault Detection and Diagnosis

2011

Ali Nasir Ella M. Atkins Ilya V. Kolmanovsky

We present two approaches for conflict resolution between two fault detection schemes, detecting the same fault, via optimization with bounded adjustment of detection thresholds. In our first method, we assume initially that there is no conflict and optimize the thresholds of both schemes with respect to a partial cost function that penalizes false alarms and missed detections. Then we continuo...

متن کامل

Pii: S0166-5316(02)00128-1

1998

Sheng-Tzong Cheng Chi-Ming Chen Ing-Ray Chen

An admission control algorithm for a multimedia server is responsible for determining if a new request can be accepted without violating the QoS requirements of the existing requests in the system. Most admission control algorithms treat every request uniformly and hence optimize the system performance by maximizing the number of admitted and served requests. In practice, requests might have di...

متن کامل

Environmental Assessment of Limestone Calcined Clay Cement in Australia

Journal: :IOP conference series 2023

Abstract The Australian government aims to achieve net-zero carbon emissions by 2050. Therefore, introducing a market-oriented trading scheme offer financial reward (or penalty) those who emit below beyond) the allowed limits is expected. Under such scheme, cement industry forced reduce its energy consumption and emissions. Limestone calcined clay (LC3) has been extensively studied regarded as ...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید