Search results for: root reinforcement

Number of results: 179037

Thesis: Ministry of Health and Medical Education - Fars Province University of Medical Sciences and Health Services - School of Dentistry 1385

No abstract available.

Journal: IEEE Journal of Selected Topics in Quantum Electronics 2022

We propose an autoencoder (AE)-based transceiver for a wavelength division multiplexing (WDM) system impaired by hardware imperfections. We design our AE following the architecture of conventional communication systems. This enables us to initialize the AE-based transceiver to have performance similar to its conventional counterpart prior to training, and improves the convergence rate. We first train the AE in a single-channel system and show that it achiev...

Journal: Lancet 2017
An P Jairam, Lucas Timmermans, Hasan H Eker, Robert E G J M Pierik, David van Klaveren, Ewout W Steyerberg, Reinier Timman, Arie C van der Ham, Imro Dawson, Jan A Charbon, Christoph Schuhmacher, André Mihaljevic, Jakob R Izbicki, Panagiotis Fikatas, Philip Knebel, René H Fortelny, Gert-Jan Kleinrensink, Johan F Lange, Hans J Jeekel

BACKGROUND Incisional hernia is a frequent long-term complication after abdominal surgery, with a prevalence greater than 30% in high-risk groups. The aim of the PRIMA trial was to evaluate the effectiveness of mesh reinforcement in high-risk patients, to prevent incisional hernia. METHODS We did a multicentre, double-blind, randomised controlled trial at 11 hospitals in Austria, Germany, and...

2008
David Kadleček

This thesis provides a novel approach to single-agent architecture design. The primary motivation behind this research was to test the possibility of integrating rigorous methods of reinforcement learning and control engineering with a behavioral (ethological) approach to agent technology. This work deals with a single-agent architecture, rather than modeling a multi-agent system. The main outcome of ...

2018
Elien Segers, Tom Beckers, Hilde Geurts, Laurence Claes, Marina Danckaerts, Saskia van der Oord

Citation: Segers E, Beckers T, Geurts H, Claes L, Danckaerts M and van der Oord S (2018) Working Memory and Reinforcement Schedule Jointly Determine Reinforcement Learning in Children: Potential Implications for Behavioral Parent Training. Front. Psychol. 9:394. doi: 10.3389/fpsyg.2018.00394

2013
Robert Mark

In order to effectively teach new skills, it is important to identify ways in which to reinforce the behavior. One important aspect of reinforcement is the way in which the reinforcer is delivered upon the completion of the task. Direct and indirect reinforcement are examples of two different contingencies of reinforcement, each associated with different stimulus arrangements. Direct reinforcem...

Journal: Journal of Applied Behavior Analysis 2009
Kevin C Luczynski, Gregory P Hanley

Discovering whether children prefer reinforcement via a contingency or independent of their behavior is important considering the ubiquity of these programmed schedules of reinforcement. The current study evaluated the efficacy of and preference for social interaction within differential reinforcement of alternative behavior (DRA) and noncontingent reinforcement (NCR) schedules with typically d...

Journal: Science 1981
J E Mazur

Optimization theory states that organisms behave in a way that maximizes reinforcement or "value." In a two-response situation, pigeons' response proportions approximately equaled reinforcement proportions, even when this behavior pattern substantially decreased the rate of reinforcement. Optimization or reinforcement maximization was not supported as the basic mechanism underlying choice behav...

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence 2023

Multi-arm bandit (MAB) and stochastic linear bandit (SLB) are important models in reinforcement learning, and it is well known that classical algorithms for bandits with time horizon T suffer a regret of at least the square root of T. In this paper, we study MAB and SLB with quantum reward oracles and propose quantum algorithms for both models with regrets of order polylog(T), exponentially improving the dependence in terms of T. To the best of our knowledge, this is the first provable spe...

1994
Holly A. Yanco

Cooperating robots can benefit from communication. Our robots create their own adaptable synthetic robot languages (ASRLs). We have shown that robots can develop “basic”, context dependent, and compositional ASRLs using reinforcement learning techniques. (See (Yanco 1994) for a complete description of this work.) We have demonstrated that the robots are able to develop ASRLs using two different...
