Search results for: root reinforcement
Number of results: 179037
No abstract available.
We propose an autoencoder (AE)-based transceiver for a wavelength division multiplexing (WDM) system impaired by hardware imperfections. We design our AE following the architecture of conventional communication systems. This enables us to initialize the AE-based transceiver to have similar performance to its conventional counterpart prior to training and improves the convergence rate. We first train the AE in a single-channel system, and show that it achiev...
BACKGROUND Incisional hernia is a frequent long-term complication after abdominal surgery, with a prevalence greater than 30% in high-risk groups. The aim of the PRIMA trial was to evaluate the effectiveness of mesh reinforcement in high-risk patients, to prevent incisional hernia. METHODS We did a multicentre, double-blind, randomised controlled trial at 11 hospitals in Austria, Germany, and...
This thesis provides a novel approach to single-agent architecture design. The primary motivation behind this research was to test the possibility of integrating rigorous methods of reinforcement learning and control engineering with a behavioral (ethology) approach to agent technology. This work deals with a single-agent architecture, rather than modeling a multi-agent system. The main outcome of ...
Citation: Segers E, Beckers T, Geurts H, Claes L, Danckaerts M and van der Oord S (2018) Working Memory and Reinforcement Schedule Jointly Determine Reinforcement Learning in Children: Potential Implications for Behavioral Parent Training. Front. Psychol. 9:394. doi: 10.3389/fpsyg.2018.00394
In order to effectively teach new skills, it is important to identify ways in which to reinforce the behavior. One important aspect of reinforcement is the way in which the reinforcer is delivered upon the completion of the task. Direct and indirect reinforcement are examples of two different contingencies of reinforcement, each associated with different stimulus arrangements. Direct reinforcem...
Discovering whether children prefer reinforcement via a contingency or independent of their behavior is important considering the ubiquity of these programmed schedules of reinforcement. The current study evaluated the efficacy of and preference for social interaction within differential reinforcement of alternative behavior (DRA) and noncontingent reinforcement (NCR) schedules with typically d...
Optimization theory states that organisms behave in a way that maximizes reinforcement or "value." In a two-response situation, pigeons' response proportions approximately equaled reinforcement proportions, even when this behavior pattern substantially decreased the rate of reinforcement. Optimization or reinforcement maximization was not supported as the basic mechanism underlying choice behav...
Multi-arm bandit (MAB) and stochastic linear bandit (SLB) are important models in reinforcement learning, and it is well known that classical algorithms for bandits with time horizon T suffer regret of at least the square root of T. In this paper, we study MAB and SLB with quantum reward oracles and propose algorithms for both with regrets of order polylog T, exponentially improving the dependence in terms of T. To the best of our knowledge, this is the first provable spe...
Cooperating robots can benefit from communication. Our robots create their own adaptable synthetic robot languages (ASRLs). We have shown that robots can develop “basic”, context dependent, and compositional ASRLs using reinforcement learning techniques. (See (Yanco 1994) for a complete description of this work.) We have demonstrated that the robots are able to develop ASRLs using two different...