نتایج جستجو برای: atari
تعداد نتایج: 829 فیلتر نتایج به سال:
The Quantum RNA/DNA Theory of Cosmos and Consciousness Ernest Lawrence Rossi, Ph.D. Kathryn Lane Rossi, Ph.D. [email protected] [email protected] Professor of Neuroscience: Professor of Psychotherapy: Mind/Body Institute, Solopaca, Italy Mind/Body Institute, Solopaca, Italy Directors of the Milton H. Erickson Institute of the California Central Coast, 125 Howard Avenue, Los Osos, CA ...
Deep Reinforcement Learning methods have achieved state of the art performance in learning control policies for the games in the Atari 2600 domain. One of the important parameters in the Arcade Learning Environment (ALE, [Bellemare et al., 2013]) is the frame skip rate. It decides the granularity at which agents can control game play. A frame skip value of k allows the agent to repeat a selecte...
Der Beitrag geht von der Annahme aus, dass Medienbildung wesentliche Bestandteile Informatischer Bildung umfasst. Mit dem Portfolio Medienkompetenz wird ein Instrument vorgestellt, das Schülerinnen und Schülern die Möglichkeit bietet, ihre Aktivitäten und individuellen Leistungen im Zusammenhang mit Medien und somit auch zu Bereichen der Informatischen Bildung zu dokumentieren. Dieses Portfolio...
Recent improvements in deep reinforcement learning have allowed to solve problems in many 2D domains such as Atari games. However, in complex 3D environments, numerous learning episodes are required which may be too time consuming or even impossible especially in real-world scenarios. We present a new architecture to combine external knowledge and deep reinforcement learning using only visual i...
Motivated by vision-based reinforcement learning (RL) problems, in particular Atari games from the recent benchmark Aracade Learning Environment (ALE), we consider spatio-temporal prediction problems where future image-frames depend on control variables or actions as well as previous frames. While not composed of natural scenes, frames in Atari games are high-dimensional in size, can involve te...
In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time, which is already a problem in single task learning. We have developed a new distributed agent IMPALA (Importance-Weighted Actor Learner Architecture) that can scale to thousands...
We introduce the first deep reinforcement learning agent that learns to beat Atari games with the aid of natural language instructions. The agent uses a multimodal embedding between environment observations and natural language to self-monitor progress through a list of English instructions, granting itself additional reward for completing instructions in addition to increasing the game score. ...
IW(1) is a simple search algorithm that assumes that states can be characterized in terms of a set of boolean features or atoms. IW(1) consists of a standard breadth-first search with one variation: a newly generated state is pruned if it does not make a new atom true. Thus, while a breadth-first search runs in time that is exponential in the number of atoms, IW(1) runs in linear time. Variatio...
Bellemare et al. (2016) introduced the notion of a pseudo-count, derived from a density model, to generalize count-based exploration to nontabular reinforcement learning. This pseudocount was used to generate an exploration bonus for a DQN agent and combined with a mixed Monte Carlo update was sufficient to achieve state of the art on the Atari 2600 game Montezuma’s Revenge. We consider two que...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید