نتایج جستجو برای: atari

تعداد نتایج: 829  

Journal: :CoRR 2016
Seyed Sajad Mousavi Ali Borji Nasser Mozayani

Bottom-Up (BU) saliency models do not perform well in complex interactive environments where humans are actively engaged in tasks (e.g., sandwich making and playing the video games). In this paper, we leverage Reinforcement Learning (RL) to highlight task-relevant locations of input frames. We propose a soft attention mechanism combined with the Deep Q-Network (DQN) model to teach an RL agent h...

2013
P. J. Rooney M. Sahlén J. P. Stott

1 Centro de Astrofı́sica, Universidade do Porto, Rua das Estrelas, 4150-762 Porto, Portugal 2 Departamento de Fı́sica e Astronomia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 687, 4169-007 Porto, Portugal 3 Astronomy Centre, University of Sussex, Falmer, Brighton BN1 9QH, UK 4 Department of Physics and Astronomy, Texas A&M University, College Station, TX 77845-4242, USA 5 ...

2006
Boris KAYSER

Gustavo Branco’s distinguished contributions include leading ones to neutrino physics, CP violation, and leptogenesis. To plan the future experimental exploration of some of the physics to which Prof. Branco has contributed, and which is near and dear to his heart, several studies focusing on neutrinos have been carried out. I would like to report on one of these, sponsored by four Divisions of...

2001
S. Pascoli S. T. Petcov L. Wolfenstein

The effective Majorana mass which determines the rate of the neutrinoless double beta ((ββ) 0ν-)decay, |< m >| , is considered in the case of three-neutrino mixing and massive Majo-rana neutrinos. Assuming a rather precise determination of the parameters characterizing the neutrino oscillation solutions of the solar and atmospheric neutrino problems has been made, we discuss the information a m...

2007
T. Aso H. Miyata T. Sasaki K. Ohkubo M. Oyoshi M. Sato N. Takashimizu M. H. Lee F. Liu A. Maki T. Omori S. Terada P. Kirk T. J. Wang A. Abashian K. Gotow L. Piilonen S. K. Choi C. Rosenfeld S. Behari S. Kobayashi A. Murakami M. Yang A. Bodek B. J. Kim J. S. Kang H. S. Ahn

Measurements of open charm production in photon-photon collisions made with the AMY detector at TRISTAN are reported. Charmed hadrons were identi ed by detecting the high momentum muons or electrons from their semileptonic decays. The data sample corresponds to an integrated luminosity of 275 pb 1 at an average center of mass energy of 58 GeV. Results are presented in the form of cross sections...

2015
Joel Veness Marc G. Bellemare Marcus Hutter Alvin Chua Guillaume Desjardins

This paper describes a new information-theoretic policy evaluation technique for reinforcement learning. This technique converts any compression or density model into a corresponding estimate of value. Under appropriate stationarity and ergodicity conditions, we show that the use of a sufficiently powerful model gives rise to a consistent value function estimator. We also study the behavior of ...

2000
Andrew J. Seltzer David Varney Jeff Frank Diana Weinhold

This paper examines the consequences of branch banking for the Australian economy. There is little evidence to show that branching increased the stability of Australian banking. In 1893 Australia suffered the worst panic ever in a branch banking country. During the crisis, more extensively branched banks were more likely to suspend payments. However, it is shown that branching increased the pro...

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2023

Some of the most powerful reinforcement learning frameworks use planning for action selection. Interestingly, their horizon is either fixed or determined arbitrarily by state visitation history. Here, we expand beyond naive and propose a theoretically justified strategy adaptive selection as function state-dependent value estimate. We two variants lookahead analyze trade-off between iteration c...

2004
Gilles Chemla

Although most shareholders hold diversi…ed portfolios, the corporate …nance literature postulates that shareholders maximise …rm value, while managers sometimes do not. We argue to the contrary that undiversi…ed managers may care more about …rm-level risk and return than about the value of their shareholders’diversi…ed portfolio. These two objectives may di¤er in presence of product-market inte...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید