atari

Learning to predict where to look in interactive environments using deep recurrent q-learning

Journal: :CoRR 2016

Seyed Sajad Mousavi Ali Borji Nasser Mozayani

Bottom-Up (BU) saliency models do not perform well in complex interactive environments where humans are actively engaged in tasks (e.g., sandwich making and playing the video games). In this paper, we leverage Reinforcement Learning (RL) to highlight task-relevant locations of input frames. We propose a soft attention mechanism combined with the Deep Q-Network (DQN) model to teach an RL agent h...

متن کامل

The XMM Cluster Survey: Present status and latest results

2013

P. J. Rooney M. Sahlén J. P. Stott

1 Centro de Astrofı́sica, Universidade do Porto, Rua das Estrelas, 4150-762 Porto, Portugal 2 Departamento de Fı́sica e Astronomia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 687, 4169-007 Porto, Portugal 3 Astronomy Centre, University of Sussex, Falmer, Brighton BN1 9QH, UK 4 Department of Physics and Astronomy, Texas A&M University, College Station, TX 77845-4242, USA 5 ...

متن کامل

3 Neutrinos and the Cosmos

2006

Boris KAYSER

Gustavo Branco’s distinguished contributions include leading ones to neutrino physics, CP violation, and leptogenesis. To plan the future experimental exploration of some of the physics to which Prof. Branco has contributed, and which is near and dear to his heart, several studies focusing on neutrinos have been carried out. I would like to report on one of these, sponsored by four Divisions of...

متن کامل

Searching for the CP - Violation Associated with Majorana Neutrinos

2001

S. Pascoli S. T. Petcov L. Wolfenstein

The effective Majorana mass which determines the rate of the neutrinoless double beta ((ββ) 0ν-)decay, |< m >| , is considered in the case of three-neutrino mixing and massive Majo-rana neutrinos. Assuming a rather precise determination of the parameters characterizing the neutrino oscillation solutions of the solar and atmospheric neutrino problems has been made, we discuss the information a m...

متن کامل

Measurement of charm production in two-photon processes using inclusive lepton events at TRISTAN

2007

T. Aso H. Miyata T. Sasaki K. Ohkubo M. Oyoshi M. Sato N. Takashimizu M. H. Lee F. Liu A. Maki T. Omori S. Terada P. Kirk T. J. Wang A. Abashian K. Gotow L. Piilonen S. K. Choi C. Rosenfeld S. Behari S. Kobayashi A. Murakami M. Yang A. Bodek B. J. Kim J. S. Kang H. S. Ahn

Measurements of open charm production in photon-photon collisions made with the AMY detector at TRISTAN are reported. Charmed hadrons were identi ed by detecting the high momentum muons or electrons from their semileptonic decays. The data sample corresponds to an integrated luminosity of 275 pb 1 at an average center of mass energy of 58 GeV. Results are presented in the form of cross sections...

متن کامل

Compress and Control

2015

Joel Veness Marc G. Bellemare Marcus Hutter Alvin Chua Guillaume Desjardins

This paper describes a new information-theoretic policy evaluation technique for reinforcement learning. This technique converts any compression or density model into a corresponding estimate of value. Under appropriate stationarity and ergodicity conditions, we show that the use of a sufficiently powerful model gives rise to a consistent value function estimator. We also study the behavior of ...

متن کامل

A Comparison between Unit and Branch Banking: Australian Evidence on Portfolio Diversification and Branch Specialization, 1860-1930

2000

Andrew J. Seltzer David Varney Jeff Frank Diana Weinhold

This paper examines the consequences of branch banking for the Australian economy. There is little evidence to show that branching increased the stability of Australian banking. In 1893 Australia suffered the worst panic ever in a branch banking country. During the crisis, more extensively branched banks were more likely to suspend payments. However, it is shown that branching increased the pro...

متن کامل

Planning and Learning with Adaptive Lookahead

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2023

Some of the most powerful reinforcement learning frameworks use planning for action selection. Interestingly, their horizon is either fixed or determined arbitrarily by state visitation history. Here, we expand beyond naive and propose a theoretically justified strategy adaptive selection as function state-dependent value estimate. We two variants lookahead analyze trade-off between iteration c...

متن کامل

EFEKTIVITAS PENGELOLAAN DANA DESA (DD) TERHADAP KESEJAHTERAAN MASYARAKAT Studi Di Desa Atari Jaya Kecamatan Lalembuu Kabupaten Konawe Selatan

Journal: :Welvaart : Jurnal Ilmu Kesejahteraan Sosial 2020

متن کامل

Optimal Portfolio Diversication and Product-Market Interactions

2004

Gilles Chemla

Although most shareholders hold diversied portfolios, the corporate nance literature postulates that shareholders maximise rm value, while managers sometimes do not. We argue to the contrary that undiversied managers may care more about rm-level risk and return than about the value of their shareholdersdiversied portfolio. These two objectives may di¤er in presence of product-market inte...

متن کامل