sequential exploration approach1

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

2008

Kirill Dyagilev Shie Mannor Nahum Shimkin

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose here the assumption that set of possible parameters is finite, and consider the discounted return. We propose an on-line algorithm for learning in such parameterized models, dubbed the Parameter Elimination (PEL) algor...

متن کامل

Bayesian and Approximate Bayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem

2007

In this paper we investigate human exploration/exploitation behavior in sequential-decision making tasks. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief ...

متن کامل

Accelerating the Requirement Space Exploration through Coarse-Grained Parallel Execution

2011

Zhongwei Lin Yiping Yao

The design and analysis of complex systems need to determine suitable configurations for meeting requirement constraints. The Monotonic Indices Space (MIS) method is a useful approach for monotonic requirement space exploration. However, the method is highly time and memory-Consuming. Aiming to the problem of low efficiency of sequential MIS method, this paper introduces a coarse-grained parall...

متن کامل

Bayesian and Approximate Bayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem

2007

Daniel Acuña Paul Schrater

In this paper we investigate human exploration/exploitation behavior in a sequential-decision making task. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief...

متن کامل

A Comparative Study of Feature Selection

2011

Satrya Fajri Pratama Azah Kamilah Muda Yun-Huoy Choo Noor Azilah Muda

Handwriting is individualistic. The uniqueness of shape and style of handwriting can be used to identify the significant features in authenticating the author of writing. Acquiring these significant features leads to an important research in Writer Identification domain. This paper is meant to explore the usage of feature selection in Writer Identification. Various filter and wrapper feature se...

متن کامل

Boltzmann Exploration Done Right

2017

Nicolò Cesa-Bianchi Claudio Gentile Gergely Neu Gábor Lugosi

Boltzmann exploration is a classic strategy for sequential decision-making under uncertainty, and is one of the most standard tools in Reinforcement Learning (RL). Despite its widespread use, there is virtually no theoretical understanding about the limitations or the actual benefits of this exploration scheme. Does it drive exploration in a meaningful way? Is it prone to misidentifying the opt...

متن کامل

Information-Theoretic Approach to Efficient Adaptive Path Planning for Mobile Robotic Environmental Sensing

Journal: :CoRR 2009

Kian Hsiang Low John M. Dolan Pradeep K. Khosla

Recent research in robot exploration and mapping has focused on sampling environmental hotspot fields. This exploration task is formalized by Low, Dolan, and Khosla (2008) in a sequential decision-theoretic planning under uncertainty framework called MASP. The time complexity of solving MASP approximately depends on the map resolution, which limits its use in large-scale, high-resolution explor...

متن کامل

Sedimentary Evolution Characteristics of Fine-Grained Lithofacies under the High-Resolution Isochronous Shelf System: Insights from the Wufeng-Longmaxi Shales in the Sichuan Basin

Journal: :Lithosphere 2021

Abstract The deposition and evolution of fine-grained sediments is a hot topic in sedimentary rock studies important for accurately evaluating shale gas sweet spots. In this paper, the characteristics Wufeng-Longmaxi shales, major targets Chinese exploration, were studied by using core observations, thin section analyses, scanning electron microscopy, geochemical analysis, fossil identification...

متن کامل

Risk-aware multi-armed bandit problem with application to portfolio selection

2017

Xiaoguang Huo Feng Fu

Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the exploration versus exploitation dilemma, and therefore...

متن کامل

Sequential decision making in repeated coalition formation under uncertainty

2008

Georgios Chalkiadakis Craig Boutilier

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learning framework is developed for this problem when coalitions are formed (and tasks undertaken) repeatedly: not only does the model allow agents to refine their beliefs about the types of others, but uses value of informat...

متن کامل