minimax regret

نتایج جستجو برای: minimax regret

تعداد نتایج: 12162 فیلتر نتایج به سال:

Asymptotically minimax regret for exponential families

2005

Jun-ichi Takeuchi Andrew R. Barron

We study the problem of data compression, gambling and prediction of a sequence x = x1x2...xn from a certain alphabet X , in terms of regret and redundancy with respect to a general exponential family. In particular, we evaluate the regret of the Bayes mixture density and show that it asymptotically achieves their minimax values when variants of Jeffreys prior are used. Keywords— universal codi...

متن کامل

Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs)

Journal: :J. Artif. Intell. Res. 2017

Asrar Ahmed Pradeep Varakantham Meghna Lowalekar Yossiri Adulyasak Patrick Jaillet

Markov Decision Processes (MDPs) are an effective model to represent decision processes in the presence of transitional uncertainty and reward tradeoffs. However, due to the difficulty in exactly specifying the transition and reward functions in MDPs, researchers have proposed uncertain MDP models and robustness objectives in solving those models. Most approaches for computing robust policies h...

متن کامل

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

2017

Nicolò Cesa-Bianchi Pierre Gaillard Claudio Gentile Sébastien Gerchinovitz

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipsch...

متن کامل

A Rational Decision Criterion, the Iterated Minimax Regret Criterion

Journal: :The Annals of Statistics 1974

متن کامل

Minimax Regret 1-Median Problem in Dynamic Path Networks

Journal: :Theory of Computing Systems 2017

متن کامل

On Minimaxity of Follow the Leader Strategy in the Stochastic Setting

2016

Wojciech Kotlowski

We consider the setting of prediction with expert advice with an additional assumption that each expert generates its losses i.i.d. according to some distribution. We first identify a class of “admissible” strategies, which we call permutation invariant, and show that every strategy outside this class will perform not better than some permutation invariant strategy. We then show that when the l...

متن کامل

Minimax Regret and Efficient Bargaining under Uncertainty

Journal: :Games and Economic Behavior 2001

Walter Bossert Hans Peters

Bargaining under uncertainty is modeled by the assumption that there are several possible states of nature, each of which is identified with a bargaining problem. We characterize bargaining solutions which generate ex ante efficient combinations of outcomes under the assumption that the bargainers have minimax regret preferences. For the case of two bargainers a class of monotone utopia-path so...

متن کامل

Regret based Robust Solutions for Uncertain Markov Decision Processes

2013

Asrar Ahmed Pradeep Varakantham Yossiri Adulyasak Patrick Jaillet

In this paper, we seek robust policies for uncertain Markov Decision Processes (MDPs). Most robust optimization approaches for these problems have focussed on the computation of maximin policies which maximize the value corresponding to the worst realization of the uncertainty. Recent work has proposed minimax regret as a suitable alternative to the maximin objective for robust optimization. Ho...

متن کامل

A Geometric Traversal Algorithm for Reward-Uncertain MDPs

2011

Eunsoo Oh Kee-Eung Kim

Markov decision processes (MDPs) are widely used in modeling decision making problems in stochastic environments. However, precise specification of the reward functions in MDPs is often very difficult. Recent approaches have focused on computing an optimal policy based on the minimax regret criterion for obtaining a robust policy under uncertainty in the reward function. One of the core tasks i...

متن کامل

Regret-based Incremental Partial Revelation Mechanisms

2006

Nathanael Hyafil Craig Boutilier

Classic direct mechanisms suffer from the drawback of requiring full type (or utility function) revelation from participating agents. In complex settings with multi-attribute utility, assessing utility functions can be very difficult, a problem addressed by recent work on preference elicitation. In this work we propose a framework for incremental, partial revelation mechanisms and study the use...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید