minimax regret

نتایج جستجو برای: minimax regret

تعداد نتایج: 12162 فیلتر نتایج به سال:

Minimax Regret Sink Location Problem in Dynamic Tree Networks with Uniform Capacity

2014

Yuya Higashikawa Mordecai J. Golin Naoki Katoh

This paper addresses the minimax regret sink location problem in dynamic tree networks. In our model, a dynamic tree network consists of an undirected tree with positive edge lengths and uniform edge capacity, and the vertex supply which is a positive value is unknown but only the interval of supply is known. A particular realization of supply to each vertex is called a scenario. Under any scen...

متن کامل

4 Learning , Regret minimization , and Equilibria

2007

A. Blum Y. Mansour

Many situations involve repeatedly making decisions in an uncertain environment: for instance, deciding what route to drive to work each day, or repeated play of a game against an opponent with an unknown strategy. In this chapter we describe learning algorithms with strong guarantees for settings of this type, along with connections to game-theoretic equilibria when all players in a system are...

متن کامل

Statistical Treatment Choice Based on Asymmetric Minimax Regret Criteria

2009

Aleksey Tetenov

This paper studies the problem of treatment choice between a status quo treatment with a known outcome distribution and an innovation whose outcomes are observed only in a representative finite sample. I evaluate statistical decision rules, which are functions that map sample outcomes into the planner’s treatment choice for the population, based on regret, which is the expected welfare loss due...

متن کامل

Minmax regret solutions for minimax optimization problems with uncertainty

Journal: :Oper. Res. Lett. 2000

Igor Averbakh

We propose a general approach for nding minmax regret solutions for a class of combinatorial optimization problems with an objective function of minimax type and uncertain objective function coe cients. The approach is based on reducing a problem with uncertainty to a number of problems without uncertainty. The method is illustrated on bottleneck combinatorial optimization problems, minimax mul...

متن کامل

Asymptotic minimax regret for data compression, gambling, and prediction

Journal: :IEEE Trans. Information Theory 2000

Qun Xie Andrew R. Barron

For problems of data compression, gambling, and prediction of individual sequences 1 the following questions arise. Given a target family of probability mass functions ( 1 ), how do we choose a probability mass function ( 1 ) so that it approximately minimizes the maximum regret /belowdisplayskip10ptminus6pt max (log 1 ( 1 ) log 1 ( 1 )̂) and so that it achieves the best constant in the asymptot...

متن کامل

Minimax Policy for Heavy-Tailed Bandits

Journal: :IEEE Control Systems Letters 2021

We study the stochastic Multi-Armed Bandit (MAB) problem under worst-case regret and heavy-tailed reward distribution. modify minimax policy MOSS for sub-Gaussian distribution by using saturated empirical mean to design a new algorithm called Robust MOSS. show that if moment of order $1+\epsilon $ exists, then refined strategy has matching lower bound while maintaining distribution-dependent lo...

متن کامل

The Maximal Domain for the Revelation Principle when Preferences are Menu Dependent∗

2008

Rene Saran Takashi Kunimoto Ronald Peeters Roberto Serrano

We extend the domain of preferences to include menu-dependent preferences and characterize the maximal subset of this domain in which the revelation principle holds. Minimax-regret preference is shown to be outside this subset.

متن کامل

Statistical treatment choice based on asymmetric minimax regret criteria

Journal: :Journal of Econometrics 2012

متن کامل

Do People Vote on the Basis of Minimax Regret?

Journal: :Political Research Quarterly 1995

متن کامل

Batched Bandit Problems

2015

Vianney Perchet Philippe Rigollet Sylvain Chassang Erik Snowberg

Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. We propose a simple policy that operates under this contraint and show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optima...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید