نتایج جستجو برای: minimax regret

تعداد نتایج: 12162  

2011
Tyler Lu Craig Boutilier

While voting schemes provide an effective means for aggregating preferences, methods for the effective elicitation of voter preferences have received little attention. We address this problem by first considering approximate winner determination when incomplete voter preferences are provided. Exploiting natural scoring metrics, we use max regret to measure the quality or robustness of proposed ...

2008
Jacob D. Abernethy Peter L. Bartlett Alexander Rakhlin Ambuj Tewari

A number of learning problems can be cast as an Online Convex Game: on each round, a learner makes a prediction x from a convex set, the environment plays a loss function f , and the learner’s long-term goal is to minimize regret. Algorithms have been proposed by Zinkevich, when f is assumed to be convex, and Hazan et al., when f is assumed to be strongly convex, that have provably low regret. ...

Journal: :J. Economic Theory 2011
Jörg Stoye

This paper unifies and extends the recent axiomatic literature on minimax regret. It compares several models of minimax regret, shows how to characterize the according choice correspondences in a unified setting, extends one of them to choice from convex (through randomization) sets, and connects them by defining a behavioral notion of perceived ambiguity. Substantively, a main idea is to behav...

2013
Brad Gulko Samantha Leung

We present a new decision rule, maximin safety, that seeks to maintain a large margin from the worst outcome, in much the same way minimax regret seeks to minimize distance from the best. We argue that maximin safety is valuable both descriptively and normatively. Descriptively, maximin safety explains the well-known decoy effect, in which the introduction of a dominated option changes preferen...

2008
Jacob Abernethy Peter L. Bartlett Alexander Rakhlin Ambuj Tewari

A number of learning problems can be cast as an Online Convex Game: on each round, a learner makes a prediction x from a convex set, the environment plays a loss function f , and the learner’s long-term goal is to minimize regret. Algorithms have been proposed by Zinkevich, when f is assumed to be convex, and Hazan et al., when f is assumed to be strongly convex, that have provably low regret. ...

Journal: :CoRR 2017
Kohei Miyaguchi

The normalized maximum likelihood (NML) is one of the most important distribution in coding theory and statistics. NML is the unique solution (if exists) to the pointwise minimax regret problem. However, NML is not defined even for simple family of distributions such as the normal distributions. Since there does not exist any meaningful minimax-regret distribution for such case, it has been poi...

Journal: :CoRR 2017
Susan Athey Stefan Wager

We consider the problem of using observational data to learn treatment assignment policies that satisfy certain constraints specified by a practitioner, such as budget, fairness, or functional form constraints. This problem has previously been studied in economics, statistics, and computer science, and several regret-consistent methods have been proposed. However, several key analytical compone...

2015
Peter L. Bartlett Wouter M. Koolen Alan Malek Eiji Takimoto Manfred K. Warmuth

We consider a linear regression game in which the covariates are known in advance: at each round, the learner predicts a real-value, the adversary reveals a label, and the learner incurs a squared error loss. The aim is to minimize the regret with respect to linear predictions. For a variety of constraints on the adversary’s labels, we show that the minimax optimal strategy is linear, with a pa...

Journal: :Artif. Intell. 2006
Craig Boutilier Relu Patrascu Pascal Poupart Dale Schuurmans

In many situations, a set of hard constraints encodes the feasible configurations of some system or product over which multiple users have distinct preferences. However, making suitable decisions requires that the preferences of a specific user for different configurations be articulated or elicited, something generally acknowledged to be onerous. We address two problems associated with prefere...

2012
Gábor Bartók Navid Zolghadr Csaba Szepesvári

We present a new anytime algorithm that achieves near-optimal regret for any instance of finite stochastic partial monitoring. In particular, the new algorithm achieves the minimax regret, within logarithmic factors, for both “easy” and “hard” problems. For easy problems, it additionally achieves logarithmic individual regret. Most importantly, the algorithm is adaptive in the sense that if the...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید