individual regret

نتایج جستجو برای: individual regret

تعداد نتایج: 446840 فیلتر نتایج به سال:

Varieties of Regret in Online Prediction

2004

Casey Marks Amy Greenwald David Gondek

We present a general framework for analyzing regret in the online prediction problem. We develop this from sets of linear transformations of strategies. We establish relationships among the varieties of regret and present a class of regret-matching algorithms. Finally we consider algorithms that exhibit the asymptotic no-regret property. Our main results are an analysis of observed regret in ex...

متن کامل

Sensitive Error Correcting Output Codes

2005

Sensitive error correcting output codes are a reduction from cost sensitive classi cation to binary classi cation. They are a modi cation of error correcting output codes [3] which satisfy an additional property: regret for binary classi cation implies at most 2 l2 regret for cost-estimation. This has several implications: 1) Any 0/1 regret minimizing online algorithm is (via the reduction) a r...

متن کامل

The Interplay Between Stability and Regret in Online Learning

Journal: :CoRR 2012

Ankan Saha Prateek Jain Ambuj Tewari

This paper considers the stability of online learning algorithms and its implications for learnability (bounded regret). We introduce a novel quantity called forward regret that intuitively measures how good an online learning algorithm is if it is allowed a one-step look-ahead into the future. We show that given stability, bounded forward regret is equivalent to bounded regret. We also show th...

متن کامل

A General Class of No-Regret Learning Algorithms and Game-Theoretic Equilibria

2003

Amy Greenwald Amir Jafari

A general class of no-regret learning algorithms, called no-Φ-regret learning algorithms, is defined which spans the spectrum from no-external-regret learning to no-internal-regret learning and beyond. The set Φ describes the set of strategies to which the play of a given learning algorithm is compared. A learning algorithm satisfies no-Φ-regret if no regret is experienced for playing as the al...

متن کامل

Dueling Bandits with Weak Regret

2017

Bangrui Chen Peter I. Frazier

We consider online content recommendation with implicit feedback through pairwise comparisons, formalized as the so-called dueling bandit problem. We study the dueling bandit problem in the Condorcet winner setting, and consider two notions of regret: the more well-studied strong regret, which is 0 only when both arms pulled are the Condorcet winner; and the less well-studied weak regret, which...

متن کامل

A consistent deterministic regression tree for non-parametric prediction of time series

Journal: :CoRR 2014

Pierre Gaillard Paul Baudin

We study online prediction of bounded stationary ergodic processes. To do so, we consider the setting of prediction of individual sequences and build a deterministic regression tree that performs asymptotically as well as the best L-Lipschitz constant predictors. Then, we show why the obtained regret bound entails the asymptotical optimality with respect to the class of bounded stationary ergod...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید

Varieties of Regret in Online Prediction

Sensitive Error Correcting Output Codes

The Interplay Between Stability and Regret in Online Learning

A General Class of No-Regret Learning Algorithms and Game-Theoretic Equilibria

Dueling Bandits with Weak Regret

A consistent deterministic regression tree for non-parametric prediction of time series

Abortion and regret

Live Without Regret

Learning from regret

Note—Testing Regret