Worst-case regret analysis of computationally budgeted online kernel selection

نویسندگان

چکیده

We study the problem of online kernel selection under computational constraints, where memory or time and prediction procedures is restricted to a fixed budget. In this paper, we analyze worst-case lower bounds on regret algorithm with subset observed examples, design algorithms enjoying corresponding upper bounds. also identify condition which constraints different from that constraints. To algorithms, reduce problems two sequential decision problems, is, expert advice multi-armed bandit an additional observation. Our invent some new techniques, such as sharing, hypothesis space discretization decoupled exploration-exploitation scheme. Numerical experiments regression classification are conducted verify our theoretical results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Oracle inequalities for computationally budgeted model selection

We analyze general model selection procedures using penalized empirical loss minimization under computational constraints. While classical model selection approaches do not consider computational aspects of performing model selection, we argue that any practical model selection procedure must not only trade off estimation and approximation error, but also the effects of the computational effort...

متن کامل

Worst Case Competitive Analysis of Online Algorithms for Conic Optimization

Online optimization covers problems such as online resource allocation, online bipartite matching, adwords (a central problem in e-commerce and advertising), and adwords with separable concave returns. We analyze the worst case competitive ratio of two primal-dual algorithms for a class of online convex (conic) optimization problems that contains the previous examples as special cases defined o...

متن کامل

Optimistic posterior sampling for reinforcement learning: worst-case regret bounds

We present an algorithm based on posterior sampling (aka Thompson sampling) that achieves near-optimal worst-case regret bounds when the underlying Markov Decision Process (MDP) is communicating with a finite, though unknown, diameter. Our main result is a high probability regret upper bound of Õ(D √ SAT ) for any communicating MDP with S states, A actions and diameter D, when T ≥ SA. Here, reg...

متن کامل

Posterior sampling for reinforcement learning: worst-case regret bounds

We present an algorithm based on posterior sampling (aka Thompson sampling) that achieves near-optimal worst-case regret bounds when the underlying Markov Decision Process (MDP) is communicating with a finite, though unknown, diameter. Our main result is a high probability regret upper bound of Õ(D √ SAT ) for any communicating MDP with S states, A actions and diameter D, when T ≥ SA. Here, reg...

متن کامل

Online (Budgeted) Social Choice

We consider classic social choice problems in an online setting. In the problems we consider, a decision-maker must select a subset of candidates in accordance to reported preferences, e.g. to maximize the value of a scoring rule. However, agent preferences cannot be accessed directly; rather, agents arrive one at a time to report their preferences, and each agent cares only about those candida...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Machine Learning

سال: 2022

ISSN: ['0885-6125', '1573-0565']

DOI: https://doi.org/10.1007/s10994-021-06082-8