Worst-case regret analysis of computationally budgeted online kernel selection
نویسندگان
چکیده
We study the problem of online kernel selection under computational constraints, where memory or time and prediction procedures is restricted to a fixed budget. In this paper, we analyze worst-case lower bounds on regret algorithm with subset observed examples, design algorithms enjoying corresponding upper bounds. also identify condition which constraints different from that constraints. To algorithms, reduce problems two sequential decision problems, is, expert advice multi-armed bandit an additional observation. Our invent some new techniques, such as sharing, hypothesis space discretization decoupled exploration-exploitation scheme. Numerical experiments regression classification are conducted verify our theoretical results.
منابع مشابه
Oracle inequalities for computationally budgeted model selection
We analyze general model selection procedures using penalized empirical loss minimization under computational constraints. While classical model selection approaches do not consider computational aspects of performing model selection, we argue that any practical model selection procedure must not only trade off estimation and approximation error, but also the effects of the computational effort...
متن کاملWorst Case Competitive Analysis of Online Algorithms for Conic Optimization
Online optimization covers problems such as online resource allocation, online bipartite matching, adwords (a central problem in e-commerce and advertising), and adwords with separable concave returns. We analyze the worst case competitive ratio of two primal-dual algorithms for a class of online convex (conic) optimization problems that contains the previous examples as special cases defined o...
متن کاملOptimistic posterior sampling for reinforcement learning: worst-case regret bounds
We present an algorithm based on posterior sampling (aka Thompson sampling) that achieves near-optimal worst-case regret bounds when the underlying Markov Decision Process (MDP) is communicating with a finite, though unknown, diameter. Our main result is a high probability regret upper bound of Õ(D √ SAT ) for any communicating MDP with S states, A actions and diameter D, when T ≥ SA. Here, reg...
متن کاملPosterior sampling for reinforcement learning: worst-case regret bounds
We present an algorithm based on posterior sampling (aka Thompson sampling) that achieves near-optimal worst-case regret bounds when the underlying Markov Decision Process (MDP) is communicating with a finite, though unknown, diameter. Our main result is a high probability regret upper bound of Õ(D √ SAT ) for any communicating MDP with S states, A actions and diameter D, when T ≥ SA. Here, reg...
متن کاملOnline (Budgeted) Social Choice
We consider classic social choice problems in an online setting. In the problems we consider, a decision-maker must select a subset of candidates in accordance to reported preferences, e.g. to maximize the value of a scoring rule. However, agent preferences cannot be accessed directly; rather, agents arrive one at a time to report their preferences, and each agent cares only about those candida...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Machine Learning
سال: 2022
ISSN: ['0885-6125', '1573-0565']
DOI: https://doi.org/10.1007/s10994-021-06082-8