regret minimization

نتایج جستجو برای: regret minimization

تعداد نتایج: 37822 فیلتر نتایج به سال:

Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2021

This paper studies regret minimization with randomized value functions in reinforcement learning. In tabular finite-horizon Markov Decision Processes, we introduce a clipping variant of one classical Thompson Sampling (TS)-like algorithm, least-squares iteration (RLSVI). Our $\tilde{\mathrm{O}}(H^2S\sqrt{AT})$ high-probability worst-case bound improves the previous sharpest bounds for RLSVI and...

متن کامل

Online Parameter Tuning Methods for Adaptive Ecms Control Strategies in Hybrid Electric Vehicles

2014

Sascha Geulen

This paper presents two adaptive ECMS schemes which compared to the standard ECMS formulation are extended by an online learning algorithm based on the so-called regret minimization paradigm. While in the first approach the Shrinking Dartboard algorithm is applied to tune the parameter values of the ECMS adaptation rule, in the second approach the Weighted Fractional algorithm averages the resu...

متن کامل

Novel Selection Methods for Monte-carlo Tree Search

2014

Tom Pepels

Preface In this thesis I present the result of my investigation into regret minimization for Monte-Carlo Tree Search. The thesis presents the motivation, background, and formal definition of a novel search technique based on minimizing both simple and cumulative regret in a game tree: Hybrid MCTS (H-MCTS). The technique minimizes the two types of regret in a single search-tree. This ensures tha...

متن کامل

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games Using Baselines

Journal: :Proceedings of the AAAI Conference on Artificial Intelligence 2019

متن کامل

Investigating Consumer Preferences for Sustainable Packaging Through a Different Behavioural Approach: A Random Regret Minimization Application

Journal: :Environmental and Resource Economics 2023

Abstract Plastic pollution causing the near-permanent contamination of environment is a preeminent concern. The largest market sector for plastic resins packaging, and food industry plays major role in producing packaging waste. Therefore, gradual switch system towards pro-environmental strategies required to contain waste issue. To this extent, study aimed investigate how consumers relatively ...

متن کامل

Random Regret Minimization and Random Utility Maximization in the Presence of Preference Heterogeneity: An Empirical Contrast

Journal: :Journal of Transportation Engineering 2016

متن کامل

Bipartite Ranking through Minimization of Univariate Loss

2011

Wojciech Kotlowski Krzysztof Dembczynski Eyke Hüllermeier

Minimization of the rank loss or, equivalently, maximization of the AUC in bipartite ranking calls for minimizing the number of disagreements between pairs of instances. Since the complexity of this problem is inherently quadratic in the number of training examples, it is tempting to ask how much is actually lost by minimizing a simple univariate loss function, as done by standard classificatio...

متن کامل

Consistent Multilabel Ranking through Univariate Losses

2012

Krzysztof Dembczynski Wojciech Kotlowski Eyke Hüllermeier

We consider the problem of rank loss minimization in the setting of multilabel classification, which is usually tackled by means of convex surrogate losses defined on pairs of labels. Very recently, this approach was put into question by a negative result showing that commonly used pairwise surrogate losses, such as exponential and logistic losses, are inconsistent. In this paper, we show a pos...

متن کامل

Evaluation of Influencing Factors on Tubal Sterilization Regret: A Cross-Sectional Study

Journal: International Journal of Fertility and Sterility 2018

Anoshirvan Kazemnejad, Eesa Mohamadi, Saeideh Ziaei, Shahideh Jahanian Sadatmahalleh,

Objective The aim of this study is to evaluate the menstrual pattern, sexual function, and anxiety, and depression in women with poststerilization regret, and potential influencing factors for regret following TL in Iranian women. MaterialsAndMethods In this cross-sectional study, 166 women with TL were subdivided into two groups including women with poststerilization regret (n=41) and women wi...

متن کامل

No-Regret Learnability for Piecewise Linear Losses

Journal: :CoRR 2014

Arthur Flajolet Patrick Jaillet

In the convex optimization approach to online regret minimization, many methodshave been developed to guarantee a O(√T ) bound on regret for subdifferentiableconvex loss functions with bounded subgradients, by using a reduction to linearloss functions. This suggests that linear loss functions tend to be the hardest onesto learn against, regardless of the underlying d...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید