minimax module

Search results for: minimax module

Number of results: 73366

Risk-Sensitive, Minimax, and Mixed Risk-Neutral/Minimax Control of Markov Decision Processes

1998

Stefano P. Coraluppi, Steven I. Marcus

This paper analyzes a connection between risk-sensitive and minimax criteria for discrete-time, finite-state Markov Decision Processes (MDPs). We synthesize optimal policies with respect to both criteria, both for finite horizon and discounted infinite horizon problems. A generalized decision-making framework is introduced, leading to stationary risk-sensitive and minimax optimal policies on the in ...

Full text

Models for Minimax Stochastic Linear Optimization Problems with Risk Aversion

Journal: Math. Oper. Res. 2010

Dimitris Bertsimas, Xuan Vinh Doan, Karthik Natarajan, Chung-Piaw Teo

We propose a semidefinite optimization (SDP) model for the class of minimax two-stage stochastic linear optimization problems with risk aversion. The distribution of second-stage random variables belongs to a set of multivariate distributions with known first and second moments. For the minimax stochastic problem with random objective, we provide a tight SDP formulation. The problem with random...

Full text

Fast Concurrent Reinforcement Learners

2001

Bikramjit Banerjee, Sandip Sen, Jing Peng

When several agents learn concurrently, the payoff received by an agent is dependent on the behavior of the other agents. As the other agents learn, the reward of one agent becomes non-stationary. This makes learning in multiagent systems more difficult than single-agent learning. A few methods, however, are known to guarantee convergence to equilibrium in the limit in such systems. In this pap...

Full text

Minimax Multi-Task Learning and a Generalized Loss-Compositional Paradigm for MTL

2012

Nishant A. Mehta, Dongryeol Lee, Alexander G. Gray

Since its inception, the modus operandi of multi-task learning (MTL) has been to minimize the task-wise mean of the empirical risks. We introduce a generalized loss-compositional paradigm for MTL that includes a spectrum of formulations as a subfamily. One endpoint of this spectrum is minimax MTL: a new MTL formulation that minimizes the maximum of the tasks’ empirical risks. Via a certain rela...
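The contrast drawn in this abstract, between minimizing the task-wise mean and minimizing the maximum of the tasks' empirical risks, can be illustrated with a minimal sketch. The function names and the risk values below are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from statistics import mean


def mean_mtl_objective(task_risks):
    """Task-wise mean of the empirical risks (the classical MTL objective)."""
    return mean(task_risks)


def minimax_mtl_objective(task_risks):
    """Maximum of the tasks' empirical risks (the minimax MTL objective)."""
    return max(task_risks)


# Hypothetical empirical risks of two candidate models on three tasks.
model_a = [0.10, 0.10, 0.70]  # good on average, poor on the third task
model_b = [0.35, 0.35, 0.35]  # uniform across tasks

# Select the candidate that minimizes each objective.
best_by_mean = min((model_a, model_b), key=mean_mtl_objective)
best_by_max = min((model_a, model_b), key=minimax_mtl_objective)
```

With these made-up numbers the mean criterion prefers `model_a`, while the minimax criterion prefers `model_b`, because it only looks at the worst-served task.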

Full text

Some Minimax Theorems.

Journal: Mathematica Scandinavica 1972

Full text

Monte-Carlo Tree Search and Minimax Hybrids with Heuristic Evaluation Functions

2014

Hendrik Baier, Mark H. M. Winands

Monte-Carlo Tree Search (MCTS) has been found to play suboptimally in some tactical domains due to its highly selective search, focusing only on the most promising moves. In order to combine the strategic strength of MCTS and the tactical strength of minimax, MCTS-minimax hybrids have been introduced, embedding shallow minimax searches into the MCTS framework. Their results have been promising e...

Full text

Hierarchical Clustering With Prototypes via Minimax Linkage.

Journal: Journal of the American Statistical Association 2011

Jacob Bien, Robert Tibshirani

Agglomerative hierarchical clustering is a popular class of methods for understanding the structure of a dataset. The nature of the clustering depends on the choice of linkage, that is, on how one measures the distance between clusters. In this article we investigate minimax linkage, a recently introduced but little-studied linkage. Minimax linkage is unique in naturally associating a prototype ...
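The truncated abstract does not spell out the linkage itself. As a rough sketch, assuming the usual definition (the distance between two clusters is the smallest radius of a ball, centered at one of the merged points, that covers all merged points, and that center serves as the prototype), the computation might look like this. The function name and the sample points are hypothetical.

```python
import math


def minimax_linkage(cluster_g, cluster_h):
    """Return (distance, prototype) for the merged cluster G ∪ H.

    Assumed definition: min over centers x in G ∪ H of the
    max Euclidean distance from x to any point in G ∪ H.
    """
    merged = list(cluster_g) + list(cluster_h)

    def radius(center):
        # Largest distance from this candidate center to any merged point.
        return max(math.dist(center, point) for point in merged)

    prototype = min(merged, key=radius)
    return radius(prototype), prototype


# Hypothetical 2-D points, for illustration only.
g = [(0.0, 0.0), (1.0, 0.0)]
h = [(2.0, 0.0)]
distance, prototype = minimax_linkage(g, h)
```

Here the middle point covers both neighbors within distance 1, so it is returned as the prototype of the merged cluster.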

Full text

Minimax LQG Control

2006

Ian R. Petersen

This paper presents an overview of some recent results concerning the emerging theory of minimax LQG control for uncertain systems with a relative entropy constraint uncertainty description. This is an important new robust control system design methodology providing minimax optimal performance in terms of a quadratic cost functional. The paper first considers some standard uncertainty descripti...

Full text

Conditional minimax estimates

Journal: Journal of Computational and Applied Mathematics 1990

Full text

Minimax Regression Quantiles

Journal: SSRN Electronic Journal 2010

Full text
