minimax module

minimax estimator of a lower bounded parameter of a discrete distribution under a squared log error loss function

Journal: :journal of sciences, islamic republic of iran 2012

n. nematollahi

the problem of estimating the parameter ?, when it is restricted to an interval of the form , in a class of discrete distributions, including binomial negative binomial discrete weibull and etc., is considered. we give necessary and sufficient conditions for which the bayes estimator of with respect to a two points boundary supported prior is minimax under squared log error loss function. for s...

متن کامل

QL2, a simple reinforcement learning scheme for two-player zero-sum Markov games

Journal: :Neurocomputing 2008

Benoît Frénay Marco Saerens

Markov games are a framework which formalises n-agent reinforcement learning. For instance, Littman proposed the minimax-Q algorithm to model two-agent zero-sum problems. This paper proposes a new simple algorithm in this framework, QL2, and compares it to several standard algorithms (Q-learning, Minimax and minimax-Q). Experiments show that QL2 converges to optimal mixed policies, as minimax-Q...

متن کامل

Error Performance of Quantum Minimax Receiver for 16QAM Coherent State Signal

2014

Kentaro Kato

Recently a numerical calculation method for nding the minimax solution to the minimax problem in quantum signal detection was reported [10]. In this paper, we evaluate the error performance of the minimax receiver for 16QAM coherent state signal by using this calculation method. Through this numerical simulation, it will be pointed out that the use of the minimax strategy has an advantage rathe...

متن کامل

UDC 519.244.3 An Asymptotic Minimax Theorem for Gaussian Two-Armed Bandit

2017

A. V. Kolnogorov

The asymptotic minimax theorem for Bernoulli two-armed bandit problem states that minimax risk has the order N as N → ∞, where N is the control horizon, and provides the estimates of the factor. For Gaussian twoarmed bandit with unit variances of one-step incomes and close expectations, we improve the asymptotic minimax theorem as follows: the minimax risk is approximately equal to 0.637N as N ...

متن کامل

Convergence Results of a Local Minimax Method for Finding Multiple Critical Points

Journal: :SIAM J. Scientific Computing 2003

Yongxin Li Jianxin Zhou

In [14], a new local minimax method that characterizes a saddle point as a solution to a local minimax problem is established. Based on the local characterization, a numerical minimax algorithm is designed for finding multiple saddle points. Numerical computations of many examples in semilinear elliptic PDE have been successfully carried out to solve for multiple solutions. One of the important...

متن کامل

Mixed risk-neutral/minimax control of discrete-time, finite-state Markov decision processes

Journal: :IEEE Trans. Automat. Contr. 2000

Stefano P. Coraluppi Steven I. Marcus

This paper addresses the control design problem for discrete-time, nite-state Markov Decision Processes (MDPs), when both risk-neutral and minimax objectives are of interest. We introduce the mixed risk-neutral/minimax objective, and utilize results from risk-neutral and minimax control to derive an information state process and dynamic programming equations for the value function. We synthesiz...

متن کامل

Nonparametric Regression and Prediction with Dependent Errors (running Title: Regression and Prediction under Dependence)

1997

Yuhong Yang

We study minimax rates of convergence for nonparametric regression and prediction under a random design with dependent errors. It is shown that dependence among errors in general does not hurt a prediction of the next response. For estimating the regression function, however, dependence may damage the minimax rate of convergence. Under the assumption that the errors are independent of the expla...

متن کامل

Minimax design criterion for fractional factorial designs

2015

Yue Yin Julie Zhou

An A-optimal minimax design criterion is proposed to construct fractional factorial designs, which extends the study of the D-optimal minimax design criterion in Lin and Zhou (Canadian Journal of Statistics 41, 325–340, 2013). The resulting Aoptimal and D-optimal minimax designs minimize, respectively, the maximum trace and determinant of the mean squared error matrix of the least squares estim...

متن کامل

Asymptotically Minimax Robust Hypothesis Testing

Journal: :CoRR 2017

Gökhan Gül

The design of asymptotically minimax robust hypothesis testing is formalized for the Bayesian and Neyman-Pearson tests of Type I and II. The uncertainty classes based on the KL-divergence, αdivergence, symmetrized α-divergence, total variation distance, as well as the band model, moment classes and p-point classes are considered. It is shown with a counterexample that minimax robust tests do no...

متن کامل

On Minimax and Idempotent Generalized Weak Solutions to the Hamilton–Jacobi Equation

2004

Ilya V. Roublev ILYA V. ROUBLEV

This paper provides a direct equivalence proof for minimax solutions of A.I. Subbotin and generalized weak solutions in the sense of idempotent analysis. It is shown that the Hamilton-Jacobi equation Vt+H(t, x,DxV ) = 0 (with the Hamiltonian H(t, x, s) concave in s), considered in the context of minimax generalized solutions, is linear w.r.t. ⊕ = min and ̄ = +. This leads to a representation fo...

متن کامل