Balanced Importance Sampling Estimation
نویسندگان
چکیده
In this paper we analyze a particular issue of estimation, namely the estimation of the expected value of an unknown function for a given distribution, with the samples drawn from other distributions. A motivation of this problem comes from machine learning. In reinforcement learning, an intelligent agent that learns to make decisions in an unknown environment encounters the problem of judging an arbitrary decision policy (the given distribution) on the basis of previous decisions and their outcomes suggested by previous policies (other distributions). The problem can be solved with the use of well established importance sampling estimators. To overcome a potential problem of excessive variance of such estimators, we introduce the family of balanced importance sampling estimators, prove their consistency and demonstrate empirically their superiority over the classical counterparts.
منابع مشابه
Efficient Analysis of Rare Events Associated with Individual Buffers in a Tandem Jackson Network
Over the last decade, importance sampling has been a popular technique for the efficient estimation of rare event probabilities. This paper presents an approach for applying balanced likelihood ratio importance sampling to the problem of quantifying the probability that the content of the second buffer in a two node tandem Jackson network reaches some high level before it becomes empty. Heurist...
متن کاملUnbalanced robust ANOVA for the estimation of measurement uncertainty at reduced cost
Location S1A1 S1A2 S2A1 S2A2 Fig. 1 Design for the estimation of the random components of uncertainty arising from both the sampling and the analytical procedures. (A) Balanced design. (B) Unbalanced design. An empirical estimation of the random components of measurement uncertainty arising from the sampling and analytical processes can be made via an experiment involving replication. Analysis ...
متن کاملEnergy-balanced multiple-sensor collaborative scheduling for maneuvering target tracking in wireless sensor networks
An energy-balanced multiple-sensor collaborative scheduling is proposed for maneuvering target tracking in wireless sensor networks (WSNs). According to the position of the maneuvering target, some sensor nodes in WSNs are awakened to form a sensor cluster for target tracking collaboratively. In the cluster, the cluster head node is selected to implement tracking task with changed sampling inte...
متن کاملOn Derivative Estimation of the Mean Time to Failure in Simulations of Highly Reliable Markovian Systems
The mean time to failure (MTTF) of a Markovian system can be expressed as a ratio of two expectations. For highly reliable Markovian systems, the resulting ratio formula consists of one expectation that cannot be estimated with bounded relative error when using standard simulation, while the other, which we call a non-rare expectation, can be estimated with bounded relative error. We show that ...
متن کاملA Balanced Likelihood Ratio Approach for Analyzing Rare Events in a Tandem Jackson Network
Balanced likelihood ratio importance sampling methods were originally developed for the analysis of fault-tolerant systems. This paper provides a basis for adapting this approach to analyze the rare event probability that total system size reaches a bound before returning to zero in tandem Jackson networks. An optimal importance sampling distribution for the single server case is derived throug...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006