نتایج جستجو برای: mixed strategy

تعداد نتایج: 554647  

2006
Stan Jarzabek

We summarize experiences from other projects in which we applied mixed-strategy. We observed similar benefits in terms of maintainability and reusability as in the Buffer library experiment described in Chapter 8 and 9, and FRS. Also, the structure of x-frameworks were similar in all the project. XVCL addresses issues changeability, evolution and reuse by applying the same underlying principles...

2004
Bernhard von Stengel

A game in strategic form does not always have a Nash equilibrium in which each player deterministically chooses one of his strategies. However, players may instead randomly select from among these pure strategies with certain probabilities. Randomizing one’s own choice in this way is called a mixed strategy. A profile of mixed strategies is called a mixed equilibrium if no player can gain on av...

2018
Ngoc Duy Nguyen Saeid Nahavandi Thanh Nguyen

In 2015, Google’s Deepmind announced an advancement in creating an autonomous agent based on deep reinforcement learning (DRL) that could beat a professional player in a series of 49 Atari games. However, the current manifestation of DRL is still immature, and has significant drawbacks. One of DRL’s imperfections is its lack of “exploration” during the training process, especially when working ...

2004
John Duggan

We prove existence of mixed strategy electoral equilibrium in the multidimensional Downsian model of elections. We do so by modelling voters explicitly as players, enabling us to resolve discontinuities in the game between the candidates, which have proved a barrier to existence. We then give a partial characterization: the supports of equilibrium mixed strategies must lie in the deep uncovered...

Journal: :Math. Meth. of OR 2004
Jason Shachat J. Todd Swarthout

We conducted an experiment in which each subject repeatedly played a game with a unique Nash equilibrium in mixed strategies against some computer-implemented mixed strategy. The results indicate subjects are successful at detecting and exploiting deviations from Nash equilibrium. However, there is heterogeneity in subject behavior and performance. We present a one variable model of dynamic ran...

Journal: :Entropy 2003
Edward Jiménez

This paper introduces Hermite’s polynomials, in the description of quantum games. Hermite’s polynomials are associated with gaussian probability density. The gaussian probability density represents minimum dispersion. I introduce the concept of minimum entropy as a paradigm of both Nash’s equilibrium (maximum utility MU) and Hayek equilibrium (minimum entropy ME). The ME concept is related to Q...

Journal: :Computational Statistics & Data Analysis 2004
Arthur Gilmour Brian Cullis Sue J. Welham Beverley J. Gogel Robin Thompson

After estimation of e3ects from a linear mixed model, it is often useful to form predicted values for certain factor/variate combinations. This process has been well-de5ned for linear models, but the introduction of random e3ects means that a decision has to be made about the inclusion or exclusion of random model terms from the predictions, including the residual error. For spatially correlate...

2011
Fu Xianping

Streaming media applications is currently limited by high bandwidth requirements. It is a challenging problem to provide the required quality of service (QoS) for the efficient transmission of video data under the varying network conditions such as the time-varying packet loss and fluctuating bandwidth. On Internet the most important part for streaming media transmission application is QoS cont...

2013
S. N. Ethier Jiyeon Lee

The casino game of baccara chemin de fer is a bimatrix game, not a matrix game, because the house collects a five percent commission on Banker wins. We generalize the game, allowing Banker’s strategy to be unconstrained and assuming a 100α percent commission on Banker wins, where 0 ≤ α < 2/5. Assuming for simplicity that cards are dealt with replacement, we show that, with one exception at α = ...

2004
V. Bhaskar George J. Mailath Stephen Morris

This paper investigates the Harsanyi (1973)-purifiability of mixed strategies in the repeated prisoners’ dilemma with perfect monitoring. We perturb the game so that in each period, a player receives a private payoff shock which is independently and identically distributed across players and periods. We focus on the purifiability of one-period memory mixed strategy equilibria used by Ely and Vä...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید