Search results for: Markov decision process Graph theory

Number of results: 2385831

2013
Ofer Dekel Elad Hazan

We consider regret minimization in adversarial deterministic Markov Decision Processes (ADMDPs) with bandit feedback. We devise a new algorithm that pushes the state-of-the-art forward in two ways: First, it attains a regret of O(T^{2/3}) with respect to the best fixed policy in hindsight, whereas the previous best regret bound was O(T^{3/4}). Second, the algorithm and its analysis are compatible with any...

2007
Theologos Bountourelis

Given a stochastic, acyclic, connected digraph with a single source node and a control agent that repetitively traverses this graph, each time starting from the source node, we want to define a control policy that will enable this agent to visit each of the graph terminal nodes a prespecified number of times, while minimizing the expected number of the graph traversals. We formulate this proble...

Journal: J. Algorithms 2000
Martin E. Dyer Catherine S. Greenhill

Random independent sets in graphs arise, for example, in statistical physics, in the hard-core model of a gas. In 1997, Luby and Vigoda described a rapidly mixing Markov chain for independent sets, which we refer to as the Luby–Vigoda chain. A new rapidly mixing Markov chain for independent sets is defined in this paper. Using path coupling, we obtain a polynomial upper bound for the mixing tim...

Journal: RAIRO - Operations Research 1980

Journal: Operations Research 1992

2017
Adityanarayanan Radhakrishnan Liam Solus Caroline Uhler

Two directed acyclic graphs (DAGs) are called Markov equivalent if and only if they have the same underlying undirected graph (i.e. skeleton) and the same set of immoralities. When using observational data alone and typical identifiability assumptions, such as faithfulness, a DAG model can only be determined up to Markov equivalence. Therefore, it is desirable to understand the size and number ...
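
The equivalence criterion stated in this abstract (same skeleton, same set of immoralities) can be checked directly. The following is a minimal sketch, not the authors' code: the function names and the edge-list representation are illustrative assumptions, and both graphs are assumed to share the same node set.

```python
from itertools import combinations

# Hypothetical helpers; a DAG is given as a list of (parent, child) edges.

def skeleton(edges):
    """Underlying undirected graph: each edge with its direction dropped."""
    return {frozenset(edge) for edge in edges}

def immoralities(edges):
    """Triples (a, b, child) where a -> child <- b and a, b are not adjacent."""
    skel = skeleton(edges)
    parents = {}
    for parent, child in edges:
        parents.setdefault(child, set()).add(parent)
    found = set()
    for child, ps in parents.items():
        for a, b in combinations(sorted(ps), 2):
            if frozenset((a, b)) not in skel:
                found.add((a, b, child))
    return found

def markov_equivalent(edges1, edges2):
    """True if both DAGs have the same skeleton and the same immoralities."""
    return (skeleton(edges1) == skeleton(edges2)
            and immoralities(edges1) == immoralities(edges2))

chain = [("a", "b"), ("b", "c")]      # a -> b -> c
reversed_chain = [("b", "a"), ("c", "b")]  # a <- b <- c
collider = [("a", "b"), ("c", "b")]   # a -> b <- c
```

Under this sketch, `chain` and `reversed_chain` are Markov equivalent, while `collider` is not equivalent to either, because it contains the immorality a -> b <- c.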

Journal: JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES 2020

Journal: Journal of the Operations Research Society of Japan 1987

1999
Acker J. Martin

Gendered relations of domination, subordination, and resistance are enacted daily in organizational practice (Acker, 1990; J. Martin, 1992). All work, including teamwork, involves interaction through which gender relations become produced and reproduced, consented and contested. The main task of this chapter is to examine, using the theoretical framework of gendered processes, the various inter...

2010
Philippe Jacquet Charles Knessl Wojciech Szpankowski

The method of types is one of the most popular techniques in information theory and combinatorics. Two sequences of equal length have the same type if they have identical empirical distributions. In this paper, we focus on Markov types, that is, sequences generated by a Markov source (of order one). We note that sequences having the same Markov type share the same so-called balanced frequency m...
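
The notion of type described in this abstract can be illustrated with simple counting. This is a rough sketch under stated assumptions: `same_type` follows the definition quoted above (identical empirical symbol distributions), while `pair_counts` is only an illustrative first-order statistic (counts of adjacent symbol pairs, non-cyclic) and is not claimed to match the paper's exact definition of a Markov type, which the truncated abstract does not give.

```python
from collections import Counter

def same_type(x, y):
    """Two sequences have the same type if their symbol counts coincide."""
    return Counter(x) == Counter(y)

def pair_counts(seq):
    """Counts of adjacent symbol pairs (an assumed order-one statistic)."""
    return Counter(zip(seq, seq[1:]))

# "0011" and "0101" share a type but differ in their adjacent-pair counts.
x, y = "0011", "0101"
```

Here `same_type(x, y)` holds, yet `pair_counts(x)` and `pair_counts(y)` differ, which is why order-one statistics distinguish sequences that the plain type cannot.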
