نتایج جستجو برای: mdp

تعداد نتایج: 3240  

Journal: :Journal of Machine Learning Research 2002
István Szita Bálint Takács András Lörincz

In this paper ε-MDP-models are introduced and convergence theorems are proven using the generalized MDP framework of Szepesvári and Littman. Using this model family, we show that Q-learning is capable of finding near-optimal policies in varying environments. The potential of this new family of MDP models is illustrated via a reinforcement learning algorithm called event-learning which separates...

Journal: :Dietetics 2022

The aim of this cross-sectional study was to understand how the public in a non-Mediterranean multi-ethnic society perceived Mediterranean dietary pattern (MDP) and its general health benefits. A total 373 participants took part study. Most sample were young adults, females had been living Australia for over 10 years. Knowledge MDP score, attitudes towards score an adherence MPD measured. Norma...

Journal: :The British journal of nutrition 2010
Elisa Martínez Rosa Llull Maria Del Mar Bibiloni Antoni Pons Josep A Tur

The aim of the present work was to assess the prevalence of the Mediterranean dietary pattern (MDP) in Balearic Islands adolescents, and socio-demographic and lifestyle factors that might determine adherence to the MDP. A cross-sectional nutritional survey was carried out in the Balearic Islands between 2007 and 2008. A random sample (n 1231) of the adolescent population (12-17 years old) was i...

2011
Junhua ZHANG Zhiqiu HUANG Zining CAO

For an embedded control system, different requirements maybe need be satisfied at same time. Always some of them makes the system to act inconsistently, or even conflicted. Conflict tolerant specification is provided to denote this situation. In such a system, there often exist probabilistic and non-deterministic behaviors. We use Markov Decision Process (MDP) to denote these features. Based on...

2012
Wei Huang Jun Zhang

In the course lectures, we have discussed a lot regarding unconstrained Markov Decision Process (MDP). The dynamic programming decomposition and optimal policies with MDP are also given. However, in this report we are going to discuss a different MDP model, which is constrained MDP. There are many realistic demand of studying constrained MDP. For instance, in the wireless sensors networks, each...

Journal: :Int. J. General Systems 2014
Abhijit Gosavi

In control systems theory, the Markov decision process (MDP) is a widely used optimization model involving selection of the optimal action in each state visited by a discrete-event system driven by Markov chains. The classical MDP model is suitable for an agent/decision-maker interested in maximizing expected revenues, but does not account for minimizing variability in the revenues. An MDP mode...

Journal: :Infection and immunity 1979
S Nagao A Tanaka Y Yamamoto T Koga K Onoue T Shiba K Kusumoto S Kotani

In the capillary tube migration system a synthetic muramyl dipeptide (MDP; N-acetylmuramyl-L-alanyl-D-isoglutamine), a part of bacterial cell wall peptidoglycans, inhibited the migration of peritoneal exudate macrophages from normal guinea pigs or rats. The migration inhibition was also caused by some MDP-containing peptidoglycan fragments from cell walls of Lactobacillus plantarum and Staphylo...

2010
Francisco S. Melo Manuel Lopes

In this paper we address the problem of learning a policy from demonstration. Assuming that the policy to be learned is the optimal policy for an underlying MDP, we propose a novel way of leveraging the underlying MDP structure in a kernel-based approach. Our proposed approach rests on the insight that the MDP structure can be encapsulated into an adequate state-space metric. In particular we s...

2017
Jie Chen Youyu Lan Yue He Chengsong He Fen Xu Yugao Zhang Yi Zhao Yi Liu

The aim of the present study was to examine the influence of technetium methylenediphosphonate (99Tc-MDP) on the proliferation and differentiation of human osteoblasts. Human iliac cancellous bone was isolated and cultured with either 99Tc‑MDP, β fibroblast growth factor (as a positive control) or medium only (as a negative control). Proliferation was assessed by direct cell counting, CCK‑8 ass...

Journal: :Circulation research 1989
K P Dresdner R P Kline A L Wit

A large reduction of intracellular potassium activity in depolarized subendocardial Purkinje fibers 24 hours after coronary artery ligation is accompanied by a much smaller increase in intracellular sodium activity. Similar intracellular ionic changes also occur during acute ischemia in ventricular muscle and are consistent with mechanisms based on intracellular acidification, which is known to...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید