نتایج جستجو برای: q value

تعداد نتایج: 842664  

Having knowledge of stability of an underground space depends on stresses and strains around it. Creating underground tunnels leads to significant changes in the rock mass stress. Therefore, to achieve the necessary stability, stresses and deformations around the tunnel must be examined carefully. Usually, stress-strain behavior analysis is conducted in two-dimensional mode. This paper was cond...

2017
Ying Xiong Jing Li Ningli Wang Xue Liu Zhao Wang Frank F. Tsai Xiuhua Wan

PURPOSE To determine corneal Q value and its related factors in Chinese subjects older than 30 years. DESIGN Cross sectional study. METHODS 1,683 participants (1,683 eyes) from the Handan Eye Study were involved, including 955 female and 728 male with average age of 53.64 years old (range from 30 to 107 years). The corneal Q values of anterior and posterior surfaces were measured at 3.0, 5....

2010
Hado van Hasselt

In some stochastic environments the well-known reinforcement learning algorithm Q-learning performs very poorly. This poor performance is caused by large overestimations of action values. These overestimations result from a positive bias that is introduced because Q-learning uses the maximum action value as an approximation for the maximum expected action value. We introduce an alternative way ...

Journal: :اقتصاد و توسعه کشاورزی 0
حسین محمدی فرشاد شعبانیان آهون کاسب

developing countries, including iran, have a high degree of volatility of macroeconomic variables. fluctuations inex change rate, bank interest rate and inflation rate can create insecure environment for in vestorsin iran. hence, this study examined the impact of macroeconomic variables on the tobin’s q index for the sugar companies of tehran stock exchange (tse) during the period between1380-1...

Background: Students sleep pattern, due to the stress of studying and teaching workload are different with other non-student peers. The aim of this study was to determine the prevalence of poor sleep quality in college students of Iran by a meta-analysis study, to be as a final measure for policy makers in this field. Methods: In this meta-analysis study, the databases of PubMed, Science Direct...

2013
Zheng Wen Benjamin Van Roy

We consider the problem of reinforcement learning over episodes of a finitehorizon deterministic system and as a solution propose optimistic constraint propagation (OCP), an algorithm designed to synthesize efficient exploration and value function generalization. We establish that when the true value function Q⇤ lies within the hypothesis class Q, OCP selects optimal actions over all but at mos...

2008
Dieter van Melkebeek Baris Aydinlioglu

the inequality follows from Hölder’s inequality: E [ fg] ≤ ∥f ∥∥ p ∥g ∥∥ q , if 1p + 1 q = 1 with p, q ≥ 1. If α = ±1 then (1) fails unless p = q or f is constant in absolute value. This follows because (T±1f)(x) = f(±x), where −x denotes x with all its bits flipped, and because the only functions f for which ∥f ∥∥ p = ∥f ∥∥ q for p 6= q are those that are constant in absolute value. In proving...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید