نتایج جستجو برای: λ logic algorithm

تعداد نتایج: 902091  

2014
Manel Tagorti Bruno Scherrer

We consider LSTD(λ), the least-squares temporal-difference algorithm with eligibility traces algorithm proposed by Boyan (2002). It computes a linear approximation of the value function of a fixed policy in a large Markov Decision Process. Under a β-mixing assumption, we derive, for any value of λ ∈ (0, 1), a high-probability estimate of the rate of convergence of this algorithm to its limit. W...

2015
Harm van Seijen A. Rupam Mahmood Patrick M. Pilarski Richard S. Sutton

The true online TD(λ) algorithm has recently been proposed (van Seijen and Sutton, 2014) as a universal replacement for the popular TD(λ) algorithm, in temporal-difference learning and reinforcement learning. True online TD(λ) has better theoretical properties than conventional TD(λ), and the expectation is that it also results in faster learning. In this paper, we put this hypothesis to the te...

2014
Harm van Seijen Richard S. Sutton

TD(λ) is a core algorithm of modern reinforcement learning. Its appeal comes from its equivalence to a clear and conceptually simple forward view, and the fact that it can be implemented online in an inexpensive manner. However, the equivalence between TD(λ) and the forward view is exact only for the off-line version of the algorithm (in which updates are made only at the end of each episode). ...

2009
Fabien Teytaud Olivier Teytaud

Motivated by parallel optimization, we experiment EDA-like adaptation-rules in the case of λ large. The rule we use, essentially based on estimation of multivariate normal algorithm, is (i) compliant with all families of distributions for which a density estimation algorithm exists (ii) simple (iii) parameter-free (iv) better than current rules in this framework of λ large. The speed-up as a fu...

A. Torkian and P. Khadivi, S. Samavi,

Fabrication of an integrated circuit with smaller area, besides reducing the cost of manufacturing, usually causes a reduction in the power dissipation and propagation delay. Using the static CMOS technology to fabricate a circuit that realizes a specific logic function and occupies a minimum space, it must be implemented with continuous diffusion runs. Therefore, at the design stage, an Euleri...

A. Torkian and P. Khadivi, S. Samavi,

Fabrication of an integrated circuit with smaller area, besides reducing the cost of manufacturing, usually causes a reduction in the power dissipation and propagation delay. Using the static CMOS technology to fabricate a circuit that realizes a specific logic function and occupies a minimum space, it must be implemented with continuous diffusion runs. Therefore, at the design stage, an Euleri...

ژورنال: اندیشه آماری 2015

Logic regression is a generalized regression and classification method that is able to make Boolean combinations as new predictive variables from the original binary variables. Logic regression was introduced for case control or cohort study with independent observations. Although in various studies, correlated observations occur due to different reasons, logic regression have not been studi...

2008
Judith J. McDonald DeAnne M. Morris

In this paper, we give necessary and sufficient conditions for a set of Jordan blocks to correspond to the peripheral spectrum of a nonnegative matrix. For each eigenvalue, λ, the λ-level characteristic (with respect to the spectral radius) is defined. The necessary and sufficient conditions include a requirement that the λ-level characteristic is majorized by the λ-height characteristic. An al...

Nowadays, permanent magnet synchronous motors have been widely used in industry due to the elimination of excitation losses, longer life and higher efficiency. Errors in engine and drive systems are unavoidable during operation. Therefore, a suitable scenario should be considered for when these systems fail. If the necessary predictions and control algorithms are not considered for the error co...

In practice, obtaining the global optimum for the economic dispatch {bf (ED)}problem with ramp rate limits and prohibited operating zones is presents difficulties. This paper presents a new andefficient method for solving the economic dispatch problem with non-smooth cost functions using aFuzzy Adaptive Genetic Algorithm (FAGA). The proposed algorithm  deals  with the issue ofcontrolling the ex...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید