return policy

نتایج جستجو برای: return policy

تعداد نتایج: 335824 فیلتر نتایج به سال:

Occupational Health and Wellness Policies

2017

Links [1] https://www.uoguelph.ca/hr/policies/back-care-policy [2] https://www.uoguelph.ca/hr/policies/bloodborne-pathogens-policy [3] https://www.uoguelph.ca/hr/policies/confidentiality-employee-health-and-medical-records-policy [4] https://www.uoguelph.ca/hr/policies/consumption-alcoholic-beverages-university-property-policy [5] https://www.uoguelph.ca/hr/policies/employee-assistance-program-...

متن کامل

A Joint Return Policy for a Multi-Item Perishable Inventory Model with Deterministic Demands, Return and All-Units Discount

Journal: :International Journal of Mathematical, Engineering and Management Sciences 2020

متن کامل

APRIL: Active Preference Learning-Based Reinforcement Learning

2012

Riad Akrour Marc Schoenauer Michèle Sebag

This paper focuses on reinforcement learning (RL) with limited prior knowledge. In the domain of swarm robotics for instance, the expert can hardly design a reward function or demonstrate the target behavior, forbidding the use of both standard RL and inverse reinforcement learning. Although with a limited expertise, the human expert is still often able to emit preferences and rank the agent de...

متن کامل

origins of armenia’s foreign policy and its foreign policy towards iran

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه علامه طباطبایی 1388

یزدان کیخسرو دولتیاری, رحمن قهرمانپور, آتوسا گودرزی,

foreign policy takes root from complicated matters. however, this issue may be more truth about armenia. although the new government of armenia is less than 20 years, people of this territory are the first ones who officially accepted christianity. in very past times, these people were a part of great emperors like iran, rome, and byzantium.armenia is regarded as a nation with a privileged hist...

15 صفحه اول

Model-Free Monte Carlo-like Policy Evaluation

2010

Raphaël Fonteneau Susan A. Murphy Louis Wehenkel Damien Ernst

We propose an algorithm for estimating the finite-horizon expected return of a closed loop control policy from an a priori given (off-policy) sample of one-step transitions. It averages cumulated rewards along a set of “broken trajectories” made of one-step transitions selected from the sample on the basis of the control policy. Under some Lipschitz continuity assumptions on the system dynamics...

متن کامل

Policy Gradient Methods

2010

Jan Peters J. Andrew Bagnell

A policy gradient method is a reinforcement learning approach that directly optimizes a parametrized control policy by gradient descent. It belongs to the class of policy search techniques that maximize the expected return of a policy in a fixed policy class while traditional value function approximation approaches derive policies from a value function. Policy gradient approaches have various a...

متن کامل

Pii: S0022-4359(02)00065-9

2002

Amir Heiman Bruce McWilliams Jinhua Zhao David Zilberman

In this article, we model money-back guarantees (MBGs) as put options. This use of option theory provides retailers with a framework to optimize the price and the return option independently and under various market conditions. This separation of product price and option value enables retailers to offer an unbundled MBG policy, that is, to allow the customer to choose whether to purchase an MBG...

متن کامل

The Impact of Agricultural Water Conservation Policy on Economic Growth

2012

Aaron Benson Ray Huffaker

An agricultural water conservation policy prevalent worldwide encourages producers to improve on-farm irrigation efficiency. Contrary to intention, increasing empirical evidence reveals that this policy may set an ‘irrigation efficiency trap’ that worsens water crises by reducing water supplies and jeopardizing economic growth. We derive a pair of testable hydrologic-economic conditions require...

متن کامل

Strategic alliances and firm value creation in China

2013

Huawei Zhang Sami Torstila

This study investigates the impact of 306 strategic alliances on the increment of firm value in the case of China. I apply the event study methodology using OLS market model to examine the abnormal returns of sample firms. The results show that the announcements of strategic alliance in China generate significant positive average abnormal return on the announcement date (0.96%) which reaches 1%...

متن کامل

Combined Optimization and Reinforcement Learning for Manipulation Skills

2016

Peter Englert Marc Toussaint

—This work addresses the problem of how a robot can improve a manipulation skill in a sample-efficient and secure manner. As an alternative to the standard reinforcement learning formulation where all objectives are defined in a single reward function, we propose a generalized formulation that consists of three components: 1) A known analytic control cost function; 2) A black-box return functio...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید