reward penalty scheme

Search results for: reward penalty scheme

Number of results: 265788

A model for scheduling projects under the condition of inflation and under penalty and reward arrangements

Journal: :ORiON 2014

Agent Cooperatives for Effective Power Consumption Shifting

2013

Charilaos Akasiadis Georgios Chalkiadakis

In this paper, we present a directly applicable scheme for electricity consumption shifting and effective demand curve flattening. The scheme can employ the services of either individual or cooperating consumer agents alike. Agents participating in the scheme, however, are motivated to form cooperatives, in order to reduce their electricity bills via lower group prices granted for sizable consu...

An Advance Q Learning (AQL) Approach for Path Planning and Obstacle Avoidance of a Mobile Robot

Journal: :IJIMR 2013

Arpita Chakraborty Jyoti Sekhar Banerjee

The goal of this paper is to improve the performance of the well-known Q learning algorithm, a robust machine learning technique, to facilitate path planning in an environment. Until now, Q learning algorithms such as the Classical Q learning (CQL) algorithm and the Improved Q learning (IQL) algorithm have dealt with an environment without obstacles, while in a real environment an agent has to face o...

Multi-Agent Deep Reinforcement Learning With Progressive Negative Reward for Cryptocurrency Trading

Journal: :IEEE Access 2023

Recently, reinforcement learning has been applied to cryptocurrencies to make profitable trades. However, cryptocurrency trading is a very challenging task due to the volatility of the market, especially during bearish periods. In addressing this problem, existing literature employs single-agent techniques such as deep Q-network (DQN), advantage actor-critic (A2C), and proximal policy optimization (PPO),...

Neural basis of reinforcement learning and decision making.

Journal: :Annual review of neuroscience 2012

Daeyeol Lee Hyojung Seo Min Whan Jung

Reinforcement learning is an adaptive process in which an animal utilizes its previous experience to improve the outcomes of future choices. Computational theories of reinforcement learning play a central role in the newly emerging areas of neuroeconomics and decision neuroscience. In this framework, actions are chosen according to their value functions, which describe how much future reward is...

A Cooperative Spectrum Leasing Scheme with Guaranteed Secrecy Rate for Primary Link

Journal: :Modares Journal of Electrical Engineering 0

Shokofe Vatanpour, M.Sc. student, Department of Electrical Engineering, Shahrood University of Technology, Iran. Mohammad Reza Jvan, Assistant Professor, Department of Electrical Engineering, Shahrood University of Technology, Iran

In this paper, we consider a cooperative cognitive radio network in which there is an OFDM primary link and multiple single-carrier secondary links. The primary link is required to maintain its secrecy rate above a predefined threshold. If the secrecy rate requirement is not satisfied, the secondary system helps the primary link to maintain its secrecy rate requirement. In doing so, the secondary trans...

A Fuzzy Reward and Punishment Scheme for Vehicular Ad Hoc Networks

Journal: :International Journal of Advanced Computer Science and Applications 2023

Trust management is an important security approach for the successful implementation of Vehicular Ad Hoc Networks (VANETs). Trust models evaluate messages to assign reward or punishment. This can be used to influence a driver’s future behaviour. In the author’s previous work, a sender-side based trust framework was developed which avoids receiver evaluation of messages. However, this does not guarantee that trusted ...

Bayesian Inference of Natural Rankings in Incomplete Competition Networks

2014

Juyong Park Soon-Hyung Yook

Competition between a complex system's constituents and a corresponding reward mechanism based on it have a profound influence on the functioning, stability, and evolution of the system. But determining the dominance hierarchy or ranking among the constituent parts from the strongest to the weakest--essential in determining reward and penalty--is frequently an ambiguous task due to the incomplete...

A Meshfree Solution of Two-Dimensional Potential Flow Problems

2010

I. V. Singh A. Singh

In this paper, the mesh-free element-free Galerkin (EFG) method is extended to solve two-dimensional potential flow problems. Two ideal fluid flow problems (i.e. flow over a rigid cylinder and flow over a sphere) have been formulated using a variational approach. Penalty and Lagrange multiplier techniques have been utilized for the enforcement of essential boundary conditions. Four point Gauss quadra...

Analysis of the Principle of Proportionality between Crimes and Punishments in Iranian Criminal Law

Thesis: Ministry of Science, Research and Technology - Payame Noor University - Payame Noor University of Tehran Province - Faculty of Law 1389 (Solar Hijri)

Fatemeh Karimi Mohsenabadi, Valiollah Ansari, Fatemeh Sohanian

If the precise implementation of the principle of proportion and balance between the violation and the penalty, as well as its other dimensions, could be considered a yardstick for the implementation of justice, any lack of precision in carrying out such a principle would indeed not be far from injustice. Naturally, if it would be imagined that the objective of balance between...
