Data-driven Offline Reinforcement Learning for HVAC-systems
نویسندگان
چکیده
This paper presents a novel framework for Offline Reinforcement Learning (RL) with online fine tuning Heating Ventilation and Air-conditioning (HVAC) systems. The method to do pre-training in black box model environment, where the models are built on data acquired under traditional control policy. focuses application of Underfloor (UFH) an air-to-water-based heat pump. However, should also generalize other HVAC applications. Because Black methods used is there little no commissioning time when applying this buildings/simulations beyond one presented study. explores deploys Artificial Neural Network (ANN) based design efficient controllers. Two ANN tested paper; Multilayer Perceptron (MLP) Long Short Term Memory (LSTM) method. It found that LSTM-based reduces prediction error by 45% compared MLP model. Additionally, different network architectures tested. creating new each step, performance can be improved additionally 19%. By using these paper, it shown Multi-Agent RL algorithm deployed without ever performing worse than industrial controller. Furthermore, if building from Building Management System (BMS) available, agent which performs close optimally first day deployment. An optimal policy cost heating 19.4 % simulation paper.
منابع مشابه
Autonomous HVAC Control, A Reinforcement Learning Approach
Recent high profile developments of autonomous learning thermostats by companies such as Nest Labs and Honeywell have brought to the fore the possibility of ever greater numbers of intelligent devices permeating our homes and working environments into the future. However, the specific learning approaches and methodologies utilised by these devices have never been made public. In fact little inf...
متن کاملOffline Evaluation of Online Reinforcement Learning Algorithms
In many real-world reinforcement learning problems, we have access to an existing dataset and would like to use it to evaluate various learning approaches. Typically, one would prefer not to deploy a fixed policy, but rather an algorithm that learns to improve its behavior as it gains more experience. Therefore, we seek to evaluate how a proposed algorithm learns in our environment, meaning we ...
متن کاملEmotion-Driven Reinforcement Learning
Existing computational models of emotion are primarily concerned with creating more realistic agents, with recent efforts looking into matching human data, including qualitative emotional responses and dynamics. In this paper, our work focuses on the functional benefits of emotion in a cognitive system where emotional feedback helps drive reinforcement learning. Our system is an integration of ...
متن کاملReinforcement Learning for CPG-Driven Biped Robot
Animal’s rhythmic movements such as locomotion are considered to be controlled by neural circuits called central pattern generators (CPGs). This article presents a reinforcement learning (RL) method for a CPG controller, which is inspired by the control mechanism of animals. Because the CPG controller is an instance of recurrent neural networks, a naive application of RL involves difficulties. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Energy
سال: 2022
ISSN: ['1873-6785', '0360-5442']
DOI: https://doi.org/10.1016/j.energy.2022.125290