TY - GEN
T1 - Designing a generalised reward for Building Energy Management Reinforcement Learning agents
AU - Martinez, Ruben Mulero
AU - Goikolea, Benat Arregi
AU - Beitia, Inigo Mendialdua
AU - Martinez, Roberto Garay
AU - Mulero, Rubén
AU - Arregi, Beñat
AU - Mendialdua, Iñigo
AU - Garay, Roberto
N1 - Publisher Copyright:
© 2021 University of Split, FESB.
PY - 2021/9/8
Y1 - 2021/9/8
N2 - The reduction of the carbon footprint of buildings is a challenging task, partly due to the conflicting goals of maximising occupant comfort and minimising energy consumption. An intelligent management of Heating, Ventilation and Air Conditioning (HVAC) systems is creating a promising research line in which the creation of suitable algorithms could reduce energy consumption maintaining occupants' comfort. In this regard, Reinforcement Learning (RL) approaches are giving a good balance between data requirements and intelligent operations to control building systems. However, there is a gap concerning how to create a generalised reward signal that can train RL agents without delimiting the problem to a specific or controlled scenario. To tackle it, an analysis and discussion is presented about the necessary requirements for the creation of generalist rewards, with the objective of laying the foundations that allow the creation of generalist intelligent agents for building energy management.
AB - The reduction of the carbon footprint of buildings is a challenging task, partly due to the conflicting goals of maximising occupant comfort and minimising energy consumption. An intelligent management of Heating, Ventilation and Air Conditioning (HVAC) systems is creating a promising research line in which the creation of suitable algorithms could reduce energy consumption maintaining occupants' comfort. In this regard, Reinforcement Learning (RL) approaches are giving a good balance between data requirements and intelligent operations to control building systems. However, there is a gap concerning how to create a generalised reward signal that can train RL agents without delimiting the problem to a specific or controlled scenario. To tackle it, an analysis and discussion is presented about the necessary requirements for the creation of generalist rewards, with the objective of laying the foundations that allow the creation of generalist intelligent agents for building energy management.
KW - Reinforcement learning
KW - Reward
KW - Generalised
KW - Building
KW - Energy efficiency
KW - HVAC
KW - Reinforcement learning
KW - Reward
KW - Generalised
KW - Building
KW - Energy efficiency
KW - HVAC
UR - http://www.scopus.com/inward/record.url?scp=85118449323&partnerID=8YFLogxK
U2 - 10.23919/SpliTech52315.2021.9566345
DO - 10.23919/SpliTech52315.2021.9566345
M3 - Conference contribution
SN - 978-1-6654-4202-2
SN - 978-953-290-112-2
T3 - 2021 6th International Conference on Smart and Sustainable Technologies, SpliTech 2021
SP - 1
EP - 6
BT - unknown
A2 - Solic, Petar
A2 - Nizetic, Sandro
A2 - Rodrigues, Joel J. P. C.
A2 - Rodrigues, Joel J.P.C.
A2 - Gonzalez-de-Artaza, Diego Lopez-de-Ipina
A2 - Perkovic, Toni
A2 - Catarinucci, Luca
A2 - Patrono, Luigi
PB - IEEE
T2 - 6th International Conference on Smart and Sustainable Technologies, SpliTech 2021
Y2 - 8 September 2021 through 11 September 2021
ER -