B2RL: An open-source Dataset for Building Batch Reinforcement Learning

09/30/2022
by Hsin-Yu Liu, et al.

Batch reinforcement learning (BRL) is an emerging research area in the RL community. It learns exclusively from static datasets (i.e., replay buffers) without interacting with the environment. In the offline setting, existing replay experiences serve as prior knowledge from which BRL models find the optimal policy. Generating replay buffers is therefore crucial for benchmarking BRL models. In our B2RL (Building Batch RL) dataset, we collected real-world data from our building management systems, as well as buffers generated by several behavioral policies in simulation environments. We believe it can help building experts conduct BRL research. To the best of our knowledge, we are the first to open-source building datasets for BRL.
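
To make the "static dataset, no environment interaction" idea concrete, here is a minimal sketch of batch learning from a fixed replay buffer. It uses plain tabular Q-learning as a stand-in, not any method from the B2RL paper, and the buffer contents, the function names (`batch_q_learning`, `greedy_action`), and all hyperparameters are hypothetical illustrations rather than part of the dataset.

```python
import random
from collections import defaultdict

# Hypothetical toy buffer of (state, action, reward, next_state, done) tuples.
# A real building buffer would hold sensor readings and setpoint actions.
STATIC_BUFFER = [
    (0, 1, 0.0, 1, False),
    (1, 1, 1.0, 2, True),
    (0, 0, 0.0, 0, False),
    (1, 0, 0.0, 0, False),
]


def batch_q_learning(buffer, n_actions=2, gamma=0.9, lr=0.5,
                     steps=2000, batch_size=2, seed=0):
    """Tabular Q-learning that only ever samples from a fixed buffer."""
    rng = random.Random(seed)
    q = defaultdict(lambda: [0.0] * n_actions)
    for _ in range(steps):
        # No env.step() anywhere: every update reuses logged transitions.
        for s, a, r, s2, done in rng.sample(buffer, batch_size):
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += lr * (target - q[s][a])
    return q


def greedy_action(q, state):
    """Return the action with the highest learned value in `state`."""
    values = q[state]
    return values.index(max(values))


if __name__ == "__main__":
    q_table = batch_q_learning(STATIC_BUFFER)
    print({s: greedy_action(q_table, s) for s in (0, 1)})
```

The point of the sketch is only that the learner's view of the world is whatever the buffer contains, which is why the coverage and quality of the behavioral policies that generated the buffer matter so much for benchmarking.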

research
09/17/2021

Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation

In recommender systems (RecSys) and real-time bidding (RTB) for online a...
research
10/04/2022

TripleE: Easy Domain Generalization via Episodic Replay

Learning how to generalize the model to unseen domains is an important a...
research
06/30/2019

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

Most deep reinforcement learning (RL) systems are not able to learn effe...
research
03/12/2023

Synthetic Experience Replay

A key theme in the past decade has been that when large neural networks ...
research
06/27/2023

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning

In recent years, data-driven reinforcement learning (RL), also known as ...
research
02/19/2020

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning

Off-policy reinforcement learning algorithms promise to be applicable in...
research
05/12/2021

Interpretable performance analysis towards offline reinforcement learning: A dataset perspective

Offline reinforcement learning (RL) has increasingly become the focus of...