An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions

05/15/2023
by   Xi Yang, et al.
0

Apprenticeship learning (AL) is a process of inducing effective decision-making policies via observing and imitating experts' demonstrations. Most existing AL approaches, however, are not designed to cope with the evolving reward functions commonly found in human-centric tasks such as healthcare, where offline learning is required. In this paper, we propose an offline Time-aware Hierarchical EM Energy-based Sub-trajectory (THEMES) AL framework to tackle the evolving reward functions in such tasks. The effectiveness of THEMES is evaluated via a challenging task – sepsis treatment. The experimental results demonstrate that THEMES can significantly outperform competitive state-of-the-art baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2021

Programmatic Reward Design by Example

Reward design is a fundamental problem in reinforcement learning (RL). A...
research
03/24/2023

Optimal Transport for Offline Imitation Learning

With the advent of large datasets, offline reinforcement learning (RL) i...
research
10/22/2017

Safety-Aware Apprenticeship Learning

Apprenticeship learning (AL) is a class of "learning from demonstrations...
research
12/12/2020

Semi-supervised reward learning for offline reinforcement learning

In offline reinforcement learning (RL) agents are trained using a logged...
research
07/13/2021

Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks

A longstanding goal of artificial intelligence is to create artificial a...
research
01/05/2023

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

Developing agents that can execute multiple skills by learning from pre-...

Please sign up or login with your details

Forgot password? Click here to reset