Despite the seeming success of contemporary grounded text generation sys...
We consider the Imitation Learning (IL) setup where expert data are not
...
We investigate models that can generate arbitrary natural language text ...
Energy-based models, a.k.a. energy networks, perform inference by optimi...
We introduce RLDS (Reinforcement Learning Datasets), an ecosystem for
re...
In Reinforcement Learning (RL), discrete actions, as opposed to continuo...
Offline Reinforcement Learning (RL) aims at learning an optimal control ...
Adversarial imitation learning has become a popular framework for imitat...
We address the issue of tuning hyperparameters (HPs) for imitation learn...
Offline Reinforcement Learning methods seek to learn a policy from logge...
The study of exploration in Reinforcement Learning (RL) has a long histo...
In recent years, on-policy reinforcement learning (RL) has been successf...
Imitation Learning (IL) methods seek to match the behavior of an agent w...
This paper deals with adversarial attacks on perceptions of neural netwo...