Reinforcement learning is time-consuming for complex tasks due to the ne...
Animals and robots navigate through environments by building and refinin...
Learning from rewards (i.e., reinforcement learning or RL) and learning ...
Most offline reinforcement learning (RL) algorithms return a target poli...
This work studies the problem of ad hoc teamwork in teams composed of ag...
State-of-the-art reinforcement learning (RL) algorithms typically use ra...
In this paper, we leverage the rapid advances in imitation learning, a t...
State-of-the-art deep Q-learning methods update Q-values using state
tra...
We present a strong baseline that surpasses the performance of previousl...
Deep reinforcement learning (DRL) has been demonstrated to provide promi...
The concept of utilizing multi-step returns for updating value functions...
Off-policy ensemble reinforcement learning (RL) methods have demonstrate...
Model-based Reinforcement Learning (MBRL) allows data-efficient learning...
We present an adversarial exploration strategy, a simple yet effective
i...
Efficient exploration remains a challenging research problem in reinforc...
Collecting training data from the physical world is usually time-consumi...
We present DPIQN, a deep policy inference Q-network that targets multi-a...
We introduce two tactics to attack agents trained by deep reinforcement
...