Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control

07/27/2022
by Takuya Kanazawa, et al.

Uncertainty quantification is one of the central challenges for machine learning in real-world applications. In reinforcement learning, an agent confronts two kinds of uncertainty: epistemic and aleatoric. Disentangling and evaluating both at once has the potential to improve the agent's final performance, accelerate training, and facilitate quality assurance after deployment. In this work, we propose an uncertainty-aware reinforcement learning algorithm for continuous control tasks that extends the Deep Deterministic Policy Gradient (DDPG) algorithm. It exploits epistemic uncertainty to accelerate exploration and aleatoric uncertainty to learn a risk-sensitive policy. We conduct numerical experiments showing that our variant of DDPG outperforms vanilla DDPG without uncertainty estimation in benchmark tasks on robotic control and power-grid optimization.
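To make the two kinds of uncertainty concrete, here is a minimal sketch of one common way to separate them when an ensemble of distributional critics is available. It is an illustration under stated assumptions, not the algorithm from the paper: the abstract does not specify the critic parameterization, so the quantile representation, the function names `decompose_uncertainty` and `cvar`, and the choice of CVaR as the risk measure are all hypothetical. Disagreement between ensemble members stands in for epistemic uncertainty, and the spread inside each member's return distribution stands in for aleatoric uncertainty.

```python
import numpy as np


def decompose_uncertainty(quantiles):
    """Split return uncertainty for one (state, action) pair.

    quantiles: array of shape (n_critics, n_quantiles); row i holds the
    quantile estimates of the return predicted by ensemble member i.
    (Assumed representation, not taken from the paper.)
    """
    quantiles = np.asarray(quantiles, dtype=float)
    member_means = quantiles.mean(axis=1)      # expected return per critic
    epistemic = member_means.var()             # disagreement between critics
    aleatoric = quantiles.var(axis=1).mean()   # average spread within a critic
    return member_means.mean(), epistemic, aleatoric


def cvar(quantiles, alpha=0.25):
    """Risk-averse value: mean of the lowest alpha fraction of pooled quantiles."""
    pooled = np.sort(np.asarray(quantiles, dtype=float).ravel())
    k = max(1, int(np.ceil(alpha * pooled.size)))
    return pooled[:k].mean()


# Two critics that agree with each other but each predict a noisy return:
# all of the uncertainty is aleatoric.
noisy = [[0.0, 2.0], [0.0, 2.0]]
# Two critics that are each certain but disagree: all of it is epistemic.
disputed = [[1.0, 1.0], [3.0, 3.0]]

print(decompose_uncertainty(noisy))
print(decompose_uncertainty(disputed))
```

In a scheme like this, an exploration bonus could be scaled by the epistemic term, while a risk-sensitive actor could maximize `cvar` instead of the mean; both uses are assumptions about how such quantities are typically consumed, not a description of the authors' implementation.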
