Query The Agent: Improving sample efficiency through epistemic uncertainty estimation

10/05/2022
by Julian Alverio, et al.

Curricula for goal-conditioned reinforcement learning agents typically rely on poor estimates of the agent's epistemic uncertainty or fail to consider it altogether, resulting in poor sample efficiency. We propose a novel algorithm, Query The Agent (QTA), which significantly improves sample efficiency by estimating the agent's epistemic uncertainty throughout the state space and setting goals in highly uncertain areas. Encouraging the agent to collect data in highly uncertain states allows it to rapidly improve its estimate of the value function. QTA uses a novel technique for estimating epistemic uncertainty, Predictive Uncertainty Networks (PUN), to assess the agent's uncertainty in all previously observed states. We demonstrate that QTA offers decisive sample efficiency improvements over preexisting methods.
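
The goal-setting idea in the abstract (score previously observed states by epistemic uncertainty, then set goals where the score is high) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not describe how PUN works, so the sketch swaps in a common stand-in, the variance across an ensemble of value estimates. The names `ensemble_disagreement`, `select_goal`, `top_k`, and the toy grid states are all hypothetical.

```python
import random
import statistics
from typing import Callable, Optional, Sequence, Tuple

State = Tuple[int, int]  # hypothetical grid-world state, for illustration only


def ensemble_disagreement(value_estimates: Sequence[float]) -> float:
    """Stand-in uncertainty score (NOT PUN): population variance of the
    value estimates that several independently trained heads give a state."""
    return statistics.pvariance(value_estimates)


def select_goal(
    visited: Sequence[State],
    uncertainty: Callable[[State], float],
    top_k: int = 3,
    rng: Optional[random.Random] = None,
) -> State:
    """Choose a goal uniformly among the top_k most uncertain visited states."""
    if not visited:
        raise ValueError("no previously observed states to choose from")
    rng = rng or random.Random()
    # Rank visited states from most to least uncertain.
    ranked = sorted(visited, key=uncertainty, reverse=True)
    return rng.choice(ranked[: max(1, top_k)])


# Toy data (assumed): three visited states, each with three value estimates.
toy_estimates = {
    (0, 0): [1.0, 1.0, 1.0],   # heads agree: low uncertainty
    (0, 1): [0.9, 1.1, 1.0],   # mild disagreement
    (3, 4): [0.1, 0.9, 0.5],   # strong disagreement: high uncertainty
}


def toy_uncertainty(state: State) -> float:
    return ensemble_disagreement(toy_estimates[state])


goal = select_goal(list(toy_estimates), toy_uncertainty, top_k=1)
print(goal)
```

With `top_k=1` the choice is greedy and always returns the single most uncertain state. A larger `top_k` spreads goals over several uncertain states, which is one simple way to avoid proposing the same goal repeatedly.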

