Never Forget: Balancing Exploration and Exploitation via Learning Optical Flow

01/24/2019
by   Hsuan-Kung Yang, et al.

Exploration bonuses derived from the novelty of states in an environment have become a popular way to motivate exploration in deep reinforcement learning (DRL) agents over the past few years. Recent methods such as curiosity-driven exploration usually estimate the novelty of new observations from the prediction errors of their system dynamics models. Because of the limited capacity of these models and the difficulty of next-frame prediction, however, such methods typically fail to balance exploration and exploitation in high-dimensional observation tasks, so agents forget the paths they have visited and explore the same states repeatedly. This inefficient exploration behavior causes significant performance drops, especially in large environments with sparse reward signals. In this paper, we propose to address this issue by introducing optical flow estimation from the field of computer vision. We use optical flow estimation errors to measure the novelty of new observations, so that agents can memorize and understand the visited states more comprehensively. We compare our method against previous approaches in a number of experiments. Our results indicate that the proposed method delivers better and longer-lasting performance than the previous methods. We further provide a comprehensive set of ablation analyses of the proposed method and investigate the impact of optical flow estimation on the learning curves of the DRL agents.
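The abstract does not say how the optical flow estimation error is computed or which flow model is used. As a rough illustration of the idea only, the sketch below assumes the error is the mean squared difference between the current frame and the next frame warped back by a predicted flow field. The function names `warp_with_flow` and `flow_novelty_bonus`, the `scale` parameter, and the nearest-neighbour warping are all hypothetical choices made for this example, not the authors' implementation.

```python
import numpy as np


def warp_with_flow(frame, flow):
    """Warp `frame` by sampling it at positions displaced by `flow`.

    frame: (H, W) grayscale image.
    flow:  (H, W, 2) per-pixel displacements (dx, dy).
    Uses nearest-neighbour sampling and clamps at the image border
    (an assumption made to keep the sketch short).
    """
    h, w = frame.shape
    ys, xs = np.mgrid[0:h, 0:w]
    src_x = np.clip(np.rint(xs + flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[..., 1]).astype(int), 0, h - 1)
    return frame[src_y, src_x]


def flow_novelty_bonus(frame_t, frame_next, predicted_flow, scale=1.0):
    """Hypothetical novelty bonus: reconstruction error of frame_t.

    If the predicted flow explains the motion between the two frames,
    warping frame_next back reproduces frame_t and the bonus is small.
    A poorly predicted transition yields a large error, which is
    treated here as a sign of novelty.
    """
    reconstructed = warp_with_flow(frame_next, predicted_flow)
    return scale * float(np.mean((reconstructed - frame_t) ** 2))


# Toy example: a small bright patch that moves one pixel to the right.
frame_t = np.zeros((8, 8))
frame_t[3:5, 2:4] = 1.0
frame_next = np.roll(frame_t, 1, axis=1)

good_flow = np.zeros((8, 8, 2))
good_flow[..., 0] = 1.0           # correct guess: everything moved +1 in x
bad_flow = np.zeros((8, 8, 2))    # wrong guess: nothing moved

bonus_good = flow_novelty_bonus(frame_t, frame_next, good_flow)
bonus_bad = flow_novelty_bonus(frame_t, frame_next, bad_flow)
```

In this toy case the correct flow gives a lower bonus than the incorrect one. In an actual agent the flow would come from a learned predictor trained on visited transitions, so that familiar transitions yield small bonuses over time.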


research
05/24/2019

Exploration via Flow-Based Intrinsic Rewards

Exploration bonuses derived from the novelty of observations in an envir...
research
04/13/2021

MESD: Exploring Optical Flow Assessment on Edge of Motion Objects with Motion Edge Structure Difference

The optical flow estimation has been assessed in various applications. I...
research
02/20/2018

Uncertainty Estimates for Optical Flow with Multi-Hypotheses Networks

Recent work has shown that optical flow estimation can be formulated as ...
research
04/06/2020

Optical Flow Estimation in the Deep Learning Age

Akin to many subareas of computer vision, the recent advances in deep le...
research
06/01/2018

Deep Curiosity Search: Intra-Life Exploration Improves Performance on Challenging Deep Reinforcement Learning Problems

Traditional exploration methods in RL require agents to perform random a...
research
03/09/2022

Investigation of Factorized Optical Flows as Mid-Level Representations

In this paper, we introduce a new concept of incorporating factorized fl...
research
05/15/2023

MIMEx: Intrinsic Rewards from Masked Input Modeling

Exploring in environments with high-dimensional observations is hard. On...
