Generative Augmented Flow Networks

10/07/2022
by   Ling Pan, et al.
0

The Generative Flow Network is a probabilistic framework where an agent learns a stochastic policy for object generation, such that the probability of generating an object is proportional to a given reward function. Its effectiveness has been shown in discovering high-quality and diverse solutions, compared to reward-maximizing reinforcement learning-based methods. Nonetheless, GFlowNets only learn from rewards of the terminal states, which can limit its applicability. Indeed, intermediate rewards play a critical role in learning, for example from intrinsic motivation to provide intermediate feedback even in particularly challenging sparse reward tasks. Inspired by this, we propose Generative Augmented Flow Networks (GAFlowNets), a novel learning framework to incorporate intermediate rewards into GFlowNets. We specify intermediate rewards by intrinsic motivation to tackle the exploration problem in sparse reward environments. GAFlowNets can leverage edge-based and state-based intrinsic rewards in a joint way to improve exploration. Based on extensive experiments on the GridWorld task, we demonstrate the effectiveness and efficiency of GAFlowNet in terms of convergence, performance, and diversity of solutions. We further show that GAFlowNet is scalable to a more complex and large-scale molecule generation domain, where it achieves consistent and significant performance improvement.

READ FULL TEXT
research
07/14/2021

Experimental Evidence that Empowerment May Drive Exploration in Sparse-Reward Environments

Reinforcement Learning (RL) is known to be often unsuccessful in environ...
research
06/28/2022

GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning

In this study, we address the problem of efficient exploration in reinfo...
research
12/21/2022

Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

This paper introduces a novel method of adding intrinsic bonuses to task...
research
08/01/2019

Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation

Visual paragraph generation aims to automatically describe a given image...
research
05/25/2018

Visceral Machines: Reinforcement Learning with Intrinsic Rewards that Mimic the Human Nervous System

The human autonomic nervous system has evolved over millions of years an...
research
10/15/2020

An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse Rewards

In order to provide adaptive and user-friendly solutions to robotic mani...
research
12/01/2019

Affect-based Intrinsic Rewards for Learning General Representations

Positive affect has been linked to increased interest, curiosity and sat...

Please sign up or login with your details

Forgot password? Click here to reset