Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

10/25/2021
by   Kibeom Kim, et al.
0

Learning in a multi-target environment without prior knowledge about the targets requires a large amount of samples and makes generalization difficult. To solve this problem, it is important to be able to discriminate targets through semantic understanding. In this paper, we propose goal-aware cross-entropy (GACE) loss, that can be utilized in a self-supervised way using auto-labeled goal states alongside reinforcement learning. Based on the loss, we then devise goal-discriminative attention networks (GDAN) which utilize the goal-relevant information to focus on the given instruction. We evaluate the proposed methods on visual navigation and robot arm manipulation tasks with multi-target environments and show that GDAN outperforms the state-of-the-art methods in terms of task success ratio, sample efficiency, and generalization. Additionally, qualitative analyses demonstrate that our proposed method can help the agent become aware of and focus on the given instruction clearly, promoting goal-directed behavior.

READ FULL TEXT

page 6

page 10

page 17

research
08/08/2019

Vision-based Navigation Using Deep Reinforcement Learning

Deep reinforcement learning (RL) has been successfully applied to a vari...
research
05/23/2023

L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning

Tasks that involve interaction with various targets are called multi-tar...
research
09/18/2021

Density-based Curriculum for Multi-goal Reinforcement Learning with Sparse Rewards

Multi-goal reinforcement learning (RL) aims to qualify the agent to acco...
research
05/21/2020

Dynamics-Aware Latent Space Reachability for Exploration in Temporally-Extended Tasks

Self-supervised goal proposal and reaching is a key component of efficie...
research
05/21/2020

LEAF: Latent Exploration Along the Frontier

Self-supervised goal proposal and reaching is a key component for explor...
research
02/17/2020

GACEM: Generalized Autoregressive Cross Entropy Method for Multi-Modal Black Box Constraint Satisfaction

In this work we present a new method of black-box optimization and const...
research
08/14/2019

Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation

Natural question generation (QG) is a challenging yet rewarding task, th...

Please sign up or login with your details

Forgot password? Click here to reset