Landmark Policy Optimization for Object Navigation Task

09/17/2021
by Aleksey Staroverov, et al.

This work studies the object goal navigation task, which involves navigating to the closest object of a given semantic category in unseen environments. Recent work has shown significant progress with both end-to-end reinforcement learning approaches and modular systems, but a big step forward is still needed for these methods to be robust and optimal. We propose a hierarchical method that combines the standard task formulation with additional knowledge of the area in the form of landmarks, together with a way to extract these landmarks. In the hierarchy, the low level consists of separately trained algorithms for the most intuitive skills, and the high level decides which skill is needed at a given moment. With all proposed solutions, we achieve a 0.75 success rate in the realistic Habitat simulator. After a short stage of additional model training in a virtual reconstruction of the area in the simulator, we successfully confirmed our results in a real-world setting.
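The two-level structure described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the skill names (`explore`, `go_to_landmark`, `go_to_object`), the `Observation` fields, the placeholder action strings, and the hand-written rule in `select_skill`, which stands in for whatever high-level policy the authors actually use.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Observation:
    """Hypothetical, heavily simplified agent observation."""
    goal_visible: bool    # an object of the target category is in view
    landmark_known: bool  # a landmark related to the goal is available


# A low-level skill maps an observation to a primitive action name.
Skill = Callable[[Observation], str]


def explore(obs: Observation) -> str:
    # Placeholder for a separately trained exploration skill.
    return "move_forward"


def go_to_landmark(obs: Observation) -> str:
    # Placeholder for a skill that navigates toward a known landmark.
    return "turn_toward_landmark"


def go_to_object(obs: Observation) -> str:
    # Placeholder for a skill that approaches the visible goal object.
    return "approach_goal"


SKILLS: Dict[str, Skill] = {
    "explore": explore,
    "go_to_landmark": go_to_landmark,
    "go_to_object": go_to_object,
}


def select_skill(obs: Observation) -> str:
    """High level: pick a skill (a hand-written rule, assumed here)."""
    if obs.goal_visible:
        return "go_to_object"
    if obs.landmark_known:
        return "go_to_landmark"
    return "explore"


def act(obs: Observation) -> str:
    """Run one step: the high level picks a skill, the skill picks an action."""
    return SKILLS[select_skill(obs)](obs)
```

In this sketch the high level only routes control, so each low-level skill could be trained or replaced independently, which is the property the hierarchical formulation relies on.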


