Long-Range Indoor Navigation with PRM-RL

02/25/2019
by   Anthony Francis, et al.
4

Long-range indoor navigation requires guiding robots with noisy sensors and controls through cluttered environments along paths that span a variety of buildings. We achieve this with PRM-RL, a hierarchical robot navigation method in which reinforcement learning agents that map noisy sensors to robot controls learn to solve short-range obstacle avoidance tasks, and then sampling-based planners map where these agents can reliably navigate in simulation; these roadmaps and agents are then deployed on-robot, guiding the robot along the shortest path where the agents are likely to succeed. Here we use Probabilistic Roadmaps (PRMs) as the sampling-based planner and AutoRL as the reinforcement learning method in the indoor navigation context. We evaluate the method in simulation for kinematic differential drive and kinodynamic car-like robots in several environments, and on-robot for differential-drive robots at two physical sites. Our results show PRM-RL with AutoRL is more successful than several baselines, is robust to noise, and can guide robots over hundreds of meters in the face of noise and obstacles in both simulation and on-robot, including over 3.3 kilometers of physical robot navigation.

READ FULL TEXT

page 1

page 2

page 3

page 5

page 8

page 9

page 14

page 18

research
10/11/2017

PRM-RL: Long-range Robotic Navigation Tasks by Combining Reinforcement Learning and Sampling-based Planning

We present PRM-RL, a hierarchical method for long-range navigation task ...
research
11/18/2020

Indoor Point-to-Point Navigation with Deep Reinforcement Learning and Ultra-wideband

Indoor autonomous navigation requires a precise and accurate localizatio...
research
09/27/2019

Long-Term Robot Navigation in Indoor Environments Estimating Patterns in Traversability Changes

Nowadays, mobile robots are deployed in many indoor environments, such a...
research
11/11/2022

Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling

Learning long-horizon tasks such as navigation has presented difficult c...
research
07/12/2019

NH-TTC: A gradient-based framework for generalized anticipatory collision avoidance

We propose NH-TTC, a general method for fast, anticipatory collision avo...
research
05/01/2023

IndoorSim-to-OutdoorReal: Learning to Navigate Outdoors without any Outdoor Experience

We present IndoorSim-to-OutdoorReal (I2O), an end-to-end learned visual ...
research
11/24/2019

Prioritized Multi-agent Path Finding for Differential Drive Robots

Methods for centralized planning of the collision-free trajectories for ...

Please sign up or login with your details

Forgot password? Click here to reset