Robot navigation from human demonstration: learning control behaviors with environment feature maps

05/06/2022
by Maggie Wigness, et al.

When working alongside human collaborators in dynamic and unstructured environments, such as disaster recovery or military operations, an unmanned ground vehicle (UGV) must adapt quickly in the field to perform its duties or learn novel tasks. In these scenarios, personnel and equipment are constrained, which makes training with minimal human supervision a desirable learning attribute. We address the problem of making UGVs more reliable and adaptable teammates with a novel framework that uses visual perception and inverse optimal control to learn traversal costs for environment features. Through extensive evaluation in a real-world environment, we show that our framework requires few human-demonstrated trajectory exemplars to learn feature costs that reliably encode several different traversal behaviors. Additionally, we present an online version of the framework that allows a human teammate to intervene during live operation to correct deteriorated behavior or to adapt behavior to dynamic changes in complex and unstructured environments.
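
The abstract does not spell out the learning algorithm, so the following is only a minimal sketch of the general idea of learning per-feature traversal costs from a demonstrated trajectory. It assumes a toy grid whose cells carry one terrain label each (a stand-in for a visual feature map), a Dijkstra planner, and a simple feature-matching update that raises the cost of terrain the planner uses more than the demonstrator and lowers the cost of terrain the demonstrator prefers. The names `plan`, `feature_counts`, `learn_costs`, the grid, and the learning rate are all hypothetical and are not taken from the paper.

```python
import heapq
import numpy as np

GRASS, ROAD = 0, 1  # hypothetical terrain classes for this toy example


def plan(terrain, weights, start, goal):
    """Dijkstra on a 4-connected grid; entering a cell costs weights[class]."""
    rows, cols = terrain.shape
    dist = {start: 0.0}
    parent = {}
    frontier = [(0.0, start)]
    while frontier:
        d, cell = heapq.heappop(frontier)
        if cell == goal:
            break
        if d > dist[cell]:
            continue  # stale queue entry
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nxt[0] < rows and 0 <= nxt[1] < cols:
                nd = d + weights[terrain[nxt]]
                if nd < dist.get(nxt, float("inf")):
                    dist[nxt] = nd
                    parent[nxt] = cell
                    heapq.heappush(frontier, (nd, nxt))
    # Walk parents back from the goal to recover the path.
    path = [goal]
    while path[-1] != start:
        path.append(parent[path[-1]])
    return path[::-1]


def feature_counts(terrain, path, n_classes):
    """Count how often each terrain class is entered along a path."""
    counts = np.zeros(n_classes)
    for cell in path[1:]:  # the start cell is not "entered"
        counts[terrain[cell]] += 1
    return counts


def learn_costs(terrain, demo, n_classes, lr=0.1, iters=50):
    """Adjust per-class costs until the planned path matches the demo's counts."""
    weights = np.ones(n_classes)
    demo_counts = feature_counts(terrain, demo, n_classes)
    for _ in range(iters):
        planned = plan(terrain, weights, demo[0], demo[-1])
        grad = feature_counts(terrain, planned, n_classes) - demo_counts
        if not grad.any():
            break  # planner already reproduces the demonstrated feature usage
        # Keep costs strictly positive so Dijkstra remains valid.
        weights = np.maximum(weights + lr * grad, 0.01)
    return weights


# Toy map: a road loops around a grass patch; the demo stays on the road.
terrain = np.array([
    [ROAD, ROAD,  ROAD,  ROAD,  ROAD],
    [ROAD, GRASS, GRASS, GRASS, ROAD],
    [ROAD, GRASS, GRASS, GRASS, ROAD],
])
demo = [(2, 0), (1, 0), (0, 0), (0, 1), (0, 2), (0, 3), (0, 4), (1, 4), (2, 4)]
weights = learn_costs(terrain, demo, n_classes=2)
```

In this toy setup a single demonstration is enough for the learned grass cost to exceed the road cost, so the planner takes the longer road route instead of cutting across the grass. A real system would replace the hand-written grid with features extracted from camera imagery and would likely use a more principled inverse optimal control objective.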


research
03/20/2020

Visual Navigation Among Humans with Optimal Control as a Supervisor

Real world navigation requires robots to operate in unfamiliar, dynamic ...
research
03/06/2019

Combining Optimal Control and Learning for Visual Navigation in Novel Environments

Model-based control is a popular paradigm for robot navigation because i...
research
07/31/2021

Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration

Traditional imitation learning provides a set of methods and algorithms ...
research
04/08/2019

Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation

There has been an increasing interest in 3D indoor navigation, where a r...
research
03/14/2023

RE-MOVE: An Adaptive Policy Design Approach for Dynamic Environments via Language-Based Feedback

Reinforcement learning-based policies for continuous control robotic nav...
research
06/28/2019

Motion Prediction with Recurrent Neural Network Dynamical Models and Trajectory Optimization

Predicting human motion in unstructured and dynamic environments is diff...
research
10/21/2017

Human Learning of Unknown Environments in Agile Guidance Tasks

Trained human pilots or operators still stand out through their efficien...
