Human-Inspired Multi-Agent Navigation using Knowledge Distillation

03/18/2021
by   Pei Xu, et al.
0

Despite significant advancements in the field of multi-agent navigation, agents still lack the sophistication and intelligence that humans exhibit in multi-agent settings. In this paper, we propose a framework for learning a human-like general collision avoidance policy for agent-agent interactions in fully decentralized, multi-agent environments. Our approach uses knowledge distillation with reinforcement learning to shape the reward function based on expert policies extracted from human trajectory demonstrations through behavior cloning. We show that agents trained with our approach can take human-like trajectories in collision avoidance and goal-directed steering tasks not provided by the demonstrations, outperforming the experts as well as learning-based agents trained without knowledge distillation.

READ FULL TEXT
research
06/02/2021

Least-Restrictive Multi-Agent Collision Avoidance via Deep Meta Reinforcement Learning and Optimal Control

Multi-agent collision-free trajectory planning and control subject to di...
research
10/24/2019

Reciprocal Collision Avoidance for General Nonlinear Agents using Reinforcement Learning

Finding feasible and collision-free paths for multiple nonlinear agents ...
research
09/17/2022

Sample-Efficient Multi-Agent Reinforcement Learning with Demonstrations for Flocking Control

Flocking control is a significant problem in multi-agent systems such as...
research
03/27/2021

KnowRU: Knowledge Reusing via Knowledge Distillation in Multi-agent Reinforcement Learning

Recently, deep Reinforcement Learning (RL) algorithms have achieved dram...
research
10/17/2022

Learning Control Admissibility Models with Graph Neural Networks for Multi-Agent Navigation

Deep reinforcement learning in continuous domains focuses on learning co...
research
03/29/2020

Optimized Directed Roadmap Graph for Multi-Agent Path Finding Using Stochastic Gradient Descent

We present a novel approach called Optimized Directed Roadmap Graph (ODR...
research
06/24/2019

Training an Interactive Helper

Developing agents that can quickly adapt their behavior to new tasks rem...

Please sign up or login with your details

Forgot password? Click here to reset