A New Approach to Training Multiple Cooperative Agents for Autonomous Driving

09/05/2022
by   Ruiyang Yang, et al.
0

Training multiple agents to perform safe and cooperative control in the complex scenarios of autonomous driving has been a challenge. For a small fleet of cars moving together, this paper proposes Lepus, a new approach to training multiple agents. Lepus adopts a pure cooperative manner for training multiple agents, featured with the shared parameters of policy networks and the shared reward function of multiple agents. In particular, Lepus pre-trains the policy networks via an adversarial process, improving its collaborative decision-making capability and further the stability of car driving. Moreover, for alleviating the problem of sparse rewards, Lepus learns an approximate reward function from expert trajectories by combining a random network and a distillation network. We conduct extensive experiments on the MADRaS simulation platform. The experimental results show that multiple agents trained by Lepus can avoid collisions as many as possible while driving simultaneously and outperform the other four methods, that is, DDPG-FDE, PSDDPG, MADDPG, and MAGAIL(DDPG) in terms of stability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/19/2019

Adversarial Inverse Reinforcement Learning for Decision Making in Autonomous Driving

Generative Adversarial Imitation Learning (GAIL) is an efficient way to ...
research
10/16/2021

Generative Adversarial Imitation Learning for End-to-End Autonomous Driving on Urban Environments

Autonomous driving is a complex task, which has been tackled since the f...
research
01/29/2019

Safe, Efficient, and Comfortable Velocity Control based on Reinforcement Learning for Autonomous Driving

A model used for velocity control during car following was proposed base...
research
03/21/2023

Deep Q-Network Based Decision Making for Autonomous Driving

Currently decision making is one of the biggest challenges in autonomous...
research
10/07/2020

Modeling Human Driving Behavior in Highway Scenario using Inverse Reinforcement Learning

Human driving behavior modeling is of great importance for designing saf...
research
01/20/2023

On Multi-Agent Deep Deterministic Policy Gradients and their Explainability for SMARTS Environment

Multi-Agent RL or MARL is one of the complex problems in Autonomous Driv...
research
10/12/2020

A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a Graphic Convolution Q Network

Connected Autonomous Vehicle (CAV) Network can be defined as a collectio...

Please sign up or login with your details

Forgot password? Click here to reset