Rethinking Closed-loop Training for Autonomous Driving

06/27/2023
by Chris Zhang, et al.

Recent advances in high-fidelity simulators have enabled closed-loop training of autonomous driving agents, potentially solving the distribution shift between training and deployment and allowing training to be scaled both safely and cheaply. However, there is little understanding of how to build effective benchmarks for closed-loop training. In this work, we present the first empirical study that analyzes how different training benchmark designs, such as how traffic scenarios are designed and how training environments are scaled, affect the success of learning agents. Furthermore, we show that many popular RL algorithms cannot achieve satisfactory performance in the context of autonomous driving, as they lack long-term planning and take an extremely long time to train. To address these issues, we propose trajectory value learning (TRAVL), an RL-based driving agent that performs planning with multistep look-ahead and exploits cheaply generated imagined data for efficient learning. Our experiments show that TRAVL learns much faster and produces safer maneuvers than all the baselines. For more information, visit the project website: https://waabi.ai/research/travl
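The abstract describes planning with multistep look-ahead only at a high level. As a rough illustration of the general idea, and not the authors' implementation, the sketch below rolls out a few candidate trajectories over a fixed horizon, scores each whole trajectory, and picks the best one. Everything in it is an assumption made for illustration: the one-dimensional kinematics in `rollout`, the hand-written `toy_value` scorer (a learned value function would take its place in an actual RL agent), the 2 m safety margin, and all parameter values. It does not cover the imagined-data component.

```python
from typing import Callable, List, Sequence, Tuple

# A trajectory is a list of (position in m, speed in m/s), one entry per step.
Trajectory = List[Tuple[float, float]]


def rollout(pos: float, speed: float, accel: float,
            horizon: int, dt: float = 0.5) -> Trajectory:
    """Hypothetical 1-D kinematics: constant acceleration, no reversing."""
    traj: Trajectory = []
    for _ in range(horizon):
        speed = max(0.0, speed + accel * dt)
        pos += speed * dt
        traj.append((pos, speed))
    return traj


def toy_value(traj: Trajectory, obstacle_pos: float) -> float:
    """Hand-written stand-in for a learned trajectory value (assumption).

    Rewards forward progress and heavily penalizes any step that comes
    within 2 m of a static obstacle.
    """
    progress = traj[-1][0]
    too_close = any(p >= obstacle_pos - 2.0 for p, _ in traj)
    return progress - (1000.0 if too_close else 0.0)


def plan(pos: float, speed: float, accels: Sequence[float],
         value_fn: Callable[[Trajectory], float],
         horizon: int = 10) -> Tuple[float, Trajectory]:
    """Score every candidate over the full horizon and return the best."""
    candidates = [(a, rollout(pos, speed, a, horizon)) for a in accels]
    return max(candidates, key=lambda item: value_fn(item[1]))


if __name__ == "__main__":
    best_accel, best_traj = plan(
        pos=0.0, speed=10.0, accels=[-4.0, -2.0, 0.0, 2.0],
        value_fn=lambda t: toy_value(t, obstacle_pos=30.0),
    )
    print(best_accel, best_traj[-1])
```

Scoring the whole trajectory rather than a single next action is what lets this toy planner prefer gentle braking: holding speed or accelerating reaches the obstacle within the horizon, while hard braking gives up progress unnecessarily.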

