Tree-structured Policy Planning with Learned Behavior Models

01/27/2023
by   Yuxiao Chen, et al.
0

Autonomous vehicles (AVs) need to reason about the multimodal behavior of neighboring agents while planning their own motion. Many existing trajectory planners seek a single trajectory that performs well under all plausible futures simultaneously, ignoring bi-directional interactions and thus leading to overly conservative plans. Policy planning, whereby the ego agent plans a policy that reacts to the environment's multimodal behavior, is a promising direction as it can account for the action-reaction interactions between the AV and the environment. However, most existing policy planners do not scale to the complexity of real autonomous vehicle applications: they are either not compatible with modern deep learning prediction models, not interpretable, or not able to generate high quality trajectories. To fill this gap, we propose Tree Policy Planning (TPP), a policy planner that is compatible with state-of-the-art deep learning prediction models, generates multistage motion plans, and accounts for the influence of ego agent on the environment behavior. The key idea of TPP is to reduce the continuous optimization problem into a tractable discrete MDP through the construction of two tree structures: an ego trajectory tree for ego trajectory options, and a scenario tree for multi-modal ego-conditioned environment predictions. We demonstrate the efficacy of TPP in closed-loop simulations based on real-world nuScenes dataset and results show that TPP scales to realistic AV scenarios and significantly outperforms non-policy baselines.

READ FULL TEXT
research
06/18/2022

ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning

Trajectory prediction is a critical functionality of autonomous systems ...
research
10/26/2022

Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

Reasoning with occluded traffic agents is a significant open challenge f...
research
11/10/2022

Benchmark for Models Predicting Human Behavior in Gap Acceptance Scenarios

Autonomous vehicles currently suffer from a time-inefficient driving sty...
research
09/10/2021

Interactive multi-modal motion planning with Branch Model Predictive Control

Motion planning for autonomous robots and vehicles in presence of uncont...
research
10/10/2019

Jointly Learnable Behavior and Trajectory Planning for Self-Driving Vehicles

The motion planners used in self-driving vehicles need to generate traje...
research
09/27/2011

Probabilistic Hybrid Action Models for Predicting Concurrent Percept-driven Robot Behavior

This article develops Probabilistic Hybrid Action Models (PHAMs), a real...
research
12/06/2022

Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles

Autonomous vehicles must often contend with conflicting planning require...

Please sign up or login with your details

Forgot password? Click here to reset