ReCoAt: A Deep Learning-based Framework for Multi-Modal Motion Prediction in Autonomous Driving Application

07/02/2022
by   Zhiyu Huang, et al.
0

This paper proposes a novel deep learning framework for multi-modal motion prediction. The framework consists of three parts: recurrent neural networks to process the target agent's motion process, convolutional neural networks to process the rasterized environment representation, and a distance-based attention mechanism to process the interactions among different agents. We validate the proposed framework on a large-scale real-world driving dataset, Waymo open motion dataset, and compare its performance against other methods on the standard testing benchmark. The qualitative results manifest that the predicted trajectories given by our model are accurate, diverse, and in accordance with the road structure. The quantitative results on the standard benchmark reveal that our model outperforms other baseline methods in terms of prediction accuracy and other evaluation metrics. The proposed framework is the second-place winner of the 2021 Waymo open dataset motion prediction challenge.

READ FULL TEXT

page 3

page 4

page 5

research
09/14/2021

Multi-modal Motion Prediction with Transformer-based Neural Network for Autonomous Driving

Predicting the behaviors of other agents on the road is critical for aut...
research
06/03/2017

IDK Cascades: Fast Deep Learning by Learning not to Overthink

Advances in deep learning have led to substantial increases in predictio...
research
05/03/2022

TartanDrive: A Large-Scale Dataset for Learning Off-Road Dynamics Models

We present TartanDrive, a large scale dataset for learning dynamics mode...
research
04/23/2023

Learning-enabled multi-modal motion prediction in urban environments

Motion prediction is a key factor towards the full deployment of autonom...
research
10/28/2022

Towards Trustworthy Multi-Modal Motion Prediction: Evaluation and Interpretability

Predicting the motion of other road agents enables autonomous vehicles t...
research
07/18/2021

Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos

Anticipating human actions is an important task that needs to be address...
research
04/26/2023

A Deep Learning Framework for Verilog Autocompletion Towards Design and Verification Automation

Innovative Electronic Design Automation (EDA) solutions are important to...

Please sign up or login with your details

Forgot password? Click here to reset