Predicting Pedestrian Crossing Intention with Feature Fusion and Spatio-Temporal Attention

04/12/2021
by   Dongfang Yang, et al.
0

Predicting vulnerable road user behavior is an essential prerequisite for deploying Automated Driving Systems (ADS) in the real-world. Pedestrian crossing intention should be recognized in real-time, especially for urban driving. Recent works have shown the potential of using vision-based deep neural network models for this task. However, these models are not robust and certain issues still need to be resolved. First, the global spatio-temproal context that accounts for the interaction between the target pedestrian and the scene has not been properly utilized. Second, the optimum strategy for fusing different sensor data has not been thoroughly investigated. This work addresses the above limitations by introducing a novel neural network architecture to fuse inherently different spatio-temporal features for pedestrian crossing intention prediction. We fuse different phenomena such as sequences of RGB imagery, semantic segmentation masks, and ego-vehicle speed in an optimum way using attention mechanisms and a stack of recurrent neural networks. The optimum architecture was obtained through exhaustive ablation and comparison studies. Extensive comparative experiments on the JAAD pedestrian action prediction benchmark demonstrate the effectiveness of the proposed method, where state-of-the-art performance was achieved. Our code is open-source and publicly available.

READ FULL TEXT

page 1

page 3

page 4

research
09/02/2021

TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction

Understanding the behaviors and intentions of pedestrians is still one o...
research
05/15/2020

FuSSI-Net: Fusion of Spatio-temporal Skeletons for Intention Prediction Network

Pedestrian intention recognition is very important to develop robust and...
research
05/18/2021

IntFormer: Predicting pedestrian intention with the aid of the Transformer architecture

Understanding pedestrian crossing behavior is an essential goal in intel...
research
04/04/2022

High Efficiency Pedestrian Crossing Prediction

Predicting pedestrian crossing intention is an indispensable aspect of d...
research
05/27/2023

Analysis over vision-based models for pedestrian action anticipation

Anticipating human actions in front of autonomous vehicles is a challeng...
research
11/12/2020

Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs

In this paper, we tackle the problem of spatio-temporal tagging of self-...
research
09/23/2020

A Real-time Vision Framework for Pedestrian Behavior Recognition and Intention Prediction at Intersections Using 3D Pose Estimation

Minimizing traffic accidents between vehicles and pedestrians is one of ...

Please sign up or login with your details

Forgot password? Click here to reset