
Bypass Enhancement RGB Stream Model for Pedestrian Action Recognition of Autonomous Vehicles

by Dong Cao, et al.

Pedestrian action recognition and intention prediction are among the core problems in autonomous driving, and action recognition is one of the key technologies in this field. Many researchers have worked to improve the accuracy of recognition algorithms. However, relatively few studies address the computational complexity of the algorithms or the real-time performance of the system. In autonomous driving, real-time performance and ultra-low latency are extremely important evaluation criteria, because they are directly related to the availability and safety of the system. To this end, we construct a bypass enhanced RGB flow model, which builds on the previous two-branch algorithm that extracts RGB feature information and optical flow feature information separately. In the training phase, the two branches are merged by distillation; in the inference phase, bypass enhancement is added to preserve accuracy. The real-time performance of the action recognition algorithm improves significantly without a decrease in accuracy. Experiments confirm the superiority and effectiveness of our algorithm.
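
The abstract does not specify the distillation objective, so the following is only a generic sketch of feature distillation, not the authors' formulation. It assumes an RGB branch (student) trained with a classification loss plus a term that pulls its features toward those of an optical flow branch (teacher). The function names, the mean-squared-error feature term, and the weight `alpha` are all hypothetical choices made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Classification loss for one sample with an integer class label."""
    return -math.log(softmax(logits)[label])


def feature_mse(student_feat, teacher_feat):
    """Mean squared error between student and teacher feature vectors."""
    diffs = [(s - t) ** 2 for s, t in zip(student_feat, teacher_feat)]
    return sum(diffs) / len(diffs)


def distillation_loss(rgb_logits, rgb_feat, flow_feat, label, alpha=0.5):
    """Hypothetical training loss for the RGB branch.

    Assumption: the flow branch acts as a fixed teacher, and alpha
    weights how strongly RGB features are pulled toward flow features.
    """
    return cross_entropy(rgb_logits, label) + alpha * feature_mse(rgb_feat, flow_feat)


# Toy example: three action classes, four-dimensional features.
loss = distillation_loss(
    rgb_logits=[2.0, 0.5, -1.0],
    rgb_feat=[0.1, 0.4, 0.3, 0.9],
    flow_feat=[0.2, 0.4, 0.1, 1.0],
    label=0,
)
print(loss)
```

Under this assumption, only the RGB branch would need to run at inference time, which is where a latency saving of the kind the abstract describes could come from, since optical flow computation is skipped.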




Cross-Enhancement Transform Two-Stream 3D ConvNets for Pedestrian Action Recognition of Autonomous Vehicles

Action recognition is an important research topic in machine vision. It ...

PORCA: Modeling and Planning for Autonomous Driving among Many Pedestrians

This paper presents a planning system for autonomous driving among many ...

Lane Change Classification and Prediction with Action Recognition Networks

Anticipating lane change intentions of surrounding vehicles is crucial f...

Self-Configurable Stabilized Real-Time Detection Learning for Autonomous Driving Applications

Guaranteeing real-time and accurate object detection simultaneously is p...

Simple yet efficient real-time pose-based action recognition

Recognizing human actions is a core challenge for autonomous systems as ...

Recognition and 3D Localization of Pedestrian Actions from Monocular Video

Understanding and predicting pedestrian behavior is an important and cha...

Efficient Video Understanding via Layered Multi Frame-Rate Analysis

One of the greatest challenges in the design of a real-time perception s...