A Novel Driver Distraction Behavior Detection Based on Self-Supervised Learning Framework with Masked Image Modeling

by   Yingzhi Zhang, et al.

Driver distraction causes a significant number of traffic accidents every year, resulting in economic losses and casualties. Currently, the level of automation in commercial vehicles is far from completely unmanned, and drivers still play an important role in operating and controlling the vehicle. Therefore, driver distraction behavior detection is crucial for road safety. At present, driver distraction detection primarily relies on traditional Convolutional Neural Networks (CNN) and supervised learning methods. However, there are still challenges such as the high cost of labeled datasets, limited ability to capture high-level semantic information, and weak generalization performance. In order to solve these problems, this paper proposes a new self-supervised learning method based on masked image modeling for driver distraction behavior detection. Firstly, a self-supervised learning framework for masked image modeling (MIM) is introduced to solve the serious human and material consumption issues caused by dataset labeling. Secondly, the Swin Transformer is employed as an encoder. Performance is enhanced by reconfiguring the Swin Transformer block and adjusting the distribution of the number of window multi-head self-attention (W-MSA) and shifted window multi-head self-attention (SW-MSA) detection heads across all stages, which leads to model more lightening. Finally, various data augmentation strategies are used along with the best random masking strategy to strengthen the model's recognition and generalization ability. Test results on a large-scale driver distraction behavior dataset show that the self-supervised learning method proposed in this paper achieves an accuracy of 99.60 of advanced supervised learning methods.


page 4

page 8

page 10

page 11

page 15


Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-Attention

Driver Monitoring Systems (DMSs) are crucial for safe hand-over actions ...

Passive and Active Learning of Driver Behavior from Electric Vehicles

Modeling driver behavior provides several advantages in the automotive i...

Self-supervised Social Relation Representation for Human Group Detection

Human group detection, which splits crowd of people into groups, is an i...

Drive-Net: Convolutional Network for Driver Distraction Detection

To help prevent motor vehicle accidents, there has been significant inte...

Semi-supervised Learning: Fusion of Self-supervised, Supervised Learning, and Multimodal Cues for Tactical Driver Behavior Detection

In this paper, we presented a preliminary study for tactical driver beha...

Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs

The computer-assisted radiologic informative report has received increas...

Please sign up or login with your details

Forgot password? Click here to reset