Long-term Multi-granularity Deep Framework for Driver Drowsiness Detection

01/08/2018
by Jie Lyu, et al.

In real-world driver drowsiness detection from videos, head pose varies so widely (for example, when the driver looks aside or lowers the head) that existing methods operating on the global face cannot extract effective features. Previous approaches also rarely consider temporal dependencies of variable length, such as those in yawning and speaking. In this paper, we propose a Long-term Multi-granularity Deep Framework to detect driver drowsiness in driving videos containing frontal faces. The framework includes two key components: (1) a Multi-granularity Convolutional Neural Network (MCNN), a novel network that applies a group of parallel CNN extractors to well-aligned facial patches of different granularities. It extracts facial representations that remain effective under large variations of head pose, and it flexibly fuses detailed appearance clues from the main facial parts with local-to-global spatial constraints; (2) a deep Long Short-Term Memory (LSTM) network applied to the facial representations to explore long-term relationships of variable length over sequential frames, which makes it possible to distinguish states with temporal dependencies, such as blinking and closing the eyes. Our approach achieves 90.05% accuracy at about 37 fps on the evaluation set of the public NTHU-DDD dataset, making it the state-of-the-art method for driver drowsiness detection. Moreover, we build a new dataset named FI-DDD, which annotates the temporal locations of drowsiness with higher precision.
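The two-stage pipeline described above (parallel per-patch extractors, fusion, then a recurrent pass over frames) can be illustrated with a rough sketch in plain Python. This is not the authors' implementation: the patch coordinates in `REGIONS`, the statistics-based `patch_feature` stand-in for a CNN extractor, fusion by simple concatenation, the single-layer `TinyLSTMCell` with random weights, and all function names are assumptions made purely for illustration.

```python
import math
import random

# Hypothetical (top, left, height, width) patches on an 8x8 grayscale face:
# whole face, eye band, mouth band. The paper's actual patches are not given here.
REGIONS = [(0, 0, 8, 8), (1, 1, 2, 6), (5, 2, 2, 4)]

def crop(frame, top, left, height, width):
    """Return a rectangular patch from a 2-D list-of-lists frame."""
    return [row[left:left + width] for row in frame[top:top + height]]

def patch_feature(patch):
    """Stand-in for one CNN extractor: mean and variance of pixel intensities."""
    pixels = [p for row in patch for p in row]
    mean = sum(pixels) / len(pixels)
    var = sum((p - mean) ** 2 for p in pixels) / len(pixels)
    return [mean, var]

def multi_granularity_features(frame, regions=REGIONS):
    """Run one extractor per patch and fuse the results by concatenation."""
    feats = []
    for top, left, height, width in regions:
        feats.extend(patch_feature(crop(frame, top, left, height, width)))
    return feats

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

class TinyLSTMCell:
    """Single LSTM cell with small random (untrained) weights."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        n = input_size + hidden_size
        self.hidden_size = hidden_size
        # One weight matrix and bias per gate: input, forget, candidate, output.
        self.W = {g: [[rng.uniform(-0.1, 0.1) for _ in range(n)]
                      for _ in range(hidden_size)] for g in "ifgo"}
        self.b = {g: [0.0] * hidden_size for g in "ifgo"}

    def _affine(self, gate, z):
        return [sum(w * v for w, v in zip(row, z)) + b
                for row, b in zip(self.W[gate], self.b[gate])]

    def step(self, x, h, c):
        z = x + h  # concatenate current input with previous hidden state
        i = [sigmoid(v) for v in self._affine("i", z)]
        f = [sigmoid(v) for v in self._affine("f", z)]
        g = [math.tanh(v) for v in self._affine("g", z)]
        o = [sigmoid(v) for v in self._affine("o", z)]
        c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
        h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
        return h_new, c_new

def encode_clip(frames, cell, regions=REGIONS):
    """Feed per-frame fused features through the LSTM; works for any clip length."""
    h = [0.0] * cell.hidden_size
    c = [0.0] * cell.hidden_size
    for frame in frames:
        h, c = cell.step(multi_granularity_features(frame, regions), h, c)
    return h

# Example: a 5-frame clip of random 8x8 frames with intensities in [0, 1].
_rng = random.Random(42)
clip = [[[_rng.random() for _ in range(8)] for _ in range(8)] for _ in range(5)]
cell = TinyLSTMCell(input_size=2 * len(REGIONS), hidden_size=4)
clip_state = encode_clip(clip, cell)
```

In a real system the final hidden state would feed a trained classifier over drowsiness states; here it only shows how clips of different lengths map to a fixed-size representation.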
