MGPSN: Motion-Guided Pseudo Siamese Network for Indoor Video Head Detection

10/07/2021
by   Kailai Sun, et al.
2

Head detection in real-world videos is an important research topic in computer vision. However, existing studies face some challenges in complex scenes. The performance of head detectors deteriorates when objects which have similar head appearance exist for indoor videos. Moreover, heads have small scales and diverse poses, which increases the difficulty in detection. To handle these issues, we propose Motion-Guided Pseudo Siamese Network for Indoor Video Head Detection (MGPSN), an end-to-end model to learn the robust head motion features. MGPSN integrates spatial-temporal information on pixel level, guiding the model to extract effective head features. Experiments show that MGPSN is able to suppress static objects and enhance motion instances. Compared with previous methods, it achieves state-of-the-art performance on the crowd Brainwash dataset. Different backbone networks and detectors are evaluated to verify the flexibility and generality of MGPSN.

READ FULL TEXT

page 2

page 3

page 6

research
12/18/2017

Spatial-Temporal Memory Networks for Video Object Detection

We introduce Spatial-Temporal Memory Networks (STMN) for video object de...
research
04/06/2022

An Empirical Study of End-to-End Temporal Action Detection

Temporal action detection (TAD) is an important yet challenging task in ...
research
04/28/2021

Motion-guided Non-local Spatial-Temporal Network for Video Crowd Counting

We study video crowd counting, which is to estimate the number of object...
research
10/28/2022

Exploring Spatial-Temporal Features for Deepfake Detection and Localization

With the continuous research on Deepfake forensics, recent studies have ...
research
09/24/2019

Relational Learning for Joint Head and Human Detection

Head and human detection have been rapidly improved with the development...
research
01/19/2017

FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos

We propose an end-to-end learning framework for segmenting generic objec...
research
01/21/2021

MPASNET: Motion Prior-Aware Siamese Network for Unsupervised Deep Crowd Segmentation in Video Scenes

Crowd segmentation is a fundamental task serving as the basis of crowded...

Please sign up or login with your details

Forgot password? Click here to reset