VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision

04/06/2023
by   Mengyin Liu, et al.
0

Detecting pedestrians accurately in urban scenes is significant for realistic applications like autonomous driving or video surveillance. However, confusing human-like objects often lead to wrong detections, and small scale or heavily occluded pedestrians are easily missed due to their unusual appearances. To address these challenges, only object regions are inadequate, thus how to fully utilize more explicit and semantic contexts becomes a key problem. Meanwhile, previous context-aware pedestrian detectors either only learn latent contexts with visual clues, or need laborious annotations to obtain explicit and semantic contexts. Therefore, we propose in this paper a novel approach via Vision-Language semantic self-supervision for context-aware Pedestrian Detection (VLPD) to model explicitly semantic contexts without any extra annotations. Firstly, we propose a self-supervised Vision-Language Semantic (VLS) segmentation method, which learns both fully-supervised pedestrian detection and contextual segmentation via self-generated explicit labels of semantic classes by vision-language models. Furthermore, a self-supervised Prototypical Semantic Contrastive (PSC) learning method is proposed to better discriminate pedestrians and other classes, based on more explicit and semantic contexts obtained from VLS. Extensive experiments on popular benchmarks show that our proposed VLPD achieves superior performances over the previous state-of-the-arts, particularly under challenging circumstances like small scale and heavy occlusion. Code is available at https://github.com/lmy98129/VLPD.

READ FULL TEXT

page 1

page 2

page 7

research
05/21/2023

Unsupervised Multi-view Pedestrian Detection

With the prosperity of the video surveillance, multiple visual sensors h...
research
10/21/2020

Mutual-Supervised Feature Modulation Network for Occluded Pedestrian Detection

State-of-the-art pedestrian detectors have achieved significant progress...
research
03/21/2023

Learning Context-aware Classifier for Semantic Segmentation

Semantic segmentation is still a challenging task for parsing diverse co...
research
11/23/2022

Reason from Context with Self-supervised Learning

A tiny object in the sky cannot be an elephant. Context reasoning is cri...
research
03/19/2020

Pedestrian Detection: The Elephant In The Room

Pedestrian detection is used in many vision based applications ranging f...
research
06/26/2017

Illuminating Pedestrians via Simultaneous Detection & Segmentation

Pedestrian detection is a critical problem in computer vision with signi...
research
10/14/2019

Mask-Guided Attention Network for Occluded Pedestrian Detection

Pedestrian detection relying on deep convolution neural networks has mad...

Please sign up or login with your details

Forgot password? Click here to reset