Real-time Human-Centric Segmentation for Complex Video Scenes

08/16/2021
by   Ran Yu, et al.
8

Most existing video tasks related to "human" focus on the segmentation of salient humans, ignoring the unspecified others in the video. Few studies have focused on segmenting and tracking all humans in a complex video, including pedestrians and humans of other states (e.g., seated, riding, or occluded). In this paper, we propose a novel framework, abbreviated as HVISNet, that segments and tracks all presented people in given videos based on a one-stage detector. To better evaluate complex scenes, we offer a new benchmark called HVIS (Human Video Instance Segmentation), which comprises 1447 human instance masks in 805 high-resolution videos in diverse scenes. Extensive experiments show that our proposed HVISNet outperforms the state-of-the-art methods in terms of accuracy at a real-time inference speed (30 FPS), especially on complex video scenes. We also notice that using the center of the bounding box to distinguish different individuals severely deteriorates the segmentation accuracy, especially in heavily occluded conditions. This common phenomenon is referred to as the ambiguous positive samples problem. To alleviate this problem, we propose a mechanism named Inner Center Sampling to improve the accuracy of instance segmentation. Such a plug-and-play inner center sampling mechanism can be incorporated in any instance segmentation models based on a one-stage detector to improve the performance. In particular, it gains 4.1 mAP improvement on the state-of-the-art method in the case of occluded humans. Code and data are available at https://github.com/IIGROUP/HVISNet.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 7

page 8

research
10/07/2022

Humans need not label more humans: Occlusion Copy Paste for Occluded Human Instance Segmentation

Modern object detection and instance segmentation networks stumble when ...
research
07/29/2020

SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation

Single-stage instance segmentation approaches have recently gained popul...
research
11/15/2021

Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge

Although deep learning methods have achieved advanced video object recog...
research
02/02/2021

Occluded Video Instance Segmentation

Can our video understanding systems perceive objects when a heavy occlus...
research
12/22/2020

YolactEdge: Real-time Instance Segmentation on the Edge (Jetson AGX Xavier: 30 FPS, RTX 2080 Ti: 170 FPS)

We propose YolactEdge, the first competitive instance segmentation appro...
research
05/09/2023

Real-time instance segmentation with polygons using an Intersection-over-Union loss

Predicting a binary mask for an object is more accurate but also more co...
research
11/17/2020

SeekNet: Improved Human Instance Segmentation via Reinforcement Learning Based Optimized Robot Relocation

Amodal recognition is the ability of the system to detect occluded objec...

Please sign up or login with your details

Forgot password? Click here to reset