PP-HumanSeg: Connectivity-Aware Portrait Segmentation with a Large-Scale Teleconferencing Video Dataset

12/14/2021
by Lutao Chu, et al.

As the COVID-19 pandemic spreads across the world, demand for video conferencing has surged. As a result, real-time portrait segmentation has become a popular feature for replacing the backgrounds of conference participants. While feature-rich datasets, models, and algorithms exist for segmentation that extracts body postures from everyday scenes, portrait segmentation has not yet been well covered in the video conferencing context. To facilitate progress in this field, we introduce an open-source solution named PP-HumanSeg. This work is the first to construct a large-scale video portrait dataset, which contains 291 videos from 23 conference scenes with 14K finely labeled frames and extensions to multi-camera teleconferencing. Furthermore, we propose a novel Semantic Connectivity-aware Learning (SCL) framework for semantic segmentation, which introduces a semantic connectivity-aware loss to improve the quality of segmentation results from the perspective of connectivity. We also propose an ultra-lightweight model trained with SCL for practical portrait segmentation, which achieves the best trade-off between IoU and inference speed. Extensive evaluations on our dataset demonstrate the superiority of SCL and our model. The source code is available at https://github.com/PaddlePaddle/PaddleSeg.
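
The abstract does not spell out how the connectivity-aware loss is computed, so the following is only an illustrative sketch of the general idea, not the authors' formulation or the PaddleSeg implementation. It assumes binary masks given as nested lists, 4-connectivity, and a hypothetical score that averages the IoU of overlapping predicted and ground-truth components while counting unmatched components as zero. The names `label_components`, `iou`, and `connectivity_score` are made up for this example.

```python
from collections import deque


def label_components(mask):
    """Return the 4-connected foreground components as sets of (row, col)."""
    rows = len(mask)
    cols = len(mask[0]) if mask else 0
    seen = set()
    components = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or (r, c) in seen:
                continue
            # Breadth-first flood fill starting from an unvisited foreground pixel.
            component = set()
            queue = deque([(r, c)])
            seen.add((r, c))
            while queue:
                y, x = queue.popleft()
                component.add((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and mask[ny][nx] and (ny, nx) not in seen:
                        seen.add((ny, nx))
                        queue.append((ny, nx))
            components.append(component)
    return components


def iou(a, b):
    """Intersection over union of two pixel sets."""
    union = len(a | b)
    return len(a & b) / union if union else 1.0


def connectivity_score(pred, gt):
    """Hypothetical connectivity agreement in [0, 1]; 1 - score can act as a loss."""
    pred_parts = label_components(pred)
    gt_parts = label_components(gt)
    if not pred_parts and not gt_parts:
        return 1.0  # Both masks are empty, so they agree trivially.
    pair_ious = []
    matched_pred, matched_gt = set(), set()
    for i, p in enumerate(pred_parts):
        for j, g in enumerate(gt_parts):
            if p & g:  # Components that overlap form a pair.
                pair_ious.append(iou(p, g))
                matched_pred.add(i)
                matched_gt.add(j)
    # Components with no overlapping partner contribute zero to the average.
    isolated = (len(pred_parts) - len(matched_pred)) + (len(gt_parts) - len(matched_gt))
    return sum(pair_ious) / (len(pair_ious) + isolated)


gt = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
# Same portrait blob plus one stray foreground pixel far from it.
pred_fragmented = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 1],
    [0, 0, 0, 0, 0],
]

print(connectivity_score(gt, gt))
print(connectivity_score(pred_fragmented, gt))
```

In this toy case the stray pixel lowers plain pixel-level IoU only slightly (4 of 5 foreground pixels still agree), while the component-level score drops much further because the prediction now contains an extra, unmatched region. That gap is the kind of signal a connectivity-oriented loss is meant to capture; a trainable version would additionally need a differentiable formulation, which this sketch does not attempt.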


