X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion

12/07/2022
by   Hanqing Zhao, et al.
0

Copy-Paste is a simple and effective data augmentation strategy for instance segmentation. By randomly pasting object instances onto new background images, it creates new training data for free and significantly boosts the segmentation performance, especially for rare object categories. Although diverse, high-quality object instances used in Copy-Paste result in more performance gain, previous works utilize object instances either from human-annotated instance segmentation datasets or rendered from 3D object models, and both approaches are too expensive to scale up to obtain good diversity. In this paper, we revisit Copy-Paste at scale with the power of newly emerged zero-shot recognition models (e.g., CLIP) and text2image models (e.g., StableDiffusion). We demonstrate for the first time that using a text2image model to generate images or zero-shot recognition model to filter noisily crawled images for different object categories is a feasible way to make Copy-Paste truly scalable. To make such success happen, we design a data acquisition and processing framework, dubbed "X-Paste", upon which a systematic study is conducted. On the LVIS dataset, X-Paste provides impressive improvements over the strong baseline CenterNet2 with Swin-L as the backbone. Specifically, it archives +2.6 box AP and +2.1 mask AP gains on all classes and even more significant gains with +6.8 box AP +6.5 mask AP on long-tail classes.

READ FULL TEXT

page 4

page 6

research
12/13/2020

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

Building instance segmentation models that are data-efficient and can ha...
research
11/12/2019

Equalization Loss for Large Vocabulary Instance Segmentation

Recent object detection and instance segmentation tasks mainly focus on ...
research
04/14/2021

Zero-Shot Instance Segmentation

Deep learning has significantly improved the precision of instance segme...
research
02/14/2023

Frustratingly Simple but Effective Zero-shot Detection and Segmentation: Analysis and a Strong Baseline

Methods for object detection and segmentation often require abundant ins...
research
04/27/2023

Zero-shot Unsupervised Transfer Instance Segmentation

Segmentation is a core computer vision competency, with applications spa...
research
10/18/2022

Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to Parcel Logistics

State-of-the-art approaches in computer vision heavily rely on sufficien...

Please sign up or login with your details

Forgot password? Click here to reset