Weak Supervision for Generating Pixel-Level Annotations in Scene Text Segmentation

11/19/2019
by   Simone Bonechi, et al.
12

Providing pixel-level supervisions for scene text segmentation is inherently difficult and costly, so that only few small datasets are available for this task. To face the scarcity of training data, previous approaches based on Convolutional Neural Networks (CNNs) rely on the use of a synthetic dataset for pre-training. However, synthetic data cannot reproduce the complexity and variability of natural images. In this work, we propose to use a weakly supervised learning approach to reduce the domain-shift between synthetic and real data. Leveraging the bounding-box supervision of the COCO-Text and the MLT datasets, we generate weak pixel-level supervisions of real images. In particular, the COCO-Text-Segmentation (COCO_TS) and the MLT-Segmentation (MLT_S) datasets are created and released. These two datasets are used to train a CNN, the Segmentation Multiscale Attention Network (SMANet), which is specifically designed to face some peculiarities of the scene text segmentation task. The SMANet is trained end-to-end on the proposed datasets, and the experiments show that COCO_TS and MLT_S are a valid alternative to synthetic images, allowing to use only a fraction of the training samples and improving significantly the performances.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 6

page 8

research
04/01/2019

COCO_TS Dataset: Pixel-level Annotations Based on Weak Supervision for Scene Text Segmentation

The absence of large scale datasets with pixel-level supervisions is a s...
research
08/27/2019

Curved Text Detection in Natural Scene Images with Semi- and Weakly-Supervised Learning

Detecting curved text in the wild is very challenging. Recently, most st...
research
04/19/2019

Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes

Semantic segmentation, a pixel-level vision task, is developed rapidly b...
research
08/25/2023

Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions

Text segmentation tasks have a very wide range of application values, su...
research
01/16/2022

Towards holistic scene understanding: Semantic segmentation and beyond

This dissertation addresses visual scene understanding and enhances segm...
research
03/23/2019

What Synthesis is Missing: Depth Adaptation Integrated with Weak Supervision for Indoor Scene Parsing

Scene Parsing is a crucial step to enable autonomous systems to understa...
research
06/12/2019

Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Network

We present a new handwritten text segmentation method by training a conv...

Please sign up or login with your details

Forgot password? Click here to reset