MTRNet: A Generic Scene Text Eraser

03/11/2019
by Osman Tursun, et al.

Existing text removal algorithms target monolingual scripts with regular shapes and layouts. However, to the best of our knowledge, no generic text removal method is available that can remove all or user-specified text regions regardless of font, script, language, or shape. Developing such a generic text eraser for real scenes is challenging, since it inherits all the difficulties of multi-lingual and curved text detection and of inpainting. To fill this gap, we propose a mask-based text removal network (MTRNet). MTRNet is a conditional generative adversarial network (cGAN) with an auxiliary mask. The auxiliary mask not only makes the cGAN a generic text eraser, but also enables stable training and early convergence on a challenging large-scale synthetic dataset that was originally proposed for text detection in real scenes. Moreover, MTRNet achieves state-of-the-art results on several real-world datasets, including ICDAR 2013, ICDAR 2017 MLT, and CTW1500, without being explicitly trained on them, outperforming previous state-of-the-art methods trained directly on these datasets.
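To make the idea of an auxiliary mask concrete, here is a minimal sketch of how user-specified text regions could be turned into a binary mask and attached to an image as an extra channel before it is passed to a generator. The abstract does not say how MTRNet consumes the mask, so the channel-wise concatenation, the box format, and the function name `build_masked_input` are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def build_masked_input(image, boxes):
    """Attach a binary text-region mask to an image as a fourth channel.

    Hypothetical helper, not taken from the paper.
    image: H x W x 3 float array.
    boxes: list of (top, left, bottom, right) pixel boxes marking the text
           to erase; bottom and right are exclusive.
    Returns the H x W x 4 conditioned input and the H x W x 1 mask.
    """
    height, width, _ = image.shape
    mask = np.zeros((height, width, 1), dtype=image.dtype)
    for top, left, bottom, right in boxes:
        # Mark every pixel inside the user-specified region as "text".
        mask[top:bottom, left:right, 0] = 1.0
    # Assumed conditioning scheme: stack the mask behind the RGB channels.
    conditioned = np.concatenate([image, mask], axis=-1)
    return conditioned, mask


if __name__ == "__main__":
    rgb = np.zeros((4, 6, 3), dtype=np.float32)
    conditioned, mask = build_masked_input(rgb, [(1, 2, 3, 5)])
    print(conditioned.shape)
```

Under this assumed scheme, erasing all text versus only selected text differs only in which boxes are passed in; the network input keeps the same shape either way.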


research
12/16/2019

MTRNet++: One-stage Mask-based Scene Text Eraser

A precise, controllable, interpretable and easily trainable text removal...
research
06/13/2023

PSSTRNet: Progressive Segmentation-guided Scene Text Removal Network

Scene text removal (STR) is a challenging task due to the complex text f...
research
09/01/2023

Selective Scene Text Removal

Scene text removal (STR) is the image transformation task to remove text...
research
04/23/2021

Stroke-Based Scene Text Erasing Using Synthetic Data

Scene text erasing, which replaces text regions with reasonable content ...
research
11/19/2020

Scene text removal via cascaded text stroke detection and erasing

Recent learning-based approaches show promising performance improvement ...
research
11/12/2022

MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text Removal

Scene text removal aims to remove the text and fill the regions with per...
research
12/07/2017

Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal

Understanding shadows from a single image spontaneously derives into two...
