Self-Training for Domain Adaptive Scene Text Detection

05/23/2020
by   Yudi Chen, et al.
30

Though deep learning based scene text detection has achieved great progress, well-trained detectors suffer from severe performance degradation for different domains. In general, a tremendous amount of data is indispensable to train the detector in the target domain. However, data collection and annotation are expensive and time-consuming. To address this problem, we propose a self-training framework to automatically mine hard examples with pseudo-labels from unannotated videos or images. To reduce the noise of hard examples, a novel text mining module is implemented based on the fusion of detection and tracking results. Then, an image-to-video generation method is designed for the tasks that videos are unavailable and only images can be used. Experimental results on standard benchmarks, including ICDAR2015, MSRA-TD500, ICDAR2017 MLT, demonstrate the effectiveness of our self-training method. The simple Mask R-CNN adapted with self-training and fine-tuned on real data can achieve comparable or even superior results with the state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 7

research
09/03/2020

Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild

Deep learning-based scene text detection can achieve preferable performa...
research
12/01/2022

Domain Adaptive Scene Text Detection via Subcategorization

Most existing scene text detectors require large-scale training data whi...
research
04/15/2019

Automatic adaptation of object detectors to new domains using self-training

This work addresses the unsupervised adaptation of an existing object de...
research
09/09/2023

Self-Supervised Transformer with Domain Adaptive Reconstruction for General Face Forgery Video Detection

Face forgery videos have caused severe social public concern, and variou...
research
07/30/2020

Deep Traffic Sign Detection and Recognition Without Target Domain Real Images

Deep learning has been successfully applied to several problems related ...
research
03/29/2021

Tracking Based Semi-Automatic Annotation for Scene Text Videos

Recently, video scene text detection has received increasing attention d...
research
01/11/2021

ArrowGAN : Learning to Generate Videos by Learning Arrow of Time

Training GANs on videos is even more sophisticated than on images becaus...

Please sign up or login with your details

Forgot password? Click here to reset