SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

03/19/2022
by Mingxin Huang, et al.

End-to-end scene text spotting has attracted great attention in recent years owing to the success of exploiting the intrinsic synergy between scene text detection and recognition. However, recent state-of-the-art methods usually combine detection and recognition simply by sharing the backbone, which does not directly exploit the feature interaction between the two tasks. In this paper, we propose a new end-to-end scene text spotting framework termed SwinTextSpotter. Using a transformer encoder with a dynamic head as the detector, we unify the two tasks with a novel Recognition Conversion mechanism that explicitly guides text localization through the recognition loss. The straightforward design results in a concise framework that requires neither an additional rectification module nor character-level annotation for arbitrarily-shaped text. Qualitative and quantitative experiments on the multi-oriented datasets RoIC13 and ICDAR 2015, the arbitrarily-shaped datasets Total-Text and CTW1500, and the multi-lingual datasets ReCTS (Chinese) and VinText (Vietnamese) demonstrate that SwinTextSpotter significantly outperforms existing methods. Code is available at https://github.com/mxin262/SwinTextSpotter.
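
To make the idea of guiding localization through a recognition loss more concrete, here is a minimal toy sketch. It is not the authors' implementation: it assumes, purely for illustration, that the detector produces a soft mask that gates the features passed to the recognizer, and it replaces the real recognition loss with a squared error. All names (`gate_features`, `toy_recognition_loss`, `loss_grad_wrt_mask`) and all numbers are hypothetical. The point is only that, once recognition features depend on the detection mask, the recognition loss yields a gradient with respect to the detector's output.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gate_features(features, mask_logits):
    """Multiply each feature by a soft detection mask (assumed gating form)."""
    return [f * sigmoid(m) for f, m in zip(features, mask_logits)]

def toy_recognition_loss(features, mask_logits, target):
    """Stand-in for a recognition loss: squared error on the gated features."""
    gated = gate_features(features, mask_logits)
    return sum((g - t) ** 2 for g, t in zip(gated, target))

def loss_grad_wrt_mask(features, mask_logits, target):
    """Analytic gradient of the toy loss with respect to each mask logit."""
    grads = []
    for f, m, t in zip(features, mask_logits, target):
        s = sigmoid(m)
        # d/dm (f*s - t)^2 = 2 * (f*s - t) * f * s * (1 - s)
        grads.append(2.0 * (f * s - t) * f * s * (1.0 - s))
    return grads

# Hypothetical flattened region features and detector mask logits.
features = [0.9, 0.8, 0.7, 0.6]
mask_logits = [2.0, 2.0, -2.0, -2.0]   # detector believes positions 0-1 are text
target = [0.9, 0.8, 0.0, 0.0]          # toy target: keep text, zero out background

grads = loss_grad_wrt_mask(features, mask_logits, target)
# Negative gradient: gradient descent raises the logit (text positions).
# Positive gradient: gradient descent lowers the logit (background positions).
print([round(g, 4) for g in grads])
```

In this toy setting the gradient is negative at the text positions and positive at the background positions, so a gradient-descent step driven by the recognition-style loss alone would sharpen the detection mask.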

research
12/10/2019

A Feasible Framework for Arbitrary-Shaped Scene Text Recognition

Deep learning based methods have achieved surprising progress in Scene T...
research
08/20/2023

ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

In recent years, end-to-end scene text spotting approaches are evolving ...
research
10/20/2021

ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter

Recent approaches for end-to-end text spotting have achieved promising r...
research
12/15/2021

SPTS: Single-Point Text Spotting

Almost all scene text spotting (detection and recognition) methods rely ...
research
07/21/2022

Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context

Text removal has attracted increasing attention due to its various app...
research
05/13/2021

Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition

Text recognition is a popular topic for its broad applications. In this ...
research
10/28/2017

Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition

Text in curve orientation, despite being one of the common text orientat...
