Text Spotting Transformers

04/05/2022
by   Xiang Zhang, et al.
6

In this paper, we present TExt Spotting TRansformers (TESTR), a generic end-to-end text spotting framework using Transformers for text detection and recognition in the wild. TESTR builds upon a single encoder and dual decoders for the joint text-box control point regression and character recognition. Other than most existing literature, our method is free from Region-of-Interest operations and heuristics-driven post-processing procedures; TESTR is particularly effective when dealing with curved text-boxes where special cares are needed for the adaptation of the traditional bounding-box representations. We show our canonical representation of control points suitable for text instances in both Bezier curve and polygon annotations. In addition, we design a bounding-box guided polygon detection (box-to-polygon) process. Experiments on curved and arbitrarily shaped datasets demonstrate state-of-the-art performances of the proposed TESTR algorithm.

READ FULL TEXT

page 2

page 3

page 5

page 7

page 8

research
01/06/2021

Line Segment Detection Using Transformers without Edges

In this paper, we present a holistically end-to-end algorithm for line s...
research
02/22/2022

Arbitrary Shape Text Detection using Transformers

Recent text detection frameworks require several handcrafted components ...
research
01/04/2023

SPTS v2: Single-Point Scene Text Spotting

End-to-end scene text spotting has made significant progress due to its ...
research
10/24/2018

Resolving Referring Expressions in Images With Labeled Elements

Images may have elements containing text and a bounding box associated w...
research
12/15/2021

SPTS: Single-Point Text Spotting

Almost all scene text spotting (detection and recognition) methods rely ...
research
06/06/2023

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

End-to-end text spotting is a vital computer vision task that aims to in...
research
05/24/2016

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

In this paper, we develop a novel unified framework called DeepText for ...

Please sign up or login with your details

Forgot password? Click here to reset