Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer

02/11/2022
by   Yair Kittenplon, et al.
12

Text spotting end-to-end methods have recently gained attention in the literature due to the benefits of jointly optimizing the text detection and recognition components. Existing methods usually have a distinct separation between the detection and recognition branches, requiring exact annotations for the two tasks. We introduce TextTranSpotter (TTS), a transformer-based approach for text spotting and the first text spotting framework which may be trained with both fully- and weakly-supervised settings. By learning a single latent representation per word detection, and using a novel loss function based on the Hungarian loss, our method alleviates the need for expensive localization annotations. Trained with only text transcription annotations on real data, our weakly-supervised method achieves competitive performance with previous state-of-the-art fully-supervised methods. When trained in a fully-supervised manner, TextTranSpotter shows state-of-the-art results on multiple benchmarks.

READ FULL TEXT

page 1

page 3

page 4

page 7

research
12/17/2022

Fully and Weakly Supervised Referring Expression Segmentation with End-to-End Learning

Referring Expression Segmentation (RES), which is aimed at localizing an...
research
06/05/2020

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos

Online action detection in untrimmed videos aims to identify an action a...
research
05/26/2022

Semantically Supervised Appearance Decomposition for Virtual Staging from a Single Panorama

We describe a novel approach to decompose a single panorama of an empty ...
research
11/27/2019

Towards Precise End-to-end Weakly Supervised Object Detection Network

It is challenging for weakly supervised object detection network to prec...
research
06/10/2021

DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

We present a novel deep neural model for text detection in document imag...
research
07/17/2020

Weakly-supervised Learning of Human Dynamics

This paper proposes a weakly-supervised learning framework for dynamics ...
research
03/24/2022

Weakly-Supervised End-to-End CAD Retrieval to Scan Objects

CAD model retrieval to real-world scene observations has shown strong pr...

Please sign up or login with your details

Forgot password? Click here to reset