DiffusionSTR: Diffusion Model for Scene Text Recognition

06/29/2023
by   Masato Fujitake, et al.
0

This paper presents Diffusion Model for Scene Text Recognition (DiffusionSTR), an end-to-end text recognition framework using diffusion models for recognizing text in the wild. While existing studies have viewed the scene text recognition task as an image-to-text transformation, we rethought it as a text-text one under images in a diffusion model. We show for the first time that the diffusion model can be applied to text recognition. Furthermore, experimental results on publicly available datasets show that the proposed method achieves competitive accuracy compared to state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2018

Advances of Scene Text Datasets

This article introduces publicly available datasets in scene text detect...
research
02/21/2023

A3S: Adversarial learning of semantic representations for Scene-Text Spotting

Scene-text spotting is a task that predicts a text area on natural scene...
research
05/12/2021

TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text

A crucial component for the scene text based reasoning required for Text...
research
11/24/2021

Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages

This paper presents a novel training method for end-to-end scene text re...
research
10/10/2017

AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition

Recognizing text in the wild is a really challenging task because of com...
research
04/26/2013

Reading Ancient Coin Legends: Object Recognition vs. OCR

Standard OCR is a well-researched topic of computer vision and can be co...
research
03/28/2023

Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion

We propose a high-quality 3D-to-3D conversion method, Instruct 3D-to-3D....

Please sign up or login with your details

Forgot password? Click here to reset