DISGO: Automatic End-to-End Evaluation for Scene Text OCR

08/25/2023
by   Mei-Yuh Hwang, et al.
0

This paper discusses the challenges of optical character recognition (OCR) on natural scenes, which is harder than OCR on documents due to the wild content and various image backgrounds. We propose to uniformly use word error rates (WER) as a new measurement for evaluating scene-text OCR, both end-to-end (e2e) performance and individual system component performances. Particularly for the e2e metric, we name it DISGO WER as it considers Deletion, Insertion, Substitution, and Grouping/Ordering errors. Finally we propose to utilize the concept of super blocks to automatically compute BLEU scores for e2e OCR machine translation. The small SCUT public test set is used to demonstrate WER performance by a modularized OCR system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2019

Towards End-to-End Text Spotting in Natural Scenes

Text spotting in natural scene images is of great importance for many im...
research
05/12/2021

TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text

A crucial component for the scene text based reasoning required for Text...
research
08/29/2019

PopEval: A Character-Level Approach to End-To-End Evaluation Compatible with Word-Level Benchmark Dataset

The most prevalent scope of interest for OCR applications used to be sca...
research
10/07/2013

End-to-End Text Recognition with Hybrid HMM Maxout Models

The problem of detecting and recognizing text in natural scenes has prov...
research
10/20/2020

Towards End-to-End In-Image Neural Machine Translation

In this paper, we offer a preliminary investigation into the task of in-...
research
11/25/2018

A pooling based scene text proposal technique for scene text reading in the wild

Automatic reading texts in scenes has attracted increasing interest in r...

Please sign up or login with your details

Forgot password? Click here to reset