An Evaluation of OCR on Egocentric Data

06/11/2022
by   Valentin Popescu, et al.
0

In this paper, we evaluate state-of-the-art OCR methods on Egocentric data. We annotate text in EPIC-KITCHENS images, and demonstrate that existing OCR methods struggle with rotated text, which is frequently observed on objects being handled. We introduce a simple rotate-and-merge procedure which can be applied to pre-trained OCR models that halves the normalized edit distance error. This suggests that future OCR attempts should incorporate rotation into model design and training procedures.

READ FULL TEXT

page 1

page 2

page 3

research
10/22/2020

mT5: A massively multilingual pre-trained text-to-text transformer

The recent "Text-to-Text Transfer Transformer" (T5) leveraged a unified ...
research
03/08/2021

Text Simplification by Tagging

Edit-based approaches have recently shown promising results on multiple ...
research
06/27/2012

A Split-Merge Framework for Comparing Clusterings

Clustering evaluation measures are frequently used to evaluate the perfo...
research
04/05/2022

Text2LIVE: Text-Driven Layered Image and Video Editing

We present a method for zero-shot, text-driven appearance manipulation i...
research
08/11/2022

A Deformation-based Edit Distance for Merge Trees

In scientific visualization, scalar fields are often compared through ed...
research
09/08/2023

GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue

Pre-trained models have achieved success in Chinese Short Text Matching ...
research
11/27/2019

LucidDream: Controlled Temporally-Consistent DeepDream on Videos

In this work, we aim to propose a set of techniques to improve the contr...

Please sign up or login with your details

Forgot password? Click here to reset