Open Images V5 Text Annotation and Yet Another Mask Text Spotter

06/23/2021
by ilya-krylov, et al.

A large-scale human-labeled dataset plays an important role in creating high-quality deep learning models. In this paper we present text annotation for the Open Images V5 dataset. To our knowledge, it is the largest publicly available manually created text annotation. Using this annotation, we trained a simple Mask-RCNN-based network, referred to as Yet Another Mask Text Spotter (YAMTS), which achieves competitive performance with, and in some cases outperforms, current state-of-the-art approaches on the ICDAR2013, ICDAR2015 and Total-Text datasets. Code for the text spotting model is available online at: https://github.com/openvinotoolkit/training_extensions. The model can be exported to OpenVINO format and run on Intel CPUs.


