Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images

11/25/2022
by   Shufan Li, et al.
0

Line Chart Data Extraction is a natural extension of Optical Character Recognition where the objective is to recover the underlying numerical information a chart image represents. Some recent works such as ChartOCR approach this problem using multi-stage networks combining OCR models with object detection frameworks. However, most of the existing datasets and models are based on "clean" images such as screenshots that drastically differ from camera photos. In addition, creating domain-specific new datasets requires extensive labeling which can be time-consuming. Our main contributions are as follows: we propose a synthetic data generation framework and a one-stage model that outputs text labels, mark coordinates, and perspective estimation simultaneously. We collected two datasets consisting of real camera photos for evaluation. Results show that our model trained only on synthetic data can be applied to real photos without any fine-tuning and is feasible for real-world application.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 11

page 14

research
06/16/2023

The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models

Despite the notable accomplishments of deep object detection models, a m...
research
06/20/2023

Exploring the Effectiveness of Dataset Synthesis: An application of Apple Detection in Orchards

Deep object detection models have achieved notable successes in recent y...
research
02/26/2019

An Annotation Saved is an Annotation Earned: Using Fully Synthetic Training for Object Instance Detection

Deep learning methods typically require vast amounts of training data to...
research
05/10/2022

UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection

Recent scene text detection methods are almost based on deep learning an...
research
11/26/2021

Traditional Chinese Synthetic Datasets Verified with Labeled Data for Scene Text Recognition

Scene text recognition (STR) has been widely studied in academia and ind...
research
12/17/2021

Interpreting Audiograms with Multi-stage Neural Networks

Audiograms are a particular type of line charts representing individuals...
research
05/10/2021

BIM Hyperreality: Data Synthesis Using BIM and Hyperrealistic Rendering for Deep Learning

Deep learning is expected to offer new opportunities and a new paradigm ...

Please sign up or login with your details

Forgot password? Click here to reset