Line Graphics Digitization: A Step Towards Full Automation

07/05/2023
by   Omar Moured, et al.
0

The digitization of documents allows for wider accessibility and reproducibility. While automatic digitization of document layout and text content has been a long-standing focus of research, this problem in regard to graphical elements, such as statistical plots, has been under-explored. In this paper, we introduce the task of fine-grained visual understanding of mathematical graphics and present the Line Graphics (LG) dataset, which includes pixel-wise annotations of 5 coarse and 10 fine-grained categories. Our dataset covers 520 images of mathematical graphics collected from 450 documents from different disciplines. Our proposed dataset can support two different computer vision tasks, i.e., semantic segmentation and object detection. To benchmark our LG dataset, we explore 7 state-of-the-art models. To foster further research on the digitization of statistical graphs, we will make the dataset, code, and models publicly available to the community.

READ FULL TEXT

page 9

page 13

research
06/28/2022

MACSA: A Multimodal Aspect-Category Sentiment Analysis Dataset with Multimodal Fine-grained Aligned Annotations

Multimodal fine-grained sentiment analysis has recently attracted increa...
research
06/01/2020

DocBank: A Benchmark Dataset for Document Layout Analysis

Document layout analysis usually relies on computer vision models to und...
research
06/03/2019

The iMet Collection 2019 Challenge Dataset

Existing computer vision technologies in artwork recognition focus mainl...
research
10/15/2021

Accurate Fine-grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation

Accurate layout analysis without subsequent text-line segmentation remai...
research
09/15/2023

EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding

Object understanding in egocentric visual data is arguably a fundamental...
research
08/20/2020

ImagiFilter: A resource to enable the semi-automatic mining of images at scale

Datasets (semi-)automatically collected from the web can easily scale to...
research
04/26/2021

InfographicVQA

Infographics are documents designed to effectively communicate informati...

Please sign up or login with your details

Forgot password? Click here to reset