Towards Automatic Parsing of Structured Visual Content through the Use of Synthetic Data

04/29/2022
by   Lukas Scholch, et al.
18

Structured Visual Content (SVC) such as graphs, flow charts, or the like are used by authors to illustrate various concepts. While such depictions allow the average reader to better understand the contents, images containing SVCs are typically not machine-readable. This, in turn, not only hinders automated knowledge aggregation, but also the perception of displayed in-formation for visually impaired people. In this work, we propose a synthetic dataset, containing SVCs in the form of images as well as ground truths. We show the usage of this dataset by an application that automatically extracts a graph representation from an SVC image. This is done by training a model via common supervised learning methods. As there currently exist no large-scale public datasets for the detailed analysis of SVC, we propose the Synthetic SVC (SSVC) dataset comprising 12,000 images with respective bounding box annotations and detailed graph representations. Our dataset enables the development of strong models for the interpretation of SVCs while skipping the time-consuming dense data annotation. We evaluate our model on both synthetic and manually annotated data and show the transferability of synthetic to real via various metrics, given the presented application. Here, we evaluate that this proof of concept is possible to some extend and lay down a solid baseline for this task. We discuss the limitations of our approach for further improvements. Our utilized metrics can be used as a tool for future comparisons in this domain. To enable further research on this task, the dataset is publicly available at https://bit.ly/3jN1pJJ

READ FULL TEXT
research
03/20/2022

VinDr-PCXR: An open, large-scale chest radiograph dataset for interpretation of common thoracic diseases in children

Computer-aided diagnosis systems in adult chest radiography (CXR) have r...
research
11/02/2020

Multi-Task Learning for Calorie Prediction on a Novel Large-Scale Recipe Dataset Enriched with Nutritional Information

A rapidly growing amount of content posted online, such as food recipes,...
research
06/29/2021

SinGAN-Seg: Synthetic Training Data Generation for Medical Image Segmentation

Processing medical data to find abnormalities is a time-consuming and co...
research
03/17/2022

deepNIR: Datasets for generating synthetic NIR images and improved fruit detection system using deep learning techniques

This paper presents datasets utilised for synthetic near-infrared (NIR) ...
research
03/16/2023

A New Benchmark: On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adaptation

Deep learning in computer vision has achieved great success with the pri...
research
01/28/2019

PuppetGAN: Transferring Disentangled Properties from Synthetic to Real Images

In this work we propose a model that enables controlled manipulation of ...
research
05/26/2023

Conjunct Resolution in the Face of Verbal Omissions

Verbal omissions are complex syntactic phenomena in VP coordination stru...

Please sign up or login with your details

Forgot password? Click here to reset