Towards an efficient framework for Data Extraction from Chart Images

05/05/2021
by   Weihong Ma, et al.
0

In this paper, we fill the research gap by adopting state-of-the-art computer vision techniques for the data extraction stage in a data mining system. As shown in Fig.1, this stage contains two subtasks, namely, plot element detection and data conversion. For building a robust box detector, we comprehensively compare different deep learning-based methods and find a suitable method to detect box with high precision. For building a robust point detector, a fully convolutional network with feature fusion module is adopted, which can distinguish close points compared to traditional methods. The proposed system can effectively handle various chart data without making heuristic assumptions. For data conversion, we translate the detected element into data with semantic value. A network is proposed to measure feature similarities between legends and detected elements in the legend matching phase. Furthermore, we provide a baseline on the competition of Harvesting raw tables from Infographics. Some key factors have been found to improve the performance of each stage. Experimental results demonstrate the effectiveness of the proposed system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2021

Serial-parallel Multi-Scale Feature Fusion for Anatomy-Oriented Hand Joint Detection

Accurate hand joints detection from images is a fundamental topic which ...
research
09/15/2017

Feature-Fused SSD: Fast Detection for Small Objects

Small objects detection is a challenging task in computer vision due to ...
research
08/29/2021

MBDF-Net: Multi-Branch Deep Fusion Network for 3D Object Detection

Point clouds and images could provide complementary information when rep...
research
04/22/2021

Tablext: A Combined Neural Network And Heuristic Based Table Extractor

A significant portion of the data available today is found within tables...
research
09/10/2018

Geoseg: A Computer Vision Package for Automatic Building Segmentation and Outline Extraction

Recently, deep learning algorithms, especially fully convolutional netwo...
research
08/26/2020

Tabular Structure Detection from Document Images for Resource Constrained Devices Using A Row Based Similarity Measure

Tabular structures are used to present crucial information in a structur...
research
09/15/2023

A Real-time Faint Space Debris Detector With Learning-based LCM

With the development of aerospace technology, the increasing population ...

Please sign up or login with your details

Forgot password? Click here to reset