SlideImages: A Dataset for Educational Image Classification

01/19/2020
by David Morris, et al.

In the past few years, convolutional neural networks (CNNs) have achieved impressive results in computer vision tasks, but these tasks mainly focus on photos with natural scene content. Meanwhile, non-sensor-derived images such as illustrations, data visualizations, and figures are typically used to convey complex information or to explore large datasets, yet these kinds of images have received little attention in computer vision. CNNs and similar techniques require large volumes of training data. Currently, many document analysis systems are trained in part on scene images because large datasets of educational image data are lacking. In this paper, we address this issue and present SlideImages, a dataset for the task of classifying educational illustrations. SlideImages contains training data collected from various sources, e.g., Wikimedia Commons and the AI2D dataset, and test data collected from educational slides. We have reserved all of the actual educational images as a test dataset to ensure that approaches using this dataset generalize well to new educational images, and potentially to other domains. Furthermore, we present a baseline system using a standard deep neural architecture and discuss how to deal with the challenge of limited training data.
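The source-based split described in the abstract (training data from general sources, test data only from educational slides) can be illustrated with a minimal sketch. The record layout, file names, labels, and source tags below are hypothetical placeholders chosen for illustration; they are not taken from SlideImages or from the authors' code.

```python
# Hypothetical records: (image_path, label, source). The paths, labels, and
# source tags are illustrative placeholders, not entries from SlideImages.
records = [
    ("img_001.png", "bar_chart", "wikimedia_commons"),
    ("img_002.png", "diagram", "ai2d"),
    ("img_003.png", "bar_chart", "slides"),
    ("img_004.png", "diagram", "slides"),
]

# Assumption: every slide-derived image is held out for testing.
TEST_SOURCES = {"slides"}


def split_by_source(records, test_sources):
    """Place every record from a held-out source in test, the rest in train."""
    train, test = [], []
    for path, label, source in records:
        target = test if source in test_sources else train
        target.append((path, label, source))
    return train, test


train, test = split_by_source(records, TEST_SOURCES)
```

Splitting by source rather than at random means that no slide image can influence training, so test accuracy reflects how well a model transfers from the general sources to real slides.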

research
08/10/2017

Analysis of Convolutional Neural Networks for Document Image Classification

Convolutional Neural Networks (CNNs) are state-of-the-art models for doc...
research
07/15/2019

The iWildCam 2019 Challenge Dataset

Camera Traps (or Wild Cams) enable the automatic collection of large qua...
research
07/14/2023

Defect Classification in Additive Manufacturing Using CNN-Based Vision Processing

The development of computer vision and in-situ monitoring using visual s...
research
09/25/2020

Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision

Convolutional Neural Networks (CNNs) have proved very accurate in multip...
research
01/04/2017

Transforming Sensor Data to the Image Domain for Deep Learning - an Application to Footstep Detection

Convolutional Neural Networks (CNNs) have become the state-of-the-art in...
research
07/28/2016

25 years of CNNs: Can we compare to human abstraction capabilities?

We try to determine the progress made by convolutional neural networks o...
research
11/30/2018

Inferring Concept Prerequisite Relations from Online Educational Resources

The Internet has rich and rapidly increasing sources of high quality edu...
