Improving Text Relationship Modeling with Artificial Data

10/27/2020
by   Peter Organisciak, et al.
0

Data augmentation uses artificially-created examples to support supervised machine learning, adding robustness to the resulting models and helping to account for limited availability of labelled data. We apply and evaluate a synthetic data approach to relationship classification in digital libraries, generating artificial books with relationships that are common in digital libraries but not easier inferred from existing metadata. We find that for classification on whole-part relationships between books, synthetic data improves a deep neural network classifier by 91 ability of synthetic data to learn a useful new text relationship class from fully artificial training data.

READ FULL TEXT

page 1

page 7

research
04/07/2023

Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic Data

Generating synthetic data through generative models is gaining interest ...
research
08/15/2016

Generating Synthetic Data for Text Recognition

Generating synthetic images is an art which emulates the natural process...
research
01/29/2021

Synthetic Data and Hierarchical Object Detection in Overhead Imagery

The performance of neural network models is often limited by the availab...
research
12/09/2022

Synthetic Data for Object Classification in Industrial Applications

One of the biggest challenges in machine learning is data collection. Tr...
research
06/09/2014

Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition

In this work we present a framework for the recognition of natural scene...
research
03/03/2023

Revisiting Wright: Improving supervised classification of rat ultrasonic vocalisations using synthetic training data

Rodents communicate through ultrasonic vocalizations (USVs). These calls...

Please sign up or login with your details

Forgot password? Click here to reset