Synthesis in Style: Semantic Segmentation of Historical Documents using Synthetic Data

07/14/2021
by Christian Bartz, et al.

One of the most pressing problems in the automated analysis of historical documents is the limited availability of annotated training data. In this paper, we propose a novel method for the synthesis of training data for semantic segmentation of document images. We utilize clusters found in intermediate features of a StyleGAN generator to synthesize RGB and label images at the same time. Our method can be applied to any dataset of scanned documents without the need for manual annotation of individual images, as each model is custom-fit to the dataset. In our experiments, we show that models trained on our synthetic data can reach competitive performance on open benchmark datasets for line segmentation.
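The idea of deriving a label image from clusters in intermediate generator features can be illustrated with a small sketch. It is not the authors' implementation: the feature map is a random array standing in for a real StyleGAN activation, plain k-means with farthest-point initialization is an assumed choice of clustering, and the names `kmeans` and `features_to_label_image` are hypothetical.

```python
import numpy as np

def kmeans(points, k, iters=20):
    """Plain k-means with farthest-point initialization (an assumed choice)."""
    centroids = [points[0]]
    for _ in range(1, k):
        # Distance of every point to its nearest already-chosen centroid.
        d = np.min(
            [((points - c) ** 2).sum(axis=1) for c in centroids], axis=0
        )
        centroids.append(points[d.argmax()])
    centroids = np.stack(centroids).astype(np.float64)
    for _ in range(iters):
        # Squared distance from every point to every centroid: shape (N, k).
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = dists.argmin(axis=1)
        for j in range(k):
            members = points[assign == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return centroids, assign

def features_to_label_image(features, k):
    """Cluster a (C, H, W) feature map per pixel into an (H, W) label image."""
    c, h, w = features.shape
    pixels = features.reshape(c, h * w).T  # one C-dimensional vector per pixel
    _, assign = kmeans(pixels, k)
    return assign.reshape(h, w)

# Hypothetical stand-in for an intermediate generator feature map.
rng = np.random.default_rng(0)
features = rng.normal(size=(8, 16, 16)).astype(np.float32)
features[:, 4:6, 2:14] += 6.0  # a band that plays the role of a text line

labels = features_to_label_image(features, k=2)
```

In a real setting, the label image would be produced from the same generator pass that renders the RGB image, so both stay pixel-aligned; which layer to cluster and how many clusters to use are choices this sketch leaves open.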


research
09/18/2020

Synthetic Convolutional Features for Improved Semantic Segmentation

Recently, learning-based image synthesis has enabled to generate high-re...
research
09/02/2019

Semantic Segmentation of Panoramic Images Using a Synthetic Dataset

Panoramic images have advantages in information capacity and scene stabi...
research
09/04/2017

Dataset Augmentation with Synthetic Images Improves Semantic Segmentation

Although Deep Convolutional Neural Networks trained with strong pixel-le...
research
03/15/2021

Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs

We present a framework to generate synthetic historical documents with p...
research
05/22/2019

A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis

Automatic analysis of scanned historical documents comprises a wide rang...
research
09/17/2022

Can segmentation models be trained with fully synthetically generated data?

In order to achieve good performance and generalisability, medical image...
research
02/27/2021

SUM: A Benchmark Dataset of Semantic Urban Meshes

Recent developments in data acquisition technology allow us to collect 3...
