The ArtBench Dataset: Benchmarking Generative Models with Artworks

06/22/2022
by   Peiyuan Liao, et al.
0

We introduce ArtBench-10, the first class-balanced, high-quality, cleanly annotated, and standardized dataset for benchmarking artwork generation. It comprises 60,000 images of artwork from 10 distinctive artistic styles, with 5,000 training images and 1,000 testing images per style. ArtBench-10 has several advantages over previous artwork datasets. Firstly, it is class-balanced while most previous artwork datasets suffer from the long tail class distributions. Secondly, the images are of high quality with clean annotations. Thirdly, ArtBench-10 is created with standardized data collection, annotation, filtering, and preprocessing procedures. We provide three versions of the dataset with different resolutions (32×32, 256×256, and original image size), formatted in a way that is easy to be incorporated by popular machine learning frameworks. We also conduct extensive benchmarking experiments using representative image synthesis models with ArtBench-10 and present in-depth analysis. The dataset is available at https://github.com/liaopeiyuan/artbench under a Fair Use license.

READ FULL TEXT

page 2

page 5

page 7

page 8

page 9

page 24

page 25

page 26

research
08/25/2017

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

We present Fashion-MNIST, a new dataset comprising of 28x28 grayscale im...
research
05/19/2022

Oracle-MNIST: a Realistic Image Dataset for Benchmarking Machine Learning Algorithms

We introduce the Oracle-MNIST dataset, comprising of 28×28 grayscale ima...
research
10/11/2022

BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset

In this work, we present BanglaParaphrase, a high-quality synthetic Bang...
research
11/15/2022

Will Large-scale Generative Models Corrupt Future Datasets?

Recently proposed large-scale text-to-image generative models such as DA...
research
06/30/2023

TTSWING: a Dataset for Table Tennis Swing Analysis

We introduce TTSWING, a novel dataset designed for table tennis swing an...
research
02/13/2022

FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations

Recent advances in generative adversarial networks have shown that it is...
research
06/18/2023

OpenDataVal: a Unified Benchmark for Data Valuation

Assessing the quality and impact of individual data points is critical f...

Please sign up or login with your details

Forgot password? Click here to reset