Training on test data: Removing near duplicates in Fashion-MNIST

06/19/2019
by Christopher Geier, et al.

MNIST and Fashion-MNIST are extremely popular benchmarks in machine learning. Fashion-MNIST improves on MNIST by posing a harder problem, increasing the diversity of the test set, and more accurately representing a modern computer vision task. To improve the data quality of Fashion-MNIST, this paper investigates near-duplicate images shared between the training and test sets. Such near-duplicates artificially inflate the test accuracy of machine learning models, because a model is partly evaluated on images it has effectively already seen during training. This paper identifies near-duplicate images in Fashion-MNIST and proposes a version of the dataset with the near-duplicates removed.
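
To make the idea of a train/test near-duplicate concrete, here is a minimal sketch in plain Python. It is not the paper's method, which the abstract does not describe. It assumes images are flattened lists of pixel intensities, uses Euclidean distance as the similarity measure, and flags a test image when its nearest training image lies within a hand-picked `threshold`. The function names, the threshold value, and the toy data are all hypothetical. Dropping the flagged images from the test set is one possible cleanup; the abstract does not say which split the authors prune.

```python
from math import sqrt


def l2_distance(a, b):
    """Euclidean distance between two flattened images of equal length."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def find_near_duplicates(train, test, threshold):
    """Return (test_index, train_index, distance) for every test image whose
    nearest training image is within `threshold` (an assumed cutoff)."""
    matches = []
    for ti, test_img in enumerate(test):
        # Brute-force nearest-neighbour search over the training set.
        best_idx, best_dist = min(
            ((ri, l2_distance(test_img, train_img))
             for ri, train_img in enumerate(train)),
            key=lambda pair: pair[1],
        )
        if best_dist <= threshold:
            matches.append((ti, best_idx, best_dist))
    return matches


def remove_flagged(test, matches):
    """Drop the flagged test images (one possible cleanup choice)."""
    flagged = {ti for ti, _, _ in matches}
    return [img for i, img in enumerate(test) if i not in flagged]


# Toy 2x2 "images" flattened to length 4 (hypothetical data, not Fashion-MNIST).
train = [[0, 0, 0, 0], [255, 255, 0, 0], [10, 200, 30, 40]]
test = [[0, 1, 0, 0], [0, 0, 255, 255], [12, 198, 30, 41]]

matches = find_near_duplicates(train, test, threshold=5.0)
clean_test = remove_flagged(test, matches)
```

On the real dataset the brute-force loop would be far too slow (10,000 test images against 60,000 training images), so a practical version would vectorize the distance computation or use an approximate nearest-neighbour index, and would choose the threshold by visually inspecting candidate pairs.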

research
08/25/2017

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

We present Fashion-MNIST, a new dataset comprising of 28x28 grayscale im...
research
10/29/2019

Distribution Density, Tails, and Outliers in Machine Learning: Metrics and Applications

We develop techniques to quantify the degree to which a given (training ...
research
12/31/2018

Batch Size Influence on Performance of Graphic and Tensor Processing Units during Training and Inference Phases

The impact of the maximally possible batch size (for the better runtime)...
research
12/23/2022

Rule Learning by Modularity

In this paper, we present a modular methodology that combines state-of-t...
research
01/28/2022

Systematic Training and Testing for Machine Learning Using Combinatorial Interaction Testing

This paper demonstrates the systematic use of combinatorial coverage for...
research
08/18/2023

The Impact of Background Removal on Performance of Neural Networks for Fashion Image Classification and Segmentation

Fashion understanding is a hot topic in computer vision, with many appli...
research
09/06/2022

Merged-GHCIDR: Geometrical Approach to Reduce Image Data

The computational resources required to train a model have been increasi...
