Uncovering bias in the PlantVillage dataset

06/09/2022
by   Mehmet Alican Noyan, et al.
54

We report our investigation on the use of the popular PlantVillage dataset for training deep learning based plant disease detection models. We trained a machine learning model using only 8 pixels from the PlantVillage image backgrounds. The model achieved 49.0 above the random guessing accuracy of 2.6 PlantVillage dataset contains noise correlated with the labels and deep learning models can easily exploit this bias to make predictions. Possible approaches to alleviate this problem are discussed.

READ FULL TEXT

page 3

page 4

page 5

page 9

research
03/05/2020

Plant Disease Detection from Images

Plant disease detection is a huge problem and often require professional...
research
03/30/2022

Remember to correct the bias when using deep learning for regression!

When training deep learning models for least-squares regression, we cann...
research
03/13/2020

The TrojAI Software Framework: An OpenSource tool for Embedding Trojans into Deep Learning Models

In this paper, we introduce the TrojAI software framework, an open sourc...
research
09/17/2019

Ludwig: a type-based declarative deep learning toolbox

In this work we present Ludwig, a flexible, extensible and easy to use t...
research
06/22/2020

MaskIt: Masking for efficient utilization of incomplete public datasets for training deep learning models

A major challenge in training deep learning models is the lack of high q...
research
11/17/2022

Why Deep Learning Generalizes

Very large deep learning models trained using gradient descent are remar...
research
10/12/2021

Deep Learning for Bias Detection: From Inception to Deployment

To create a more inclusive workplace, enterprises are actively investing...

Please sign up or login with your details

Forgot password? Click here to reset