The Dataset Nutrition Label (2nd Gen): Leveraging Context to Mitigate Harms in Artificial Intelligence

01/10/2022
by   Kasia S. Chmielinski, et al.
0

As the production of and reliance on datasets to produce automated decision-making systems (ADS) increases, so does the need for processes for evaluating and interrogating the underlying data. After launching the Dataset Nutrition Label in 2018, the Data Nutrition Project has made significant updates to the design and purpose of the Label, and is launching an updated Label in late 2020, which is previewed in this paper. The new Label includes context-specific Use Cases Alerts presented through an updated design and user interface targeted towards the data scientist profile. This paper discusses the harm and bias from underlying training data that the Label is intended to mitigate, the current state of the work including new datasets being labeled, new and existing challenges, and further directions of the work, as well as Figures previewing the new label.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2023

Mitigating Label Bias via Decoupled Confident Learning

Growing concerns regarding algorithmic fairness have led to a surge in m...
research
05/09/2018

The Dataset Nutrition Label: A Framework To Drive Higher Data Quality Standards

Artificial intelligence (AI) systems built on incomplete or biased data ...
research
05/28/2023

Mitigating Label Biases for In-context Learning

Various design settings for in-context learning (ICL), such as the choic...
research
10/23/2020

Towards Accountability for Machine Learning Datasets: Practices from Software Engineering and Infrastructure

Rising concern for the societal implications of artificial intelligence ...
research
08/22/2023

Empowering Refugee Claimants and their Lawyers: Using Machine Learning to Examine Decision-Making in Refugee Law

Our project aims at helping and supporting stakeholders in refugee statu...
research
07/19/2019

DaiMoN: A Decentralized Artificial Intelligence Model Network

We introduce DaiMoN, a decentralized artificial intelligence model netwo...
research
10/13/2020

Making Every Label Count: Handling Semantic Imprecision by Integrating Domain Knowledge

Noisy data, crawled from the web or supplied by volunteers such as Mecha...

Please sign up or login with your details

Forgot password? Click here to reset