Measuring Data

12/09/2022
by   Margaret Mitchell, et al.
0

We identify the task of measuring data to quantitatively characterize the composition of machine learning data and datasets. Similar to an object's height, width, and volume, data measurements quantify different attributes of data along common dimensions that support comparison. Several lines of research have proposed what we refer to as measurements, with differing terminology; we bring some of this work together, particularly in fields of computer vision and language, and build from it to motivate measuring data as a critical component of responsible AI development. Measuring data aids in systematically building and analyzing machine learning (ML) data towards specific goals and gaining better control of what modern ML systems will learn. We conclude with a discussion of the many avenues of future work, the limitations of data measurements, and how to leverage these measurement approaches in research and practice.

READ FULL TEXT
research
04/07/2022

Measuring AI Systems Beyond Accuracy

Current test and evaluation (T E) methods for assessing machine learni...
research
09/20/2021

SoK: Machine Learning Governance

The application of machine learning (ML) in computer systems introduces ...
research
07/29/2020

Integrating Machine Learning for Planetary Science: Perspectives for the Next Decade

Machine learning (ML) methods can expand our ability to construct, and d...
research
08/17/2020

Intelligence plays dice: Stochasticity is essential for machine learning

Many fields view stochasticity as a way to gain computational efficiency...
research
04/24/2019

Machine Learning Tips and Tricks for Power Line Communications

A great deal of attention has been recently given to Machine Learning (M...
research
04/16/2018

Tree Morphology for Phenotyping from Semantics-Based Mapping in Orchard Environments

Measuring tree morphology for phenotyping is an essential but labor-inte...
research
08/29/2023

Measurement Tampering Detection Benchmark

When training powerful AI systems to perform complex tasks, it may be ch...

Please sign up or login with your details

Forgot password? Click here to reset