Ecosystem-level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes

07/12/2023
by   Connor Toups, et al.
0

Machine learning is traditionally studied at the model level: researchers measure and improve the accuracy, robustness, bias, efficiency, and other dimensions of specific models. In practice, the societal impact of machine learning is determined by the surrounding context of machine learning deployments. To capture this, we introduce ecosystem-level analysis: rather than analyzing a single model, we consider the collection of models that are deployed in a given context. For example, ecosystem-level analysis in hiring recognizes that a job candidate's outcomes are not only determined by a single hiring algorithm or firm but instead by the collective decisions of all the firms they applied to. Across three modalities (text, images, speech) and 11 datasets, we establish a clear trend: deployed machine learning is prone to systemic failure, meaning some users are exclusively misclassified by all models available. Even when individual models improve at the population level over time, we find these improvements rarely reduce the prevalence of systemic failure. Instead, the benefits of these improvements predominantly accrue to individuals who are already correctly classified by other models. In light of these trends, we consider medical imaging for dermatology where the costs of systemic failure are especially high. While traditional analyses reveal racial performance disparities for both models and humans, ecosystem-level analysis reveals new forms of racial disparity in model predictions that do not present in human predictions. These examples demonstrate ecosystem-level analysis has unique strengths for characterizing the societal impact of machine learning.

READ FULL TEXT

page 5

page 7

page 14

page 15

page 17

page 19

research
07/15/2020

Differential Replication in Machine Learning

When deployed in the wild, machine learning models are usually confronte...
research
03/31/2023

Evaluation Challenges for Geospatial ML

As geospatial machine learning models and maps derived from their predic...
research
11/25/2022

Picking on the Same Person: Does Algorithmic Monoculture lead to Outcome Homogenization?

As the scope of machine learning broadens, we observe a recurring theme ...
research
10/02/2017

Extracting Insights from the Topology of the JavaScript Package Ecosystem

Software ecosystems have had a tremendous impact on computing and societ...
research
10/04/2022

MEDFAIR: Benchmarking Fairness for Medical Imaging

A multitude of work has shown that machine learning-based medical diagno...
research
03/28/2023

Ecosystem Graphs: The Social Footprint of Foundation Models

Foundation models (e.g. ChatGPT, StableDiffusion) pervasively influence ...

Please sign up or login with your details

Forgot password? Click here to reset