Scaling Laws Do Not Scale

07/05/2023
by Fernando Diaz, et al.

Recent work has proposed a power law relationship, referred to as “scaling laws,” between the performance of artificial intelligence (AI) models and aspects of those models' design (e.g., dataset size). In other words, as the size of a dataset (or the number of model parameters, etc.) increases, the performance of a given model trained on that dataset is expected to increase correspondingly. However, while compelling in the aggregate, this scaling law relationship overlooks the ways that metrics used to measure performance may be precarious and contested, or may not correspond with how different groups of people perceive the quality of models' output. In this paper, we argue that as the size of datasets used to train large AI models grows, the number of distinct communities (including demographic groups) whose data is included in a given dataset is likely to grow, and each of these communities may have different values. As a result, there is an increased risk that communities represented in a dataset may have values or preferences not captured by (or, in the worst case, at odds with) the metrics used to evaluate model performance for scaling laws. We end the paper with implications for AI scaling laws: models may not, in fact, continue to improve as datasets get larger, at least not for all people or communities impacted by those models.
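To make the power law relationship described in the abstract concrete, the following is a minimal sketch and is not taken from the paper. It assumes a hypothetical aggregate loss of the form L(n) = c * n^(-alpha), where n is the dataset size; the constants `coeff` and `alpha`, the function names, and the synthetic data points are all illustrative assumptions. The exponent is recovered with an ordinary least-squares fit in log-log space.

```python
import math


def power_law_loss(n, coeff=10.0, alpha=0.3):
    """Hypothetical aggregate loss L(n) = coeff * n**(-alpha) (illustrative constants)."""
    return coeff * n ** (-alpha)


def fit_power_law(sizes, losses):
    """Fit log L = log(coeff) - alpha * log(n) by ordinary least squares."""
    xs = [math.log(n) for n in sizes]
    ys = [math.log(loss) for loss in losses]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    covariance = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    variance = sum((x - x_mean) ** 2 for x in xs)
    slope = covariance / variance
    intercept = y_mean - slope * x_mean
    # A power law is a straight line in log-log space with slope -alpha.
    return math.exp(intercept), -slope


# Synthetic (not measured) dataset sizes from 10^3 to 10^8 examples.
sizes = [10 ** k for k in range(3, 9)]
losses = [power_law_loss(n) for n in sizes]
coeff_hat, alpha_hat = fit_power_law(sizes, losses)
print(f"fitted coeff={coeff_hat:.3f}, alpha={alpha_hat:.3f}")
```

Note that the fit summarizes a single aggregate metric. The abstract's argument is that such a curve says nothing about whether the chosen metric reflects the values of each community represented in the data.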


research
09/24/2021

Is the Number of Trainable Parameters All That Actually Matters?

Recent work has identified simple empirical scaling laws for language mo...
research
08/17/2022

Understanding Scaling Laws for Recommendation Models

Scale has been a major driving force in improving machine learning perfo...
research
09/18/2022

On the existence of Scaling laws across Indian districts: A new prospect for urban scaling

Urban scaling analysis is generally performed based on cities. In this s...
research
03/22/2016

Completely random measures for modeling power laws in sparse graphs

Network data appear in a number of applications, such as online social n...
research
05/28/2020

Scaling Participation – What Does the Concept of Managed Communities Offer for Participatory Design?

This paper investigates mechanisms for scaling participation in particip...
research
06/22/2023

On Hate Scaling Laws For Data-Swamps

‘Scale the model, scale the data, scale the GPU-farms’ is the reigning s...
research
08/17/2021

Scaling Laws for Deep Learning

Running faster will only get you so far – it is generally advisable to f...
