Plotting the cumulative deviation of a subgroup from the full population as a function of score

08/04/2020
by Mark Tygert, et al.

Assessing whether a subgroup of a full population is treated equitably often involves assigning numerical "scores" to all individuals such that similar individuals get similar scores; matching via propensity scores is a common example. Given such scores, equitable treatment could mean that individuals with similar scores attain similar outcomes regardless of whether they belong to the subgroup. The traditional graphical methods for visualizing inequities are known as "reliability diagrams" or "calibration plots." These bin the scores into a partition of all possible values and, for each bin, plot both the average outcome for individuals in the subgroup and the average outcome for all individuals in the full population; comparing the graph for the subgroup with that for the full population gives some sense of how the subgroup's averages deviate from the full population's. Unfortunately, real data sets contain only finitely many observations, which limits the usable resolution of the bins, so the conventional methods can obscure important variations depending on the choice of bins. Fortunately, plotting the cumulative deviation of the subgroup from the full population sidesteps the problematic binning. The cumulative plots encode subgroup deviation directly as the slopes of secant lines for the graphs, and slope is easy to perceive even when the constant offsets of the secant lines are irrelevant. The cumulative approach thus avoids the binning that smooths over deviations of the subgroup from the full population.
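The idea of a cumulative plot can be illustrated with a minimal sketch in Python. This is not the authors' implementation: the function name `cumulative_deviation`, the way the full-population reference is computed (averaging all outcomes whose scores fall nearest to each subgroup score), and the assumption that subgroup scores are distinct are all choices made here for illustration only.

```python
import numpy as np


def cumulative_deviation(scores, outcomes, in_subgroup):
    """Return (x, c) for a cumulative deviation plot (illustrative sketch).

    Assumptions made for this example, not taken from the paper:
    - subgroup members are also members of the full population;
    - subgroup scores are distinct;
    - the population reference at each subgroup score is the mean outcome
      of all individuals whose score is nearest to that subgroup score.
    """
    scores = np.asarray(scores, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    in_subgroup = np.asarray(in_subgroup, dtype=bool)

    # Sort the subgroup by score.
    order = np.argsort(scores[in_subgroup], kind="stable")
    s_sub = scores[in_subgroup][order]
    r_sub = outcomes[in_subgroup][order]
    n = s_sub.size

    # Midpoints between consecutive subgroup scores split the score axis
    # into n cells, one per subgroup member (no fixed-width bins).
    midpoints = (s_sub[:-1] + s_sub[1:]) / 2
    cell = np.searchsorted(midpoints, scores, side="left")

    # Mean outcome of the full population within each cell.
    sums = np.bincount(cell, weights=outcomes, minlength=n)
    counts = np.bincount(cell, minlength=n)
    expected = sums / counts

    # Cumulative sum of (subgroup outcome - population reference), scaled by n.
    c = np.concatenate(([0.0], np.cumsum(r_sub - expected) / n))
    x = np.arange(n + 1) / n
    return x, c
```

Plotting `c` against `x` gives a curve in which the slope of the secant line between two points equals the average deviation of the subgroup from the population reference over the corresponding range of scores; a flat stretch indicates no deviation there.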


research
08/05/2021

Cumulative differences between subpopulations

Comparing the differences in outcomes (that is, in "dependent variables"...
research
06/03/2020

Plots of the cumulative differences between observed and expected values of ordered Bernoulli variates

Many predictions are probabilistic in nature; for example, a prediction ...
research
05/19/2022

Metrics of calibration for probabilistic predictions

Predictions are often probabilities; e.g., a prediction could be for pre...
research
07/27/2022

Ties in ranking scores can be treated as weighted samples

Prior proposals for cumulative statistics suggest making tiny random per...
research
01/31/2022

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

The author's recent research papers, "Cumulative deviation of a subpopul...
research
05/18/2023

Cumulative differences between paired samples

The simplest, most common paired samples consist of observations from tw...
research
12/24/2022

Inclusive Artificial Intelligence

Prevailing methods for assessing and comparing generative AIs incentiviz...
