Predicting Early Dropout: Calibration and Algorithmic Fairness Considerations

03/16/2021
by   Marzieh Karimi-Haghighi, et al.

This work addresses the problem of predicting dropout risk in undergraduate studies from the perspective of algorithmic fairness. We develop a machine learning method to predict the risks of university dropout and underperformance. The objective is to understand whether such a system can identify students at risk while avoiding potential discriminatory biases. When modeling both risks, we obtain prediction models with an Area Under the ROC Curve (AUC) of 0.77-0.78 based on the data available at enrollment time, before the first year of studies starts. This data includes the students' demographics, the high school they attended, and their (average) admission grade. Our models are calibrated: they produce estimated probabilities for each risk, not mere scores. We analyze whether this method leads to discriminatory outcomes for some sensitive groups in terms of prediction accuracy (AUC) and error rates (Generalized False Positive Rate, GFPR, and Generalized False Negative Rate, GFNR). The models exhibit some equity in terms of AUC and GFNR across groups. A similar GFNR means a similar probability of failing to detect risk for students who drop out. The disparities in GFPR are addressed through a mitigation process that does not affect the calibration of the model.
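
The abstract does not spell out how the generalized error rates are computed. The following is a minimal sketch, assuming the common definition for probabilistic classifiers: GFPR is the mean predicted risk among students who did not drop out, and GFNR is the mean of one minus the predicted risk among students who did. The function names, the group labels, and the toy data are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def generalized_rates(y_true, p_hat):
    """Return (GFPR, GFNR) for binary labels and predicted probabilities.

    Assumed definitions (not confirmed by the abstract):
      GFPR = mean of p_hat over true negatives (y == 0)
      GFNR = mean of (1 - p_hat) over true positives (y == 1)
    """
    neg = [p for y, p in zip(y_true, p_hat) if y == 0]
    pos = [p for y, p in zip(y_true, p_hat) if y == 1]
    gfpr = sum(neg) / len(neg) if neg else float("nan")
    gfnr = sum(1.0 - p for p in pos) / len(pos) if pos else float("nan")
    return gfpr, gfnr


def rates_by_group(y_true, p_hat, groups):
    """Compute (GFPR, GFNR) separately for each sensitive group."""
    labels = defaultdict(list)
    probs = defaultdict(list)
    for y, p, g in zip(y_true, p_hat, groups):
        labels[g].append(y)
        probs[g].append(p)
    return {g: generalized_rates(labels[g], probs[g]) for g in labels}


# Hypothetical toy data: 1 = dropped out, 0 = did not drop out.
y_true = [1, 0, 1, 0, 1, 0, 0, 1]
p_hat = [0.8, 0.2, 0.6, 0.4, 0.7, 0.1, 0.3, 0.5]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]

result = rates_by_group(y_true, p_hat, groups)
for group, (gfpr, gfnr) in sorted(result.items()):
    print(f"group {group}: GFPR={gfpr:.2f}, GFNR={gfnr:.2f}")
```

Comparing the per-group values is one way to read the abstract's claim: similar GFNR across groups would mean at-risk students are missed at a similar rate, while a gap in GFPR would point to the disparity that the mitigation step targets.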


