A Cautionary Tail: A Framework and Case Study for Testing Predictive Model Validity

07/10/2018
by   Peter C. Casey, et al.
0

Data scientists frequently train predictive models on administrative data. However, the process that generates this data can bias predictive models, making it important to test models against their intended use. We provide a field assessment framework that we use to validate a model predicting rat infestations in Washington, D.C. The model was developed with data from the city's 311 service request system. Although the model performs well against new 311 data, we find that it does not perform well when predicting the outcomes of inspections in our field assessment. We recommend that data scientists expand the use of field assessments to test their models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/10/2018

A Cautionary Tail: A Framework and Casey Study for Testing Predictive Model Validity

Data scientists frequently train predictive models on administrative dat...
research
05/24/2023

Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

Predictive student models are increasingly used in learning environments...
research
12/09/2019

Prediction of Sewer Pipe Deterioration Using Random Forest Classification

Wastewater infrastructure systems deteriorate over time due to a combina...
research
11/05/2014

Using Twitter to predict football outcomes

Twitter has been proven to be a notable source for predictive modelling ...
research
07/01/2019

A Compositional Framework for Scientific Model Augmentation

Scientists construct and analyze computational models to understand the ...
research
10/01/2018

Utilizing a Transparency-driven Environment toward Trusted Automatic Genre Classification: A Case Study in Journalism History

With the growing abundance of unlabeled data in real-world tasks, resear...
research
07/05/2021

Creating Unbiased Public Benchmark Datasets with Data Leakage Prevention for Predictive Process Monitoring

Advances in AI, and especially machine learning, are increasingly drawin...

Please sign up or login with your details

Forgot password? Click here to reset