Mutual Information Scoring: Increasing Interpretability in Categorical Clustering Tasks with Applications to Child Welfare Data

08/03/2022
by Pranav Sankhe, et al.

Youth in the American foster care system are significantly more likely than their peers to face a number of negative life outcomes, from homelessness to incarceration. Administrative data on these youth could provide insights that help identify ways to improve their path towards a better life. However, such data also suffer from a variety of biases, from missing data to reflections of systemic inequality. The present work proposes a prescriptive approach to using these data to learn about both the biases in the data and the systems and youth the data track. Specifically, we develop a novel categorical clustering and cluster summarization methodology that allows us to identify subtle biases in existing data on foster youth and to show where further (often qualitative) research is needed to identify potential ways of assisting youth.
