-
Defuse: Harnessing Unrestricted Adversarial Examples for Debugging Models Beyond Test Accuracy
We typically compute aggregate statistics on held-out test data to asses...
read it
-
Differentially Private Language Models Benefit from Public Pre-training
Language modeling is a keystone task in natural language processing. Whe...
read it
-
How Much Should I Trust You? Modeling Uncertainty of Black Box Explanations
As local explanations of black box models are increasingly being employe...
read it
-
Fair Meta-Learning: Learning How to Learn Fairly
Data sets for fairness relevant tasks can lack examples or be biased acc...
read it
-
How can we fool LIME and SHAP? Adversarial Attacks on Post hoc Explanation Methods
As machine learning black boxes are increasingly being deployed in domai...
read it
-
Fairness Warnings and Fair-MAML: Learning Fairly with Minimal Data
In this paper, we advocate for the study of fairness techniques in low d...
read it
-
Assessing the Local Interpretability of Machine Learning Models
The increasing adoption of machine learning tools has led to calls for a...
read it

Dylan Slack
is this you? claim profile