Balanced Filtering via Non-Disclosive Proxies

06/26/2023
by   Siqi Deng, et al.
0

We study the problem of non-disclosively collecting a sample of data that is balanced with respect to sensitive groups when group membership is unavailable or prohibited from use at collection time. Specifically, our collection mechanism does not reveal significantly more about group membership of any individual sample than can be ascertained from base rates alone. To do this, we adopt a fairness pipeline perspective, in which a learner can use a small set of labeled data to train a proxy function that can later be used for this filtering task. We then associate the range of the proxy function with sampling probabilities; given a new candidate, we classify it using our proxy function, and then select it for our sample with probability proportional to the sampling probability corresponding to its proxy classification. Importantly, we require that the proxy classification itself not reveal significant information about the sensitive group membership of any individual sample (i.e., it should be sufficiently non-disclosive). We show that under modest algorithmic assumptions, we find such a proxy in a sample- and oracle-efficient manner. Finally, we experimentally evaluate our algorithm and analyze generalization properties.

READ FULL TEXT
research
07/09/2021

Multiaccurate Proxies for Downstream Fairness

We study the problem of training a model that must obey demographic fair...
research
10/28/2022

Fairness Certificates for Differentially Private Classification

In this work, we theoretically study the impact of differential privacy ...
research
07/25/2022

Estimating and Controlling for Fairness via Sensitive Attribute Predictors

Although machine learning classifiers have been increasingly used in hig...
research
12/02/2021

Sequential Spatially Balanced Sampling

Sequential sampling occurs when the entire population is not known in ad...
research
11/18/2022

Informative Sample-Aware Proxy for Deep Metric Learning

Among various supervised deep metric learning methods proxy-based approa...
research
06/14/2023

A Proxy-Free Strategy for Practically Improving the Poisoning Efficiency in Backdoor Attacks

Poisoning efficiency is a crucial factor in poisoning-based backdoor att...
research
06/25/2019

Proxy Certificates: The Missing Link in the Web's Chain of Trust

The ability to quickly revoke a compromised key is critical to the secur...

Please sign up or login with your details

Forgot password? Click here to reset