BriarPatches: Pixel-Space Interventions for Inducing Demographic Parity

12/17/2018
by   Alexey A. Gritsenko, et al.

We introduce the BriarPatch, a pixel-space intervention that obscures sensitive attributes from representations encoded in pre-trained classifiers. The patches encourage internal model representations not to encode sensitive information, which in turn pushes downstream predictors towards demographic parity with respect to that information. The net result is that BriarPatches provide an intervention mechanism available at the user level, complementing prior research on fair representations, which could previously be applied only by model developers and ML experts.
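
To make the target criterion concrete: demographic parity asks that the rate of positive predictions be the same across groups defined by the sensitive attribute. The sketch below is not taken from the paper. It is a minimal illustration of how a parity gap could be measured for a binary predictor, and the function name `demographic_parity_gap` and the toy data are assumptions made for this example.

```python
def demographic_parity_gap(predictions, groups):
    """Largest difference in positive-prediction rate between any two groups.

    Hypothetical helper for illustration only; not part of the paper.
    predictions: iterable of 0/1 predicted labels.
    groups: iterable of sensitive-attribute values, aligned with predictions.
    """
    counts = {}
    positives = {}
    for pred, group in zip(predictions, groups):
        counts[group] = counts.get(group, 0) + 1
        positives[group] = positives.get(group, 0) + (1 if pred == 1 else 0)
    # Positive-prediction rate per group.
    rates = {g: positives[g] / counts[g] for g in counts}
    return max(rates.values()) - min(rates.values())


# Toy data (made up): group "a" receives 3 of 4 positive predictions,
# group "b" receives 1 of 4.
preds = [1, 0, 1, 1, 0, 1, 0, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]
gap = demographic_parity_gap(preds, groups)
print(gap)  # 0.5
```

A gap of 0 means the predictor satisfies demographic parity exactly on the given sample. An intervention of the kind described in the abstract would aim to shrink this gap for downstream predictors.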
