Outlier-Robust Group Inference via Gradient Space Clustering

10/13/2022
by   Yuchen Zeng, et al.
0

Traditional machine learning models focus on achieving good performance on the overall training distribution, but they often underperform on minority groups. Existing methods can improve the worst-group performance, but they can have several limitations: (i) they require group annotations, which are often expensive and sometimes infeasible to obtain, and/or (ii) they are sensitive to outliers. Most related works fail to solve these two issues simultaneously as they focus on conflicting perspectives of minority groups and outliers. We address the problem of learning group annotations in the presence of outliers by clustering the data in the space of gradients of the model parameters. We show that data in the gradient space has a simpler structure while preserving information about minority groups and outliers, making it suitable for standard clustering methods like DBSCAN. Extensive experiments demonstrate that our method significantly outperforms state-of-the-art both in terms of group identification and downstream worst-group performance.

READ FULL TEXT

page 14

page 15

page 16

research
07/19/2021

Just Train Twice: Improving Group Robustness without Training Group Information

Standard training via empirical risk minimization (ERM) can produce mode...
research
05/24/2023

Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection

A standard method for measuring the impacts of AI on marginalized commun...
research
08/28/2023

Some issues in robust clustering

Some key issues in robust clustering are discussed with focus on Gaussia...
research
01/10/2022

Towards Group Robustness in the presence of Partial Group Labels

Learning invariant representations is an important requirement when trai...
research
10/27/2021

Simple data balancing achieves competitive worst-group-accuracy

We study the problem of learning classifiers that perform well across (k...
research
03/26/2020

Zero-Assignment Constraint for Graph Matching with Outliers

Graph matching (GM), as a longstanding problem in computer vision and pa...
research
04/20/2022

Improved Worst-Group Robustness via Classifier Retraining on Independent Splits

High-capacity deep neural networks (DNNs) trained with Empirical Risk Mi...

Please sign up or login with your details

Forgot password? Click here to reset