Slice-based Learning: A Programming Model for Residual Learning in Critical Data Slices

09/13/2019
by   Vincent S. Chen, et al.
34

In real-world machine learning applications, data subsets correspond to especially critical outcomes: vulnerable cyclist detections are safety-critical in an autonomous driving task, and "question" sentences might be important to a dialogue agent's language understanding for product purposes. While machine learning models can achieve high quality performance on coarse-grained metrics like F1-score and overall accuracy, they may underperform on critical subsets---we define these as slices, the key abstraction in our approach. To address slice-level performance, practitioners often train separate "expert" models on slice subsets or use multi-task hard parameter sharing. We propose Slice-based Learning, a new programming model in which the slicing function (SF), a programming interface, specifies critical data subsets for which the model should commit additional capacity. Any model can leverage SFs to learn slice expert representations, which are combined with an attention mechanism to make slice-aware predictions. We show that our approach maintains a parameter-efficient representation while improving over baselines by up to 19.0 F1 on slices and 4.6 F1 overall on datasets spanning language understanding (e.g. SuperGLUE), computer vision, and production-scale industrial systems.

READ FULL TEXT
research
06/04/2021

Learning Slice-Aware Representations with Mixture of Attentions

Real-world machine learning systems are achieving remarkable performance...
research
10/07/2020

Slice-Aware Neural Ranking

Understanding when and why neural ranking models fail for an IR task via...
research
04/26/2021

Handling Long-Tail Queries with Slice-Aware Conversational Systems

We have been witnessing the usefulness of conversational AI systems such...
research
07/16/2018

Slice Finder: Automated Data Slicing for Model Validation

As machine learning (ML) systems become democratized, it becomes increas...
research
03/10/2020

Slice Tuner: A Selective Data Collection Framework for Accurate and Fair Machine Learning Models

As machine learning becomes democratized in the era of Software 2.0, one...
research
06/13/2023

Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms

Machine learning (ML) models that achieve high average accuracy can stil...
research
03/24/2022

Domino: Discovering Systematic Errors with Cross-Modal Embeddings

Machine learning models that achieve high overall accuracy often make sy...

Please sign up or login with your details

Forgot password? Click here to reset