MDB: Interactively Querying Datasets and Models

08/13/2023
by   Aaditya Naik, et al.
0

As models are trained and deployed, developers need to be able to systematically debug errors that emerge in the machine learning pipeline. We present MDB, a debugging framework for interactively querying datasets and models. MDB integrates functional programming with relational algebra to build expressive queries over a database of datasets and model predictions. Queries are reusable and easily modified, enabling debuggers to rapidly iterate and refine queries to discover and characterize errors and model behaviors. We evaluate MDB on object detection, bias discovery, image classification, and data imputation tasks across self-driving videos, large language models, and medical records. Our experiments show that MDB enables up to 10x faster and 40% shorter queries than other baselines. In a user study, we find developers can successfully construct complex queries that describe errors of machine learning models.

READ FULL TEXT
research
05/12/2023

Text2Cohort: Democratizing the NCI Imaging Data Commons with Natural Language Cohort Discovery

The Imaging Data Commons (IDC) is a cloud-based database that provides r...
research
04/04/2023

Learning to Name Classes for Vision and Language Models

Large scale vision and language models can achieve impressive zero-shot ...
research
08/09/2023

Data-Free Model Extraction Attacks in the Context of Object Detection

A significant number of machine learning models are vulnerable to model ...
research
04/09/2019

Explaining Wrong Queries Using Small Examples

For testing the correctness of SQL queries, e.g., evaluating student sub...
research
03/02/2023

Do Machine Learning Models Learn Common Sense?

Machine learning models can make basic errors that are easily hidden wit...
research
09/13/2023

VLSlice: Interactive Vision-and-Language Slice Discovery

Recent work in vision-and-language demonstrates that large-scale pretrai...
research
09/25/2019

On the Expressiveness of LARA: A Unified Language for Linear and Relational Algebra

We study the expressive power of the LARA language – a recently proposed...

Please sign up or login with your details

Forgot password? Click here to reset