FreaAI: Automated extraction of data slices to test machine learning models

08/12/2021
by   Samuel Ackerman, et al.
0

Machine learning (ML) solutions are prevalent. However, many challenges exist in making these solutions business-grade. One major challenge is to ensure that the ML solution provides its expected business value. In order to do that, one has to bridge the gap between the way ML model performance is measured and the solution requirements. In previous work (Barash et al, "Bridging the gap...") we demonstrated the effectiveness of utilizing feature models in bridging this gap. Whereas ML performance metrics, such as the accuracy or F1-score of a classifier, typically measure the average ML performance, feature models shed light on explainable data slices that are too far from that average, and therefore might indicate unsatisfied requirements. For example, the overall accuracy of a bank text terms classifier may be very high, say 98%± 2%, yet it might perform poorly for terms that include short descriptions and originate from commercial accounts. A business requirement, which may be implicit in the training data, may be to perform well regardless of the type of account and length of the description. Therefore, the under-performing data slice that includes short descriptions and commercial accounts suggests poorly-met requirements. In this paper we show the feasibility of automatically extracting feature models that result in explainable data slices over which the ML solution under-performs. Our novel technique, IBM FreaAI aka FreaAI, extracts such slices from structured ML test data or any other labeled data. We demonstrate that FreaAI can automatically produce explainable and statistically-significant data slices over seven open datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/22/2021

Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation

Testing Machine Learning (ML) models and AI-Infused Applications (AIIAs)...
research
07/16/2018

Slice Finder: Automated Data Sclicing for Model Validation

As machine learning (ML) systems become democratized, it becomes increas...
research
07/16/2018

Slice Finder: Automated Data Slicing for Model Validation

As machine learning (ML) systems become democratized, it becomes increas...
research
06/13/2023

Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms

Machine learning (ML) models that achieve high average accuracy can stil...
research
01/02/2022

Experiment Based Crafting and Analyzing of Machine Learning Solutions

The crafting of machine learning (ML) based systems requires statistical...
research
07/16/2018

Automated Data Slicing for Model Validation:A Big data - AI Integration Approach

As machine learning systems become democratized, it becomes increasingly...
research
05/14/2023

Automatic Generation of Attention Rules For Containment of Machine Learning Model Errors

Machine learning (ML) solutions are prevalent in many applications. Howe...

Please sign up or login with your details

Forgot password? Click here to reset