ALT-MAS: A Data-Efficient Framework for Active Testing of Machine Learning Algorithms

04/11/2021
by Huong Ha, et al.

Machine learning models are used extensively in many important areas, but there is no guarantee that a model will always perform well or as its developers intended. Understanding the correctness of a model is crucial to preventing failures that may have a significant detrimental impact in critical application areas. In this paper, we propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data. The idea is to estimate the metrics of interest for a model-under-test using a Bayesian neural network (BNN). We develop a novel data augmentation method that helps train the BNN to high accuracy. We also devise an information-theoretic sampling strategy that selects data points so as to obtain accurate estimates of the metrics of interest. Finally, we conduct an extensive set of experiments testing various machine learning models on different types of metrics. Our experiments show that the metric estimates produced by our method are significantly better than those of existing baselines.
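
The abstract does not specify the estimator or the sampling criterion, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a surrogate Bayesian model has already produced posterior samples of class probabilities for each test point, estimates the accuracy of the model-under-test from them, and ranks unlabeled points by a standard mutual-information score. All function and parameter names (`estimate_accuracy`, `mutual_information`, `select_queries`, `post_probs`, `mut_preds`) are hypothetical.

```python
import numpy as np


def entropy(p, axis=-1):
    """Shannon entropy in nats; probabilities are clipped to avoid log(0)."""
    p = np.clip(p, 1e-12, 1.0)
    return -np.sum(p * np.log(p), axis=axis)


def estimate_accuracy(post_probs, mut_preds, labels=None):
    """Estimate the accuracy of a model-under-test (assumed setup).

    post_probs: array (S, N, C), S posterior samples of class probabilities
                from the surrogate for N test points and C classes.
    mut_preds:  integer array (N,), predictions of the model-under-test.
    labels:     optional dict {index: true_label} for points already labeled.
    """
    mean_probs = post_probs.mean(axis=0)  # (N, C) posterior predictive
    # Probability, under the surrogate, that each prediction is correct.
    p_correct = mean_probs[np.arange(len(mut_preds)), mut_preds]
    # Where a true label is known, use the observed correctness instead.
    for i, y in (labels or {}).items():
        p_correct[i] = float(mut_preds[i] == y)
    return p_correct.mean()


def mutual_information(post_probs):
    """Entropy of the mean prediction minus the mean per-sample entropy."""
    mean_probs = post_probs.mean(axis=0)
    return entropy(mean_probs) - entropy(post_probs).mean(axis=0)


def select_queries(post_probs, labeled_idx, k):
    """Return indices of the k unlabeled points with the highest score."""
    scores = mutual_information(post_probs)
    idx = np.fromiter(labeled_idx, dtype=int)
    scores[idx] = -np.inf  # never re-query a labeled point
    return np.argsort(-scores)[:k]
```

In a loop, one would label the points returned by `select_queries`, retrain the surrogate, and recompute the estimate; how the surrogate is trained and augmented is outside the scope of this sketch.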
