Machine Learning Capability: A standardized metric using case difficulty with applications to individualized deployment of supervised machine learning

02/09/2023
by Adrienne Kline, et al.

Model evaluation is a critical component of supervised machine learning classification analyses. Traditional metrics do not incorporate case difficulty, which leaves classification results unbenchmarked for generalization. Combining Item Response Theory (IRT) and Computer Adaptive Testing (CAT) with machine learning can benchmark datasets independently of the end-classification results, providing case-level information about evaluation utility. To showcase this, two datasets were used: 1) health-related and 2) physical science. A two-parameter IRT model for the health dataset, and a polytomous IRT model for the physical science dataset, were used to analyze predictive features and place each case on a difficulty continuum. A CAT approach was used to ascertain the algorithms' performance and applicability to new data. This method provides an efficient way to benchmark data, using only a fraction of the dataset (less than 1%), and is more computationally efficient than traditional metrics. This novel metric, termed Machine Learning Capability (MLC), has additional benefits: it is unbiased to outcome classification and offers a standardized way to make model comparisons within and across datasets. MLC provides a metric on the limitations of supervised machine learning algorithms. In situations where the algorithm falls short, other input(s) are required for decision-making.
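To make the IRT and CAT ideas in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It treats each case as a test item with a discrimination `a` and a difficulty `b` under a standard two-parameter logistic (2PL) model, and treats the classifier as the examinee whose ability `theta` is estimated adaptively. The function names, the item bank, the standard normal prior, and the grid-search estimator are all assumptions chosen for illustration.

```python
import math

def p_correct(theta, a, b):
    """2PL IRT: probability that an examinee of ability theta answers
    an item with discrimination a and difficulty b correctly."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)

def estimate_theta(administered):
    """Grid-search MAP estimate of ability with a standard normal prior
    (an assumed estimator). `administered` holds (a, b, correct) tuples."""
    best_theta, best_score = 0.0, -math.inf
    for step in range(-400, 401):
        theta = step / 100.0
        score = -0.5 * theta * theta  # log of the normal prior, up to a constant
        for a, b, correct in administered:
            p = p_correct(theta, a, b)
            score += math.log(p) if correct else math.log(1.0 - p)
        if score > best_score:
            best_theta, best_score = theta, score
    return best_theta

def run_cat(items, answer_fn, max_items=5):
    """Simple CAT loop: repeatedly administer the most informative remaining
    item at the current ability estimate, then re-estimate ability.
    `items` maps a hypothetical case id to (a, b); `answer_fn(case_id)`
    returns True when the classifier labels that case correctly."""
    theta = 0.0
    administered = []
    remaining = dict(items)
    while remaining and len(administered) < max_items:
        case_id = max(remaining,
                      key=lambda i: item_information(theta, *remaining[i]))
        a, b = remaining.pop(case_id)
        administered.append((a, b, bool(answer_fn(case_id))))
        theta = estimate_theta(administered)
    return theta, len(administered)

# Hypothetical item bank: case id -> (discrimination, difficulty).
ITEM_BANK = {
    "case_1": (1.2, -1.5),
    "case_2": (0.8, -0.5),
    "case_3": (1.5, 0.0),
    "case_4": (1.0, 0.7),
    "case_5": (1.8, 1.5),
    "case_6": (0.9, 2.0),
}
```

For example, `run_cat(ITEM_BANK, lambda case_id: True, max_items=4)` returns an ability estimate after only four of the six cases have been administered, which mirrors the abstract's point that a CAT approach needs only a fraction of the dataset. A polytomous model, as used for the physical science dataset, would need a different response function than the 2PL form shown here.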


research
07/11/2018

Morse Code Datasets for Machine Learning

We present an algorithm to generate synthetic datasets of tunable diffic...
research
01/11/2019

Machine Learning Automation Toolbox (MLaut)

In this paper we present MLaut (Machine Learning AUtomation Toolbox) for...
research
03/31/2023

Evaluation Challenges for Geospatial ML

As geospatial machine learning models and maps derived from their predic...
research
03/28/2013

Relevance As a Metric for Evaluating Machine Learning Algorithms

In machine learning, the choice of a learning algorithm that is suitable...
research
07/14/2023

Multi-Dimensional Ability Diagnosis for Machine Learning Algorithms

Machine learning algorithms have become ubiquitous in a number of applic...
research
08/29/2018

Application of Machine Learning in Rock Facies Classification with Physics-Motivated Feature Augmentation

With recent progress in algorithms and the availability of massive amoun...
research
11/29/2018

BCCNet: Bayesian classifier combination neural network

Machine learning research for developing countries can demonstrate clear...
