Comprehensive Algorithm Portfolio Evaluation using Item Response Theory

07/29/2023
by   Sevvandi Kandanaarachchi, et al.
0

Item Response Theory (IRT) has been proposed within the field of Educational Psychometrics to assess student ability as well as test question difficulty and discrimination power. More recently, IRT has been applied to evaluate machine learning algorithm performance on a single classification dataset, where the student is now an algorithm, and the test question is an observation to be classified by the algorithm. In this paper we present a modified IRT-based framework for evaluating a portfolio of algorithms across a repository of datasets, while simultaneously eliciting a richer suite of characteristics - such as algorithm consistency and anomalousness - that describe important aspects of algorithm performance. These characteristics arise from a novel inversion and reinterpretation of the traditional IRT model without requiring additional dataset feature computations. We test this framework on algorithm portfolios for a wide range of applications, demonstrating the broad applicability of this method as an insightful algorithm evaluation tool. Furthermore, the explainable nature of IRT parameters yield an increased understanding of algorithm portfolios.

READ FULL TEXT

page 10

page 16

page 17

page 18

page 21

page 25

page 35

page 38

research
03/10/2019

β^3-IRT: A New Item Response Model and its Applications

Item Response Theory (IRT) aims to assess latent abilities of respondent...
research
05/27/2019

Enhancing Item Response Theory for Cognitive Diagnosis

Cognitive diagnosis is a fundamental and crucial task in many educationa...
research
08/17/2021

BOBCAT: Bilevel Optimization-Based Computerized Adaptive Testing

Computerized adaptive testing (CAT) refers to a form of tests that are p...
research
02/24/2018

PSO-based Fuzzy Markup Language for Student Learning Performance Evaluation and Educational Application

This paper proposes an agent with particle swarm optimization (PSO) base...
research
05/28/2016

Building an Evaluation Scale using Item Response Theory

Evaluation of NLP methods requires testing against a previously vetted g...
research
07/19/2023

Amortised Design Optimization for Item Response Theory

Item Response Theory (IRT) is a well known method for assessing response...
research
09/09/2019

Curve Fitting from Probabilistic Emissions and Applications to Dynamic Item Response Theory

Item response theory (IRT) models are widely used in psychometrics and e...

Please sign up or login with your details

Forgot password? Click here to reset