Block-regularized 5×2 Cross-validated McNemar's Test for Comparing Two Classification Algorithms

04/08/2023
by   Ruibo Wang, et al.
0

In the task of comparing two classification algorithms, the widely-used McNemar's test aims to infer the presence of a significant difference between the error rates of the two classification algorithms. However, the power of the conventional McNemar's test is usually unpromising because the hold-out (HO) method in the test merely uses a single train-validation split that usually produces a highly varied estimation of the error rates. In contrast, a cross-validation (CV) method repeats the HO method in multiple times and produces a stable estimation. Therefore, a CV method has a great advantage to improve the power of McNemar's test. Among all types of CV methods, a block-regularized 5×2 CV (BCV) has been shown in many previous studies to be superior to the other CV methods in the comparison task of algorithms because the 5×2 BCV can produce a high-quality estimator of the error rate by regularizing the numbers of overlapping records between all training sets. In this study, we compress the 10 correlated contingency tables in the 5×2 BCV to form an effective contingency table. Then, we define a 5×2 BCV McNemar's test on the basis of the effective contingency table. We demonstrate the reasonable type I error and the promising power of the proposed 5×2 BCV McNemar's test on multiple simulated and real-world data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2021

A New Approach to Multilabel Stratified Cross Validation with Application to Large and Sparse Gene Ontology Datasets

Multilabel learning is an important topic in machine learning research. ...
research
03/29/2017

Comparison of ontology alignment algorithms across single matching task via the McNemar test

Ontology alignment is widely used to find the correspondences between di...
research
01/29/2018

Tournament Leave-pair-out Cross-validation for Receiver Operating Characteristic (ROC) Analysis

Receiver operating characteristic (ROC) analysis is widely used for eval...
research
05/20/2017

( β, ϖ)-stability for cross-validation and the choice of the number of folds

In this paper, we introduce a new concept of stability for cross-validat...
research
03/13/2013

Estimation Stability with Cross Validation (ESCV)

Cross-validation (CV) is often used to select the regularization paramet...
research
12/29/2021

Application of the Pythagorean Expected Wins Percentage and Cross-Validation Methods in Estimating Team Quality

The Pythagorean Expected Wins Percentage Model was developed by Bill Jam...
research
02/06/2023

Error-rate Prediction for Mouse-based Rectangular-target Pointing with no Knowledge of Movement Angles

In rectangular-target pointing, movement angles towards targets are know...

Please sign up or login with your details

Forgot password? Click here to reset