Sample size calculations for the experimental comparison of multiple algorithms on multiple problem instances

08/05/2019
by Felipe Campelo, et al.

This work presents a statistically principled method for estimating the number of instances required in the experimental comparison of multiple algorithms on a given problem class of interest. The approach generalises earlier results by allowing researchers to design experiments based on the desired best, worst, mean or median-case statistical power to detect differences between algorithms larger than a certain threshold. Holm's step-down procedure is used to keep the overall significance level at the desired value without making the experiments overly conservative. This paper also presents an approach for sampling each algorithm on each instance, based on optimal sample size ratios that minimise the total required number of runs subject to a desired accuracy in the estimation of paired differences. A case study investigating the effect of 21 variants of a custom-tailored Simulated Annealing algorithm for a class of scheduling problems illustrates the application of the proposed sample size calculations in the experimental comparison of algorithms.
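For readers unfamiliar with the multiple-comparison correction named in the abstract, the following is a minimal sketch of the standard Holm step-down procedure. It is not taken from the paper: the function name `holm_reject`, its parameters, and the example p-values are illustrative assumptions, and the paper's own use of the procedure within its sample size calculations may differ in detail.

```python
def holm_reject(p_values, alpha=0.05):
    """Standard Holm step-down procedure (illustrative sketch, not the paper's code).

    Returns a list of booleans, in the original order of `p_values`,
    marking which null hypotheses are rejected at family-wise level `alpha`.
    """
    m = len(p_values)
    # Indices of the hypotheses sorted by ascending p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        # The k-th smallest p-value (k = rank + 1) is compared
        # against alpha / (m - k + 1) = alpha / (m - rank).
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            # Step-down rule: stop at the first non-rejection;
            # all remaining (larger) p-values are also not rejected.
            break
    return reject


# Hypothetical p-values from three pairwise algorithm comparisons.
example = [0.001, 0.015, 0.04]
print(holm_reject(example, alpha=0.05))  # [True, True, True]
```

With these hypothetical values, a plain Bonferroni correction (threshold 0.05 / 3 for every test) would fail to reject the third hypothesis, whereas Holm's procedure rejects all three while still controlling the family-wise error rate. This illustrates the sense in which the procedure avoids overly conservative experiments.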


research
08/09/2018

Sample size estimation for power and accuracy in the experimental comparison of algorithms

Experimental comparisons of performance represent an important aspect of...
research
05/21/2019

Assurance for sample size determination in reliability demonstration testing

Manufacturers are required to demonstrate products meet reliability targ...
research
05/25/2023

All about sample-size calculations for A/B testing: Novel extensions and practical guide

While there exists a large amount of literature on the general challenge...
research
12/23/2020

Comparison of Classification Algorithms Towards Subject-Specific and Subject-Independent BCI

Motor imagery brain computer interface designs are considered difficult ...
research
06/04/2020

Median regression with differential privacy

Median regression analysis has robustness properties which make it attra...
research
09/25/2014

Feature-based tuning of simulated annealing applied to the curriculum-based course timetabling problem

We consider the university course timetabling problem, which is one of t...
