TY - JOUR
T1 - Sample size estimation for power and accuracy in the experimental comparison of algorithms
AU - Campelo, Felipe
AU - Takahashi, Fernanda
N1 - © Springer Nature B.V. 2018. The final publication is available at Springer via http://dx.doi.org/10.1007/s10732-018-9396-7
PY - 2019/4/15
Y1 - 2019/4/15
N2 - Experimental comparisons of performance represent an important aspect of research on optimization algorithms. In this work we present a methodology for defining the required sample sizes for designing experiments with desired statistical properties for the comparison of two methods on a given problem class. The proposed approach allows the experimenter to define desired levels of accuracy for estimates of mean performance differences on individual problem instances, as well as the desired statistical power for comparing mean performances over a problem class of interest. The method calculates the required number of problem instances, and runs the algorithms on each test instance so that the accuracy of the estimated differences in performance is controlled at the predefined level. Two examples illustrate the application of the proposed method, and its ability to achieve the desired statistical properties with a methodologically sound definition of the relevant sample sizes.
AB - Experimental comparisons of performance represent an important aspect of research on optimization algorithms. In this work we present a methodology for defining the required sample sizes for designing experiments with desired statistical properties for the comparison of two methods on a given problem class. The proposed approach allows the experimenter to define desired levels of accuracy for estimates of mean performance differences on individual problem instances, as well as the desired statistical power for comparing mean performances over a problem class of interest. The method calculates the required number of problem instances, and runs the algorithms on each test instance so that the accuracy of the estimated differences in performance is controlled at the predefined level. Two examples illustrate the application of the proposed method, and its ability to achieve the desired statistical properties with a methodologically sound definition of the relevant sample sizes.
KW - Accuracy of parameter estimation
KW - Experimental comparison of algorithms
KW - Iterative sampling
KW - Sample size estimation
KW - Statistical methods
UR - https://link.springer.com/article/10.1007%2Fs10732-018-9396-7
UR - http://www.scopus.com/inward/record.url?scp=85054582106&partnerID=8YFLogxK
U2 - 10.1007/s10732-018-9396-7
DO - 10.1007/s10732-018-9396-7
M3 - Article
SN - 1381-1231
VL - 25
SP - 305
EP - 338
JO - Journal of Heuristics
JF - Journal of Heuristics
IS - 2
ER -