Sample size estimation for power and accuracy in the experimental comparison of algorithms

Felipe Campelo; Fernanda Takahashi

doi:10.1007/s10732-018-9396-7

Sample size estimation for power and accuracy in the experimental comparison of algorithms

Research output: Contribution to journal › Article › peer-review

Abstract

Experimental comparisons of performance represent an important aspect of research on optimization algorithms. In this work we present a methodology for defining the required sample sizes for designing experiments with desired statistical properties for the comparison of two methods on a given problem class. The proposed approach allows the experimenter to define desired levels of accuracy for estimates of mean performance differences on individual problem instances, as well as the desired statistical power for comparing mean performances over a problem class of interest. The method calculates the required number of problem instances, and runs the algorithms on each test instance so that the accuracy of the estimated differences in performance is controlled at the predefined level. Two examples illustrate the application of the proposed method, and its ability to achieve the desired statistical properties with a methodologically sound definition of the relevant sample sizes.

Original language	English
Pages (from-to)	305-338
Number of pages	34
Journal	Journal of Heuristics
Volume	25
Issue number	2
Early online date	4 Oct 2018
DOIs	https://doi.org/10.1007/s10732-018-9396-7
Publication status	Published - 15 Apr 2019

Bibliographical note

Keywords

Accuracy of parameter estimation
Experimental comparison of algorithms
Iterative sampling
Sample size estimation
Statistical methods

Access to Document

10.1007/s10732-018-9396-7

Sample size estimation for power and accuracy in the experimental comparison of algorithms
© Springer Nature B.V. 2018. The final publication is available at Springer via http://dx.doi.org/10.1007/s10732-018-9396-7
Accepted author manuscript, 1.22 MB

https://arxiv.org/pdf/1808.02997

Cite this

@article{0cfcc244360049f882ebe8042ab2ee6c,

title = "Sample size estimation for power and accuracy in the experimental comparison of algorithms",

abstract = "Experimental comparisons of performance represent an important aspect of research on optimization algorithms. In this work we present a methodology for defining the required sample sizes for designing experiments with desired statistical properties for the comparison of two methods on a given problem class. The proposed approach allows the experimenter to define desired levels of accuracy for estimates of mean performance differences on individual problem instances, as well as the desired statistical power for comparing mean performances over a problem class of interest. The method calculates the required number of problem instances, and runs the algorithms on each test instance so that the accuracy of the estimated differences in performance is controlled at the predefined level. Two examples illustrate the application of the proposed method, and its ability to achieve the desired statistical properties with a methodologically sound definition of the relevant sample sizes.",

keywords = "Accuracy of parameter estimation, Experimental comparison of algorithms, Iterative sampling, Sample size estimation, Statistical methods",

author = "Felipe Campelo and Fernanda Takahashi",

note = "{\textcopyright} Springer Nature B.V. 2018. The final publication is available at Springer via http://dx.doi.org/10.1007/s10732-018-9396-7",

year = "2019",

month = apr,

day = "15",

doi = "10.1007/s10732-018-9396-7",

language = "English",

volume = "25",

pages = "305--338",

journal = "Journal of Heuristics",

issn = "1381-1231",

publisher = "Springer",

number = "2",

}

TY - JOUR

T1 - Sample size estimation for power and accuracy in the experimental comparison of algorithms

AU - Campelo, Felipe

AU - Takahashi, Fernanda

N1 - © Springer Nature B.V. 2018. The final publication is available at Springer via http://dx.doi.org/10.1007/s10732-018-9396-7

PY - 2019/4/15

Y1 - 2019/4/15

N2 - Experimental comparisons of performance represent an important aspect of research on optimization algorithms. In this work we present a methodology for defining the required sample sizes for designing experiments with desired statistical properties for the comparison of two methods on a given problem class. The proposed approach allows the experimenter to define desired levels of accuracy for estimates of mean performance differences on individual problem instances, as well as the desired statistical power for comparing mean performances over a problem class of interest. The method calculates the required number of problem instances, and runs the algorithms on each test instance so that the accuracy of the estimated differences in performance is controlled at the predefined level. Two examples illustrate the application of the proposed method, and its ability to achieve the desired statistical properties with a methodologically sound definition of the relevant sample sizes.

AB - Experimental comparisons of performance represent an important aspect of research on optimization algorithms. In this work we present a methodology for defining the required sample sizes for designing experiments with desired statistical properties for the comparison of two methods on a given problem class. The proposed approach allows the experimenter to define desired levels of accuracy for estimates of mean performance differences on individual problem instances, as well as the desired statistical power for comparing mean performances over a problem class of interest. The method calculates the required number of problem instances, and runs the algorithms on each test instance so that the accuracy of the estimated differences in performance is controlled at the predefined level. Two examples illustrate the application of the proposed method, and its ability to achieve the desired statistical properties with a methodologically sound definition of the relevant sample sizes.

KW - Accuracy of parameter estimation

KW - Experimental comparison of algorithms

KW - Iterative sampling

KW - Sample size estimation

KW - Statistical methods

UR - https://link.springer.com/article/10.1007%2Fs10732-018-9396-7

UR - http://www.scopus.com/inward/record.url?scp=85054582106&partnerID=8YFLogxK

U2 - 10.1007/s10732-018-9396-7

DO - 10.1007/s10732-018-9396-7

M3 - Article

SN - 1381-1231

VL - 25

SP - 305

EP - 338

JO - Journal of Heuristics

JF - Journal of Heuristics

IS - 2

ER -

Sample size estimation for power and accuracy in the experimental comparison of algorithms

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

CAISEr: Comparison of Algorithms with Iterative Sample Size Estimation

Cite this