Statistical Significance Testing

Nathalie Japkowicz; Mohak Shah

doi:10.1017/CBO9780511921803.007

6 - Statistical Significance Testing

Published online by Cambridge University Press: 05 August 2011

Nathalie Japkowicz and

Mohak Shah

Show author details

Nathalie Japkowicz: Affiliation:
American University, Washington DC
Mohak Shah: Affiliation:
Praescivi Advisors

Book contents

Get access

Summary

The advances in performance measure characterization discussed in Chapters 3 and 4 have armed researchers with more precise estimates of classifier performance. However, these are not by themselves sufficient to fully evaluate the difference in performances between classifiers on one or more test domains. More precisely, even though the performance of different classifiers may be shown to be different on specified sets of data, it needs to be confirmed whether the observed differences are statistically significant and not merely coincidental. Chapter 5 started to look at this issue, but focused primarily on the objectivity and stability of the results. This can be construed as the first step to assessing the significance of a difference. Only in the case of the comparison of two classifiers on a single domain did the discussion actually move on to significance issues. Statistical significance testing, which is the subject of this chapter, enables researchers to move on to more precise assessments of significance of the results obtained (within certain constraints). The importance of statistical significance testing hence cannot be overstated. Nonetheless, the use of available statistical tools for such testing in the fields of machine learning and data mining has been limited at best. Researchers have concentrated on using the paired t test, many times inappropriately, to confirm the difference in classifiers' performance. Moreover, this has sometimes been done at the cost of excluding other, more appropriate, tests.

Type: Chapter
Information: Evaluating Learning Algorithms
A Classification Perspective
, pp. 206 - 291

DOI: https://doi.org/10.1017/CBO9780511921803.007 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2011

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

6 - Statistical Significance Testing

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive