Statistical Analysis

Nathalie Japkowicz; Zois Boukouvalas

doi:10.1017/9781009003872.010

7 - Statistical Analysis

from Part II - Evaluation for Classification

Published online by Cambridge University Press: 07 November 2024

Nathalie Japkowicz and

Zois Boukouvalas

Show author details

Nathalie Japkowicz: Affiliation:
American University, Washington DC
Zois Boukouvalas: Affiliation:
American University, Washington DC

Book contents

Get access

Summary

In Chapter 7, the history of statistical analysis is reviewed and its legacy discussed. Four situations of interest to machine learning evaluation are subsequently discussed within different statistical paradigms: the comparison of two classifiers on a single domain; the comparison of multiple classifiers on a single domain; the comparison of two classifiers on multiple domains; and the comparison of multiple classifiers on multiple domains. The three statistical paradigms considered for each of these situations are the null hypothesis statistical testing (NHST) setting; an enhanced Fisher-flavored methodology that adds the notions of confidence intervals, effect size, and power analysis to NHST; and a newer approach based on Bayesian reasoning.

Type: Chapter
Information: Machine Learning Evaluation
Towards Reliable and Responsible AI
, pp. 154 - 208

DOI: https://doi.org/10.1017/9781009003872.010 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2024

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

7 - Statistical Analysis

Summary

Access options

Book purchase

Temporarily unavailable

Book contents

7 - Statistical Analysis

Summary

Access options

Book purchase

Temporarily unavailable

Save book to Kindle

Save book to Dropbox

Save book to Google Drive