Traditional Machine Learning Evaluation

Nathalie Japkowicz; Zois Boukouvalas

doi:10.1017/9781009003872.006

4 - Traditional Machine Learning Evaluation

from Part I - Preliminary Considerations

Published online by Cambridge University Press: 07 November 2024


Nathalie Japkowicz: Affiliation:
American University, Washington DC
Zois Boukouvalas: Affiliation:
American University, Washington DC


Summary

Chapter 4 reviews frequently used machine learning evaluation procedures. In particular, it presents popular evaluation metrics for binary and multi-class classification (e.g., accuracy, precision/recall, ROC analysis), regression analysis (e.g., mean squared error, root mean squared error, R-squared), and clustering (e.g., the Davies–Bouldin Index). It then reviews popular resampling approaches (e.g., holdout, cross-validation) and statistical tests (e.g., the t-test and the sign test). It concludes by explaining why it is important to go beyond these well-known methods in order to achieve reliable evaluation results in all cases.

Type: Chapter
Information: Machine Learning Evaluation: Towards Reliable and Responsible AI, pp. 51-80

DOI: https://doi.org/10.1017/9781009003872.006

Publisher: Cambridge University Press

Print publication year: 2024
