In Chapter 10, the book turns to practical considerations. In particular, it surveys the software engineering discipline with its rigorous software testing methods and asks how these techniques can be adapted to machine learning. The adaptation is not straightforward, as machine learning algorithms behave in non-deterministic ways, a difficulty aggravated by data, algorithm, and platform imperfections. These issues are discussed and some of the steps taken to handle them are reviewed. The chapter then turns to the practice of online testing and addresses the ethics of machine learning deployment. It concludes with a discussion of current industry practice, along with suggestions on how to improve the safety of industrial deployment in the future.
Chapter 5 starts with an analysis of the classification metrics presented in Chapter 4, outlining their strengths and weaknesses. It then presents more advanced metrics such as Cohen’s kappa, Youden’s index, and likelihood ratios. This is followed by a discussion of data and classifier complexities, such as the class imbalance problem and classifier uncertainty, that require particular scrutiny to ensure that the results are trustworthy. The chapter concludes with a detailed discussion of ROC analysis, complementing its introduction in Chapter 4, and a presentation of other visualization metrics.
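As a concrete illustration, the sketch below computes two of the metrics named above, Cohen’s kappa and Youden’s index, with scikit-learn (the library used for the implementations on the book’s website); the labels are hypothetical.

```python
from sklearn.metrics import cohen_kappa_score, confusion_matrix

# Hypothetical ground-truth labels and classifier predictions.
y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

# Cohen's kappa: agreement between predictions and truth, corrected for chance.
kappa = cohen_kappa_score(y_true, y_pred)

# Youden's index J = sensitivity + specificity - 1, derived from the
# binary confusion matrix.
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
youden_j = tp / (tp + fn) + tn / (tn + fp) - 1

print(f"kappa = {kappa:.3f}, Youden's J = {youden_j:.3f}")
```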
Chapter 3 discusses the field of machine learning from a theoretical perspective. This review lays the groundwork for the treatment of advanced metrics in Chapter 5 and of error estimation methods in Chapter 6. The specific concepts surveyed in this chapter include loss functions, empirical risk, generalization error, empirical and structural risk minimization, regularization, and learning bias. The unsupervised learning paradigm is also reviewed, and the chapter concludes with a discussion of the bias/variance tradeoff.
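To make one of these concepts concrete, the following minimal sketch computes the empirical risk of a classifier under the 0-1 loss, that is, the average loss over an observed sample; the labels and predictions are hypothetical.

```python
import numpy as np

def zero_one_loss(y_true, y_pred):
    """0-1 loss: 1 for each misclassified example, 0 otherwise."""
    return (np.asarray(y_true) != np.asarray(y_pred)).astype(float)

def empirical_risk(y_true, y_pred, loss=zero_one_loss):
    """Empirical risk: the sample mean of the per-example losses."""
    return loss(y_true, y_pred).mean()

y_true = [0, 1, 1, 0, 1]
y_pred = [0, 1, 0, 0, 1]
print(empirical_risk(y_true, y_pred))  # 0.2 (one error in five examples)
```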
Chapter 9 is devoted to evaluation methods for an important category of classical learning paradigms left out of Chapter 8 so as to receive fuller coverage: unsupervised learning. In this chapter, a number of different unsupervised learning schemes are considered and their evaluation discussed. The particular tasks considered are clustering and hierarchical clustering, dimensionality reduction, latent variable modeling, and generative models including probabilistic PCA, variational autoencoders, and GANs. Evaluation methodology is discussed for each of these tasks.
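As one example of the methodology involved, the sketch below evaluates a clustering with the silhouette coefficient, an internal validity measure available in scikit-learn; the data are synthetic and the choice of three clusters is an assumption of the example.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic data with three well-separated clusters.
X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

# Cluster, then score cohesion vs. separation (silhouette lies in [-1, 1]).
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
print(f"silhouette = {silhouette_score(X, labels):.3f}")
```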
Chapter 11 completes the discussion of Chapter 10 by raising the question of how to practice machine learning in a responsible manner. It describes the dangers of data bias, and surveys data bias detection and mitigation methods; it lists the benefits of explainability and discusses techniques, such as LIME and SHAP, that have been proposed to explain the decisions made by opaque models; it underlines the risks of discrimination and discusses how to enhance fairness and prevent discrimination in machine learning algorithms. The issues of privacy and security are then presented, and the need to practice human-centered machine learning emphasized. The chapter concludes with the important issues of repeatability, reproducibility, and replicability in machine learning.
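LIME and SHAP themselves are covered in the chapter; as a simpler stand-in for model-agnostic explanation, the sketch below uses scikit-learn’s permutation importance, which ranks features by how much shuffling each one degrades held-out performance. The data and model are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

# Hypothetical data: 5 features, only 3 of which are informative.
X, y = make_classification(n_samples=500, n_features=5,
                           n_informative=3, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

model = RandomForestClassifier(random_state=0).fit(X_tr, y_tr)

# Permute each feature on the test set and measure the drop in accuracy.
result = permutation_importance(model, X_te, y_te, n_repeats=10,
                                random_state=0)
for i, imp in enumerate(result.importances_mean):
    print(f"feature {i}: importance {imp:.3f}")
```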
Chapter 1 discusses the motivation for the book and the rationale for its organization into four parts: preliminary considerations, evaluation for classification, evaluation in other settings, and evaluation from a practical perspective. In more detail, the first part provides the statistical tools necessary for evaluation and reviews the main machine learning principles as well as frequently used evaluation practices. The second part discusses the most common setting in which machine learning evaluation has been applied: classification. The third part extends the discussion to other paradigms such as multi-label classification, regression analysis, data stream mining, and unsupervised learning. The fourth part broadens the conversation by moving it from the laboratory setting to the practical setting, specifically discussing issues of robustness and responsible deployment.
Chapter 8 introduces evaluation procedures for paradigms other than classification. In particular, it discusses evaluation for classical problems such as regression analysis, time-series analysis, outlier detection, and reinforcement learning, along with evaluation approaches for newer tasks such as positive-unlabeled classification, ordinal classification, multi-label classification, image segmentation, text generation, data stream mining, and lifelong learning.
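As a small illustration of evaluation outside classification, the sketch below computes three standard regression metrics with scikit-learn; the data and model are hypothetical.

```python
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Hypothetical noisy regression problem.
X, y = make_regression(n_samples=200, n_features=4, noise=10.0, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

y_hat = LinearRegression().fit(X_tr, y_tr).predict(X_te)
print(f"MAE = {mean_absolute_error(y_te, y_hat):.2f}")
print(f"MSE = {mean_squared_error(y_te, y_hat):.2f}")
print(f"R^2 = {r2_score(y_te, y_hat):.3f}")
```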
In Chapter 7, the history of statistical analysis is reviewed and its legacy discussed. Four situations of interest to machine learning evaluation are subsequently discussed within different statistical paradigms: the comparison of two classifiers on a single domain; the comparison of multiple classifiers on a single domain; the comparison of two classifiers on multiple domains; and the comparison of multiple classifiers on multiple domains. The three statistical paradigms considered for each of these situations are the null hypothesis statistical testing (NHST) setting; an enhanced Fisher-flavored methodology that adds the notions of confidence intervals, effect size, and power analysis to NHST; and a newer approach based on Bayesian reasoning.
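To ground the first of these situations, the sketch below compares two classifiers on a single domain with a Wilcoxon signed-rank test over paired per-fold accuracies, an NHST-style procedure; the fold scores are hypothetical.

```python
from scipy.stats import wilcoxon

# Hypothetical accuracies of classifiers A and B on the same 10 folds.
acc_a = [0.81, 0.79, 0.84, 0.80, 0.78, 0.83, 0.82, 0.80, 0.79, 0.85]
acc_b = [0.78, 0.77, 0.80, 0.79, 0.76, 0.81, 0.80, 0.78, 0.77, 0.82]

# Nonparametric paired test: is the median accuracy difference zero?
stat, p_value = wilcoxon(acc_a, acc_b)
print(f"Wilcoxon statistic = {stat}, p = {p_value:.4f}")
```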
As machine learning gains widespread adoption and integration in a variety of applications, including safety- and mission-critical systems, the need for robust evaluation methods grows more urgent. This book compiles scattered information on the topic from research papers and blogs to provide a centralized resource that is accessible to students, practitioners, and researchers across the sciences. The book examines meaningful metrics for diverse types of learning paradigms and applications, unbiased estimation methods, rigorous statistical analysis, fair training sets, and meaningful explainability, all of which are essential to building robust and reliable machine learning products. In addition to standard classification, the book discusses unsupervised learning, regression, image segmentation, and anomaly detection. The book also covers topics such as industry-strength evaluation, fairness, and responsible AI. Implementations using Python and scikit-learn are available on the book's website.
Maximise student engagement and understanding of matrix methods in data-driven applications with this modern teaching package. Students are introduced to matrices in two preliminary chapters, before progressing to advanced topics such as the nuclear norm, proximal operators and convex optimization. Highlighted applications include low-rank approximation, matrix completion, subspace learning, logistic regression for binary classification, robust PCA, dimensionality reduction and Procrustes problems. Extensively classroom-tested, the book includes over 200 multiple-choice questions suitable for in-class interactive learning or quizzes, as well as homework exercises (with solutions available for instructors). It encourages active learning with engaging 'explore' questions, with answers at the end of each chapter, and Julia code examples to demonstrate how the mathematics is actually used in practice. A suite of computational notebooks offers a hands-on learning experience for students. This is a perfect textbook for upper-level undergraduates and first-year graduate students who have taken a prior course in linear algebra basics.
In many applications, dimensionality reduction is important. Uses of dimensionality reduction include visualization, removing noise, and decreasing compute and memory requirements, such as for image compression. This chapter focuses on low-rank approximation of a matrix. There are theoretical models for why big matrices should be approximately low rank. Low-rank approximations are also used to compress large neural network models to reduce computation and storage. The chapter begins with the classic approach to approximating a matrix by a low-rank matrix, using a nonconvex formulation that has a remarkably simple singular value decomposition solution. It then applies this approach to the source localization application via the multidimensional scaling method and to the photometric stereo application. It then turns to convex formulations of low-rank approximation based on proximal operators that involve singular value shrinkage. It discusses methods for choosing the rank of the approximation, and describes the optimal shrinkage method called OptShrink. It discusses related dimensionality reduction methods including (linear) autoencoders and principal component analysis. It applies the methods to learning low-dimensional subspaces from training data for subspace-based classification problems. Finally, it extends the method to streaming applications with time-varying data. This chapter bridges the classical singular value decomposition tool with modern applications in signal processing and machine learning.
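The classic approach mentioned above has a remarkably direct solution via the singular value decomposition; the following minimal NumPy sketch (hypothetical matrix and target rank) computes the best rank-r approximation in the Frobenius-norm sense.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((100, 60))  # hypothetical data matrix
r = 5                               # hypothetical target rank

# Truncated SVD: keep only the r largest singular values/vectors.
U, s, Vt = np.linalg.svd(A, full_matrices=False)
A_r = (U[:, :r] * s[:r]) @ Vt[:r, :]  # best rank-r approximation (Eckart-Young)

err = np.linalg.norm(A - A_r) / np.linalg.norm(A)
print(f"relative Frobenius error at rank {r}: {err:.3f}")
```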