Book contents
- Frontmatter
- Dedication
- Contents
- Preface
- 1 Introduction
- Part 1 Foundations
- 2 A Gentle Start
- 3 A Formal Learning Model
- 4 Learning via Uniform Convergence
- 5 The Bias-Complexity Trade-off
- 6 The VC-Dimension
- 7 Nonuniform Learnability
- 8 The Runtime of Learning
- Part 2 From Theory to Algorithms
- Part 3 Additional Learning Models
- Part 4 Advanced Theory
- Appendix A Technical Lemmas
- Appendix B Measure Concentration
- Appendix C Linear Algebra
- References
- Index
7 - Nonuniform Learnability
from Part 1 - Foundations
Published online by Cambridge University Press: 05 July 2014
Summary
The notions of PAC learnability discussed so far in the book allow the sample sizes to depend on the accuracy and confidence parameters, but they are uniform with respect to the labeling rule and the underlying data distribution. Consequently, classes that are learnable in that respect are limited (they must have a finite VC-dimension, as stated by Theorem 6.7). In this chapter we consider more relaxed, weaker notions of learnability. We discuss the usefulness of such notions and provide a characterization of the concept classes that are learnable using these definitions.
We begin this discussion by defining a notion of “nonuniform learnability” that allows the sample size to depend on the hypothesis to which the learner is compared. We then provide a characterization of nonuniform learnability and show that nonuniform learnability is a strict relaxation of agnostic PAC learnability. We also show that a sufficient condition for nonuniform learnability is that H is a countable union of hypothesis classes, each of which enjoys the uniform convergence property. These results will be proved in Section 7.2 by introducing a new learning paradigm, which is called Structural Risk Minimization (SRM). In Section 7.3 we specify the SRM paradigm for countable hypothesis classes, which yields the Minimum Description Length (MDL) paradigm. The MDL paradigm gives a formal justification to a philosophical principle of induction called Occam's razor. Next, in Section 7.4 we introduce consistency as an even weaker notion of learnability.
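For orientation, the following is a minimal sketch of the two central notions summarized above; the precise statements, the choice of weights, and the proofs appear in Sections 7.1 and 7.2 of the chapter. Nonuniform learnability lets the sample complexity depend on the competing hypothesis h, and the SRM rule balances empirical risk against a per-subclass complexity term.

\[
\forall h \in \mathcal{H},\ \forall \mathcal{D}:\quad
m \ge m^{\mathrm{NUL}}_{\mathcal{H}}(\epsilon,\delta,h)
\;\Longrightarrow\;
\Pr_{S \sim \mathcal{D}^m}\!\big[\, L_{\mathcal{D}}(A(S)) \le L_{\mathcal{D}}(h) + \epsilon \,\big] \ge 1-\delta .
\]

Writing \(\mathcal{H} = \bigcup_{n} \mathcal{H}_n\), where each \(\mathcal{H}_n\) has the uniform convergence property, and fixing weights \(w(n)\) with \(\sum_n w(n) \le 1\),

\[
\mathrm{SRM}(S) \in \operatorname*{argmin}_{h \in \mathcal{H}}
\Big[\, L_S(h) + \epsilon_{n(h)}\big(m,\, w(n(h))\cdot\delta\big) \,\Big],
\qquad n(h) = \min\{\, n : h \in \mathcal{H}_n \,\},
\]

where \(\epsilon_n(m,\delta)\) denotes the uniform-convergence rate of \(\mathcal{H}_n\) for samples of size \(m\).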
- Type: Chapter
- Information: Understanding Machine Learning: From Theory to Algorithms, pp. 58–72
- Publisher: Cambridge University Press
- Print publication year: 2014