Book contents
- Frontmatter
- Contents
- Preface
- 1 Introduction
- Part one Pattern Classification with Binary-Output Neural Networks
- Part two Pattern Classification with Real-Output Networks
- Part three Learning Real-Valued Functions
- Part four Algorithmics
- 22 Efficient Learning
- 23 Learning as Optimization
- 24 The Boolean Perceptron
- 25 Hardness Results for Feed-Forward Networks
- 26 Constructive Learning Algorithms for Two-Layer Networks
- Appendix 1 Useful Results
- Bibliography
- Author index
- Subject index
23 - Learning as Optimization
Published online by Cambridge University Press: 26 February 2010
Summary
Introduction
The previous chapter demonstrated that efficient SEM and approximate-SEM algorithms for graded classes F = ∪ₙFₙ give rise to efficient learning algorithms, provided the expressive power of Fₙ grows polynomially with n (in, respectively, the binary classification and real prediction learning models). In this chapter we show that randomized SEM and approximate-SEM algorithms suffice, and that a converse result also holds: if efficient learning is possible, then there must exist an efficient randomized approximate-SEM algorithm. (Hence, for the case of a binary function class, there must be an efficient randomized SEM algorithm.) This will establish that, in both models of learning, efficient learning is intimately related to the optimization problem of finding a hypothesis with small sample error.
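To make the reduction concrete, here is a minimal sketch in Python. The names are hypothetical placeholders introduced for illustration, not from the text: `m` is a sample-size function large enough for uniform convergence, `draw_example` samples one labelled example from the unknown distribution, and `sem_oracle` stands for the assumed efficient (possibly randomized) SEM algorithm. The learner simply draws a sufficiently large sample and returns whatever hypothesis the SEM algorithm outputs on it.

```python
def learn_by_sem(epsilon, delta, m, draw_example, sem_oracle):
    """Sketch: learning reduces to sample error minimization.

    m(epsilon, delta) -- a sample size large enough that sample error
        is uniformly close to true error (such a polynomial bound
        exists when the expressive power of F_n grows polynomially);
    draw_example()    -- one labelled example (x, y) drawn from the
        unknown distribution;
    sem_oracle(S)     -- a hypothesis from F_n with (approximately)
        minimal error on the sample S.
    """
    sample = [draw_example() for _ in range(m(epsilon, delta))]
    # With probability at least 1 - delta, a hypothesis with small
    # sample error has true error within epsilon of the best in the class.
    return sem_oracle(sample)
```

The chapter's converse runs the reduction in the other direction: an efficient learner, fed a suitable sample, can be used as an efficient randomized approximate-SEM algorithm.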
Randomized Algorithms
For our purposes, a randomized algorithm has available to it a random number generator that produces a sequence of independent, uniformly distributed bits. We shall assume that examining one bit of this random sequence takes one unit of time. (It is sometimes convenient to assume that the algorithm has access to a sequence of independent uniformly distributed integers in the set {0, 1, …, I}, for some I ≥ 1; it is easy to construct such a sequence from a sequence of random bits.) The randomized algorithm A uses these random bits as part of its input, but it is useful to think of this input as somehow ‘internal’ to the algorithm, and to think of the algorithm as defining a mapping from an ‘external’ input to a probability distribution over outputs.
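As an illustration of that parenthetical remark, one standard construction is rejection sampling: read just enough bits to cover {0, 1, …, I} and retry whenever the resulting integer is too large. The sketch below is not from the text; `uniform_int` and `next_bit` are names introduced here for illustration.

```python
import random

def uniform_int(I, next_bit=lambda: random.getrandbits(1)):
    """Draw an integer uniformly from {0, 1, ..., I} using only a
    source of independent, unbiased random bits (rejection sampling)."""
    k = I.bit_length()  # k = ceil(log2(I+1)) bits cover 0..I
    while True:
        value = 0
        for _ in range(k):
            value = (value << 1) | next_bit()  # read one random bit
        if value <= I:  # reject values outside {0, ..., I}
            return value
```

Since 2^(k−1) ≤ I, each attempt succeeds with probability (I+1)/2^k > 1/2, so the expected number of bits examined is less than 2k; under the unit-cost assumption on bit examinations, the construction preserves polynomial running time.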
Neural Network Learning: Theoretical Foundations, pp. 307–315. Cambridge University Press. Print publication year: 1999.