Book contents
- Frontmatter
- Contents
- Preface
- Contributors
- 1 The Evolution of Object Categorization and the Challenge of Image Abstraction
- 2 A Strategy for Understanding How the Brain Accomplishes Object Recognition
- 3 Visual Recognition Circa 2008
- 4 On What It Means to See, and What We Can Do About It
- 5 Generic Object Recognition by Inference of 3-D Volumetric Parts
- 6 What Has fMRI Taught Us About Object Recognition?
- 7 Object Recognition Through Reasoning About Functionality: A Survey of Related Work
- 8 The Interface Theory of Perception: Natural Selection Drives True Perception to Swift Extinction
- 9 Words and Pictures: Categories, Modifiers, Depiction, and Iconography
- 10 Structural Representation of Object Shape in the Brain
- 11 Learning Hierarchical Compositional Representations of Object Structure
- 12 Object Categorization in Man, Monkey, and Machine: Some Answers and Some Open Questions
- 13 Learning Compositional Models for Object Categories from Small Sample Sets
- 14 The Neurophysiology and Computational Mechanisms of Object Representation
- 15 From Classification to Full Object Interpretation
- 16 Visual Object Discovery
- 17 Towards Integration of Different Paradigms in Modeling, Representation, and Learning of Visual Categories
- 18 Acquisition and Disruption of Category Specificity in the Ventral Visual Stream: The Case of Late Developing and Vulnerable Face-Related Cortex
- 19 Using Simple Features and Relations
- 20 The Proactive Brain: Using Memory-Based Predictions in Visual Recognition
- 21 Spatial Pyramid Matching
- 22 Visual Learning for Optimal Decisions in the Human Brain
- 23 Shapes and Shock Graphs: From Segmented Shapes to Shapes Embedded in Images
- 24 Neural Encoding of Scene Statistics for Surface and Object Inference
- 25 Medial Models for Vision
- 26 Multimodal Categorization
- 27 Comparing 2-D Images of 3-D Objects
- Index
- Plate section
13 - Learning Compositional Models for Object Categories from Small Sample Sets
Published online by Cambridge University Press: 20 May 2010
Summary
Introduction
Modeling object categories is a challenging task owing to the many structural variations between instances of the same category. There have been many nonhierarchical approaches to modeling object categories, all with limited success. Appearance-based models, which represent objects primarily by their photometric properties, such as global PCA, KPCA, fragments, SIFT features, and patches (Lowe 2004; Nayar et al. 1996; Ullman et al. 2001; Weber et al. 2000), tend to disregard geometric information about the positions of important keypoints within an object. Thus, they are not well suited for recognition in scenarios where pose, occlusion, or part reconfiguration are factors. Structure-based models, which include information about the relative or absolute positions of features, such as the constellation model and pictorial structures (Felzenszwalb and Huttenlocher 2005; Fischler and Elschlager 1973; Weber et al. 2000), are more powerful than appearance-based approaches because they can model relationships between groups of parts and thus improve recognition accuracy; however, they are rarely hierarchical and, as such, cannot account for radical transformations of part positions.
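To make the contrast concrete, the sketch below scores a placement of parts in the style of a pictorial-structures model: per-part appearance terms plus pairwise geometric deformation penalties, the geometric information that purely appearance-based representations discard. It is only an illustration under assumed conventions; the part names, feature values, expected offsets, and stiffness parameter are hypothetical and are not taken from this chapter.

```python
import numpy as np

# A minimal sketch (assumptions noted above): score one placement of parts in
# the style of a pictorial-structures model. All names and numbers are toy values.

def appearance_score(part_name, location, image_features):
    """How well local image evidence at `location` matches `part_name`
    (higher is better). Here the evidence is just a lookup table."""
    return image_features.get((part_name, location), 0.0)

def deformation_cost(loc_a, loc_b, expected_offset, stiffness=1.0):
    """Quadratic penalty on the deviation of the observed offset between two
    parts from the expected offset -- the geometric term that purely
    appearance-based representations omit."""
    observed = np.subtract(loc_b, loc_a)
    return stiffness * float(np.sum((observed - np.asarray(expected_offset)) ** 2))

def configuration_score(part_locations, edges, image_features):
    """Total score of a configuration: summed appearance terms minus summed
    pairwise deformation costs over the model's edges."""
    score = sum(appearance_score(p, loc, image_features)
                for p, loc in part_locations.items())
    score -= sum(deformation_cost(part_locations[a], part_locations[b], off)
                 for a, b, off in edges)
    return score

# Toy usage: a two-part model in which the "nose" is expected roughly
# (10, -5) pixels away from the "eye".
features = {("eye", (30, 40)): 0.9, ("nose", (40, 36)): 0.8}
locations = {"eye": (30, 40), "nose": (40, 36)}
edges = [("eye", "nose", (10, -5))]
print(configuration_score(locations, edges, features))  # 0.9 + 0.8 - 1.0 = 0.7
```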
Very recently there has been a resurgence of interest in modeling object categories using grammars (Jin and Geman 2006; Todorovic and Ahuja 2006; Zhu and Mumford 2006). Work by Fu (Fu 1981; You and Fu 1980) and Ohta (1985) in the 1970s and 1980s, and later by Dickinson and Siddiqi (Dickinson et al. 1992; Keselman and Dickinson 2001; Siddiqi et al. 199?), introduced such grammars to account for structural variation. Han and Zhu (2005) and Chen et al. (2006) used attributed graph grammars to describe rectilinear scenes and to model clothes, but these models were hard-coded for a single category of images.
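As a rough illustration of what an attributed grammar for object structure can look like, the fragment below encodes AND rules that compose a category from parts carrying relative-pose attributes, and OR rules that list alternative structural variants. This is a minimal sketch under assumed conventions, not the formulation used by any of the cited works; the symbols, parts, and offsets are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

# A minimal sketch of attributed production rules for a compositional object
# grammar; all symbols and values below are hypothetical.

@dataclass
class AndRule:
    """Expand a symbol into constituent parts, each with a relative-pose attribute."""
    head: str                           # nonterminal being expanded, e.g. "clock"
    parts: List[str]                    # constituent symbols
    offsets: List[Tuple[float, float]]  # relative placement of each constituent

@dataclass
class OrRule:
    """List alternative structural variants of the same symbol."""
    head: str
    alternatives: List[str]

# Toy grammar: a "clock" is composed of a frame and hands; the frame may be
# realized as either a round or a square variant.
toy_grammar = [
    AndRule("clock", ["frame", "hands"], [(0.0, -2.0), (0.0, 0.0)]),
    OrRule("frame", ["round_frame", "square_frame"]),
]
```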
- Type: Chapter
- Book: Object Categorization: Computer and Human Vision Perspectives, pp. 241-256
- Publisher: Cambridge University Press
- Print publication year: 2009