Common Representations of Multimedia Features

K. Selçuk  Candan; Maria Luisa  Sapino

doi:10.1017/CBO9780511781636.004

3 - Common Representations of Multimedia Features

Published online by Cambridge University Press: 05 July 2014

K. Selçuk Candan and

Maria Luisa Sapino

Show author details

K. Selçuk Candan: Affiliation:
Arizona State University
Maria Luisa Sapino: Affiliation:
Università degli Studi di Torino, Italy

Book contents

Get access

Summary

Most features can be represented in the form of one (or more) of the four common base models: vectors, strings, graphs/trees, and fuzzy/probabilistic logic-based representations.

Many features, such as colors, textures, and shapes, are commonly represented in the form of histograms that quantify the contribution of each individual property (or feature instance) to themedia object.Given n different properties of interest, the vector model associates an n-dimensional feature vector space, where the ith dimension corresponds to the ith property. Thus, each vector describes the composition of a given multimedia data object in terms of its quantifiable properties.

Strings, on the other hand, are commonly used for representing media of sequential (or temporal) nature, when the ordinal relationships between events are more important than the quantitative differences between their occurrences. As we have seen in Section 2.3.6.4, because of their simplicity, string-based models are also used as less complex representations for more complex features, such as the spatial distributions of points of interest.

Graphs and trees are used for representing complex media, composed of other smaller objects/events that cannot be ordered to form sequences. Such media include hierarchical data, such as taxonomies and X3D worlds (which are easily represented as trees), and directed/undirected networks, such as hypermedia and social networks (where the edges of the graph represent explicit or implicit relationships between media objects or individuals).

When vectors, strings, trees, or graphs are not sufficient to represent the underlying imprecision of the data, fuzzy or probabilistic models can be used to deal with this complexity.

Type: Chapter
Information: Data Management for Multimedia Retrieval , pp. 99 - 142

DOI: https://doi.org/10.1017/CBO9780511781636.004 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2010

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

3 - Common Representations of Multimedia Features

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive