Skip to main content Accessibility help
×
Hostname: page-component-cd9895bd7-dzt6s Total loading time: 0 Render date: 2024-12-26T13:50:16.436Z Has data issue: false hasContentIssue false

6 - Cluster Analysis

from II - Factors and Groupings

Published online by Cambridge University Press:  05 June 2014

Inge Koch
Affiliation:
University of Adelaide
Get access

Summary

There is no sense in being precise when you don't even know what you're talking about (John von Neumann, 1903–1957).

Introduction

Cluster Analysis is an exploratory technique which partitions observations into different clusters or groupings. In medicine, biology, psychology, marketing or finance, multivariate measurements of objects or individuals are the data of interest. In biology, human blood cells of one or more individuals – such as the HIV flow cytometry data – might be the objects one wants to analyse. Cells with similar multivariate responses are grouped together, and cells whose responses differ considerably from each other are partitioned into different clusters. The analysis of cells from a number of individuals such as HIV+ and HIV− individuals may result in different cluster patterns. These differences are informative for the biologist and might allow him or her to draw conclusions about the onset or progression of a disease or a patient's response to treatment.

Clustering techniques are applicable whenever a mountain of data needs to be grouped into manageable and meaningful piles. In some applications we know that the data naturally fall into two groups, such as HIV+ or HIV−, but in many cases the number of clusters is not known. The goal of Cluster Analysis is to determine

  1. • the cluster allocation for each observation, and

  2. • the number of clusters.

For some clustering methods – such as k-means – the user has to specify the number of clusters prior to applying the method.

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2013

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure [email protected] is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

  • Cluster Analysis
  • Inge Koch, University of Adelaide
  • Book: Analysis of Multivariate and High-Dimensional Data
  • Online publication: 05 June 2014
  • Chapter DOI: https://doi.org/10.1017/CBO9781139025805.008
Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

  • Cluster Analysis
  • Inge Koch, University of Adelaide
  • Book: Analysis of Multivariate and High-Dimensional Data
  • Online publication: 05 June 2014
  • Chapter DOI: https://doi.org/10.1017/CBO9781139025805.008
Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

  • Cluster Analysis
  • Inge Koch, University of Adelaide
  • Book: Analysis of Multivariate and High-Dimensional Data
  • Online publication: 05 June 2014
  • Chapter DOI: https://doi.org/10.1017/CBO9781139025805.008
Available formats
×