Hostname: page-component-745bb68f8f-d8cs5 Total loading time: 0 Render date: 2025-01-07T17:53:00.659Z Has data issue: false hasContentIssue false

Classification by Multivariate Analysis

Published online by Cambridge University Press:  01 January 2025

T. W. Anderson*
Affiliation:
Columbia University

Abstract

The problem considered is the use of a set of measurements on an individual to decide from which of several populations he has been drawn. It is assumed that in each population there is a probability distribution of the measurements. Principles for choosing the rule of classification are based on costs of misclassification. Optimum procedures are derived in general terms. If the measurements are normally distributed, the procedures use one discriminant function in the case of two populations and several discriminant functions in the cases of more populations. The numerical example given involves three normal populations.

Type
Original Paper
Copyright
Copyright © 1951 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

*

Sponsored in part by the Office of Naval Research.

The general theory described in this paper can be deduced as a special case of A. Wald's theory (9). M. A. Girshick presented some of this theory to the meeting of the Institute of Mathematical Statistics at Berkeley, June 16, 1949, in “Bayes, Minimax and Other Approaches to Multiple Classification Problems,” and G. W. Brown (2) described some of the results before the American Statistical Asaociation at Cleveland, December 27, 1948.

References

Anderson, T. W. The non-central Wishart distribution and certain problems of multivariate statistics. Ann. math. Stat., 1946, 17, 409431.CrossRefGoogle Scholar
Brown, George W. Basic principles for construction and application of discriminators. J. clin. Psychol., 1950, 6, 5861.3.0.CO;2-B>CrossRefGoogle Scholar
Fisher, R. A. The use of multiple measurements in taxonomic problems. Ann. Eugen., 1936, 7, 179188.CrossRefGoogle Scholar
Fisher, R. A. The statistical utilization of multiple measurements. Ann. Eugen., 1938, 8, 376386.CrossRefGoogle Scholar
Pearson, K. Tables for Statisticians and Biometricians, Part II. Cambridge University Press, 1931.Google Scholar
Rao, C. R. The utilization of multiple measurements in problems of biological classification. J. roy. stat. Soc., 1948, 10, 159193.CrossRefGoogle Scholar
von Mises, R. On the classification of observation data into distinct groups. Ann. math. Stat., 1945, 16, 6873.CrossRefGoogle Scholar
Wald, A. On a statistical problem arising in the classification of an individual into one of two groups. Ann. math. Stat., 1944, 15, 145163.CrossRefGoogle Scholar
Wald, A. Foundations of a general theory of sequential decision functions. Econometrica, 1947, 15, 279313.CrossRefGoogle Scholar
Welch, B. L. Note on discriminant functions. Biometrika, 1939, 13, 218220.Google Scholar