Hostname: page-component-745bb68f8f-l4dxg Total loading time: 0 Render date: 2025-01-07T19:15:34.510Z Has data issue: false hasContentIssue false

A Study of the Classification Capabilities of Neural Networks Using Unsupervised Learning: A Comparison with K-means Clustering

Published online by Cambridge University Press:  01 January 2025

P. V. (Sundar) Balakrishnan
Affiliation:
Business Administration Program, University of Washington, Bothell
Martha C. Cooper*
Affiliation:
Department of Marketing, The Ohio State University
Varghese S. Jacob
Affiliation:
Department of Accounting and Mis, The Ohio State University
Phillip A. Lewis
Affiliation:
Department of Marketing, Rowan College of New Jersey
*
Requests for reprints should be sent to Martha Cooper, College of Business, The Ohio State University, 1775 College Road, Columbus, OH 43210-1399.

Abstract

Several neural networks have been proposed in the general literature for pattern recognition and clustering, but little empirical comparison with traditional methods has been done. The results reported here compare neural networks using Kohonen learning with a traditional clustering method (K-means) in an experimental design using simulated data with known cluster solutions. Two types of neural networks were examined, both of which used unsupervised learning to perform the clustering. One used Kohonen learning with a conscience and the other used Kohonen learning without a conscience mechanism. The performance of these nets was examined with respect to changes in the number of attributes, the number of clusters, and the amount of error in the data. Generally, the K-means procedure had fewer points misclassified while the classification accuracy of neural networks worsened as the number of clusters in the data increased from two to five.

Type
Original Paper
Copyright
Copyright © 1994 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

Acknowledgements: Sara Dickson, Vidya Nair, and Beth Means assisted with the neural network analyses.

References

Ahalt, S. C., Krishnamurthy, A. K., Chen, P., Melton, D. E. (1990). Competitive learning algorithms for vector quantization. Neural Networks, 3, 277290.CrossRefGoogle Scholar
Balakrishnan, P. V., Cooper, M. C., Jacob, V. S., & Lewis, P. A. (1992). A cluster analysis of brand choice data using neural networks. In Jacob, V. S. & Pirkul, H. (Eds.), Proceedings of the Twentieth International Business School Computer Users Group (pp. 365375). Columbus, OH.Google Scholar
Bezdek, J. C. (1990). A note on generalized self-organizing network algorithms. Proceedings of the Society of Photo Optical Instrumentation Engineers Applications of Artificial Intelligence VII (pp. 260267). Bellingham, WA: Society of Photo Optical Instrumentation Engineers.Google Scholar
Bezdek, J. C. (1991). Pattern recognition, neural networks, and artificial intelligence. In Trivedi, M. (Eds.), Proceedings of the Society of Photo Optical Instrumentation Engineers Applications of Artificial Intelligence VII (pp. 112). Bellingham, WA: Society of Photo Optical Instrumentation Engineers.Google Scholar
Bezdek, J. C., Bobrowski, L. (1990). Matching ‘boxy’ data: Clustering with the Minkowski norms. Proceedings of Third International Conference on Information Proceeding and Management of Uncertainty in Knowledge-Based Systems (pp. 295299). Paris, France: Ministere de la Recherche et de la Technologie.Google Scholar
Bezdek, J. C., Hathaway, R. J. (1990). Generalized regression and clustering. Proceedings of the International Conference on Fuzzy Logic and Neural Networks (pp. 575578). Iisuka, Japan: Kyusu Institute of Technology.Google Scholar
Bucklin, R. E., Srinivasan, V. (1991). Determining interbrand substitutability through survey measurement of consumer preference structures. Journal of Marketing Research, 27 (February), 5871.CrossRefGoogle Scholar
Caudill, M. (1987). Neural networks primer part I. A1 Expert, 8 (December), 4652.Google Scholar
Cooper, M. C., Milligan, G. W. (1988). The effect of measurement error on determining the number of clusters in cluster analysis. In Gaul, W., Schader, M. (Eds.), Data analysis, decision support and expert knowledge representation in marketing and related areas of research (pp. 319328). Berlin: Springer Verlag.Google Scholar
Cormack, R. M. (1971). A review of classification. Journal of the Royal Statistical Society, Series A, 134, 321367.CrossRefGoogle Scholar
DeSieno, D. (1988). Adding a conscience to competitive learning. Proceedings of the International Conference on Neural Networks (pp. 117124). San Diego, CA: IEEE Society.CrossRefGoogle Scholar
Dubes, R. C. (1987). How many clusters are best?—An experiment. Pattern Recognition, 20(6), 645663.CrossRefGoogle Scholar
Fort, J. C. (1991). Kohonen algorithm and the traveling salesman problem. In Murtagh, F. (Eds.), Neural networks for statistical and economic data (pp. 113118). Dublin: Munotec.Google Scholar
Gath, I., Geva, A. B. (1988). Unsupervised optimal fuzzy clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11(7), 773781.CrossRefGoogle Scholar
Hartigan, J. A. (1975). Clustering algorithms, New York: John Wiley and Sons.Google Scholar
Hecht-Nielsen, R. (1990). Neurocomputing, Reading, MA: Addison-Wesley.Google Scholar
Huntsberger, T. L., Ajjimarangee, P. (1990). Parallel self-organizing feature maps for unsupervised pattern recognition. International Journal of General Systems, 16(4), 357372.CrossRefGoogle Scholar
Kangas, J. A., Kohonen, T. K., Laaksonen, J. T. (1990). Variants of self-organizing maps. IEEE Transactions on Neural Networks, 1(1), 9399.CrossRefGoogle ScholarPubMed
Kohonen, T. (1982). Self-organized formation of topologically correct feature maps. Biological Cybernetics, 43, 5969.CrossRefGoogle Scholar
Kohonen, T. (1982). Analysis of a simple self organizing process. Biological Cybernetics, 44, 135140.CrossRefGoogle Scholar
Kohonen, T. (1989). Self organization and associative memory 3rd edition,, New York: Springer-Verlag.CrossRefGoogle Scholar
Krishnamurthy, A. K., Ahalt, S. C., Melton, D. E., Chen, P. (1990). Neural networks for vector quantization of speech and images. IEEE Journal on Selected Areas in Communications, 8, 14491457.CrossRefGoogle Scholar
Lippmann, R. P. (1987). An introduction to computing with neural nets. IEEE ASSP Magazine, 4, 422.CrossRefGoogle Scholar
MacQueen, J. B. (1967). Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, 281297.Google Scholar
Milligan, G. W. (1980). An examination of the effect of six types of error perturbation on fifteen clustering algorithms. Psychometrika, 45, 325342.CrossRefGoogle Scholar
Milligan, G. W. (1985). An algorithm for generating artificial test clusters. Psychometrika, 50, 123127.CrossRefGoogle Scholar
Milligan, G. W., Cooper, M. C. (1985). An examination of procedures for determining the number of clusters in a data set. Psychometrika, 50, 159179.CrossRefGoogle Scholar
Milligan, G. W., Cooper, M. C. (1987). Methodology review: Clustering methods. Applied Psychological Measurement, 11(4), 329354.CrossRefGoogle Scholar
Murtagh, F. (1991). A short survey of neural networks for forecasting. In Murtagh, F. (Eds.), Neural networks for statistical and economic data (pp. 8794). Dublin: Munotec.Google Scholar
Murtagh, F. (1991). Neural networks for classification and discrimination, Munchen, Germany: Space Telescope—European Coordinating Facility, European Southern Observatory, Karl-Schwarzschild-Strasse 2, D-8046 Garching bei Munchen.Google Scholar
Nelson, M. M., Illingworth, W. T. (1991). A practical guide to neural networks, Reading, MA: Addison Wesley.Google Scholar
NeuralWare (1991). Neuralworks Professional II/PLUS, Pittsburgh, PA: Author.Google Scholar
Openshaw, S., Wymer, C. (1991). A neural net classifer system for handling census data. In Murtagh, F. (Eds.), Proceedings Neural Networks for Statistical and Economic Data (pp. 7386). Dublin: Munotec Systems.Google Scholar
Osipenko, V. V. (1988). Solution of a double clusterization problem with the use of self-organization. Soviet Journal of Automation and Information Sciences, 21(3), 7782.Google Scholar
Rowher, R. (1991). Neural networks for time-varying data. In Murtagh, F. (Eds.), Proceedings Neural Networks for Statistical and Economic Data (pp. 5969). Dublin: Munotec Systems. (Supported by the Statistical Office of the European Communities, Luxembourg)Google Scholar
Rumelhart, D. E., McClelland, J. L. and the PDP Research Group (1987). Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Volume 1: Foundations, Cambridge, MA: MIT Press.Google Scholar
SAS Institute (1990). SAS User's Guide: Statistics Version 5 Edition,, Cary, NC: Author.Google Scholar
Srivastava, A., Murty, M. N. (1990). A comparison between conceptual clustering and conventional clustering. Pattern Recognition, 23(9), 975981.CrossRefGoogle Scholar
Wan, S. J., Wong, S. K. M., Prusinkiewicz, P. (1988). An algorithm for multidimensional data clustering. ACM Transactions on Mathematical Software, 14(2), 153162.CrossRefGoogle Scholar