Hostname: page-component-745bb68f8f-cphqk Total loading time: 0 Render date: 2025-01-08T11:52:06.632Z Has data issue: false hasContentIssue false

Maximum Likelihood Methods in Treating Outliers and Symmetrically Heavy-Tailed Distributions for Nonlinear Structural Equation Models with Missing Data

Published online by Cambridge University Press:  01 January 2025

Sik-Yum Lee*
Affiliation:
The Chinese University of Hong Kong
Ye-Mao Xia
Affiliation:
The Chinese University of Hong Kong
*
Requests for reprints should be sent to S. Y. Lee, Department of Statistics, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong. E-mail: [email protected]

Abstract

By means of more than a dozen user friendly packages, structural equation models (SEMs) are widely used in behavioral, education, social, and psychological research. As the underlying theory and methods in these packages are vulnerable to outliers and distributions with longer-than-normal tails, a fundamental problem in the field is the development of robust methods to reduce the influence of outliers and the distributional deviation in the analysis. In this paper we develop a maximum likelihood (ML) approach that is robust to outliers and symmetrically heavy-tailed distributions for analyzing nonlinear SEMs with ignorable missing data. The analytic strategy is to incorporate a general class of distributions into the latent variables and the error measurements in the measurement and structural equations. A Monte Carlo EM (MCEM) algorithm is constructed to obtain the ML estimates, and a path sampling procedure is implemented to compute the observed-data log-likelihood and then the Bayesian information criterion for model comparison. The proposed methodologies are illustrated with simulation studies and an example.

Type
Original Paper
Copyright
Copyright © 2006 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

The research described herein was fully supported by a grant (CUHK 4243/03H) from the Rearch Grants Council of the Hong Kong Special Administration Region. The authors are thankful to the Editor, the Associate Editor, and anonymous reviewers for valuable comments which improve the paper significantly, and are grateful to ICPSR and the relevant funding agency for allowing the use of their data.

References

Bentler, P.M. (2004). EQS6: Structural equations program manual, Encino, CA: Multivariate Software.Google Scholar
Berkane, M., Bentler, P.M. (1988). Estimating of the contamination parameters and identification of outliers in multivariate data. Sociological Methods & Research, 17, 5564.CrossRefGoogle Scholar
Bowman, K.O., Shenton, L.R. (1988). Properties of estimators for the gamma distribution, Dordrecht: Marcel Dekker.Google Scholar
Browne, M.W. (1987). Robustness of statistical influence in factor analysis and related models. Biometrika, 74, 375384.Google Scholar
Browne, M.W., Shapiro, A. (1988). Robustness of normal theory methods in the analysis of linear latent variable models. British Journal of Mathematical and Statistical Psychology, 41, 193208.CrossRefGoogle Scholar
Campbell, N.A. (1982). Robust procedure in multivariate analysis I: Robust covariance estimation. Applied Statistics, 29, 231237.CrossRefGoogle Scholar
Cowles, M.K. (1996). Accelerating Monte Carlo Markov Chains convergence for cumulative-link generalized linear modes. Statistics and Computing, 6, 101111.CrossRefGoogle Scholar
Dempster, A.P., Laird, N.M., Rubin, D.B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society, Series B, 39, 138.CrossRefGoogle Scholar
Gelman, A., Meng, X.L. (1998). Simulating normalizing constants: From importance sampling to bridge sampling to path sampling. Statistical Science, 13, 163185.CrossRefGoogle Scholar
Geman, S., Geman, D. (1984). Stochastic relaxation, Gibbs distribution, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6, 721741.CrossRefGoogle ScholarPubMed
Hastings, W.K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57, 97100.CrossRefGoogle Scholar
Jöreskog, K.G., Sörbom, D. (1996). LISREL 8: Structural equation modelling with the SIMPLIS command language, London: Scientific Software International.Google Scholar
Kano, Y., Berkane, M., Bentler, P.M. (1993). Statistical inference based on pseudo-maximum likelihood estimators in elliptical populations. Journal of the American Statistical Association, 88, 135143.CrossRefGoogle Scholar
Kass, R.E., Raftery, A.E. (1995). Bayes factor. Journal of the American Statistical Association, 90, 773795.CrossRefGoogle Scholar
Lange, K.L., Little, R.J.A., Taylor, J. M. G. (1989). Robust statistical modelling using the t-distribution. Journal of the American Statistical Association, 84, 881896.Google Scholar
Lee, M., Lomax, R.G. (2005). The effects of varying degrees of nonnormality in structural equation modeling. Structural Equation Modeling, 12, 127.CrossRefGoogle Scholar
Lee, S.Y., Song, X.Y. (2004). Maximum likelihood analysis of a general latent variable model with hierarchically mixed data. Biometrics, 60, 624636.CrossRefGoogle ScholarPubMed
Lee, S.Y., Song, X.Y. (2003). Model comparison of nonlinear structural equation models with fixed covariates. Psychometrika, 68, 2747.CrossRefGoogle Scholar
Lee, S.Y., Song, X.Y., Lee, J.C.K. (2003). Maximum likelihood estimation of nonlinear structure models with ignorable missing data. Journal of Educational and Behavioral Statistics, 28, 111134.CrossRefGoogle Scholar
Lee, S.Y., Zhu, H.T. (2002). Maximum likelihood estimation of nonlinear structural equation models. Psychometrika, 67, 189210.CrossRefGoogle Scholar
Little, R.J.A. (1988). Robust estimation of mean and covariance matrix from data with missing values. Applied Statistics, 37, 2339.CrossRefGoogle Scholar
Little, R.J.A., Rubin, D.B. (1987). Statistical analysis with missing data, Dordrecht: Wiley.Google Scholar
Louis, T.A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society, Series B, 44, 226233.CrossRefGoogle Scholar
Mardia, V.V. (1970). Measures of multivariate skewness and kurtosis with application. Biometrika, 57, 519530.CrossRefGoogle Scholar
Meng, X.L., Rubin, D.B. (1993). Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika, 80, 267278.CrossRefGoogle Scholar
Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N., Teller, A.H., Teller, E. (1953). Equations of state calculations by fast computing machines. Journal of Chemical Physics, 21, 10871092.CrossRefGoogle Scholar
Ogasawara, H. (2005). Asymptotic robustness of the asymptotic bias in structural equation modeling. Computational Statistics & Data Analysis, 49, 771783.CrossRefGoogle Scholar
Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6, 461464.CrossRefGoogle Scholar
Song, X.Y., Lee, S.Y. (2005). Maximum likelihood analysis of nonlinear structural equation models with dichotomons variables. Multivariate Behavioral Research, 40, 151177.CrossRefGoogle ScholarPubMed
Song, X.Y., Lee, S.Y. (2004). Bayesian analysis of two-level nonlinear structural equation models with continuous and polytomous data. British Journal of Mathematical and Statistical Psychology, 57, 2952.CrossRefGoogle ScholarPubMed
Watanabe, M., Yamaguchi, K. (2004). The EM algorithm and relate statistical models, Dordrecht: Marcel Dekker.Google Scholar
Wei, G.C.G., Tanner, M.A. (1990). Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithms (in theory and methods). Journal of the American Statistical Association, 85, 699704.CrossRefGoogle Scholar
Yuan, K.H., Bentler, P.M. (1997). Mean and covariance structure analysis: theoretical and practical improvements (in theory and methods). Journal of the American Statistical Association, 92, 767774.CrossRefGoogle Scholar
Yuan, K.H., Bentler, P.M. (1998a). Robust mean and covariance structure analysis. British Journal of Mathematical and Statistical Psychology, 51, 6388.CrossRefGoogle ScholarPubMed
Yuan, K.H., Bentler, P.M. (1998b). Structural equation modelling with robust covariance. Sociological Methodology, 28, 363396.CrossRefGoogle Scholar
Yuan, K.H., Bentler, P.M. (2000). Robust mean and covariance structure analysis through iteratively reweighted least squares. Psychometrika, 65, 4358.CrossRefGoogle Scholar