High-Dimensional Maximum Marginal Likelihood Item Factor Analysis by Adaptive Quadrature

Stephen Schilling; R. Darrell Bock

doi:10.1007/s11336-003-1141-x

High-Dimensional Maximum Marginal Likelihood Item Factor Analysis by Adaptive Quadrature

Published online by Cambridge University Press: 01 January 2025

Stephen Schilling and

R. Darrell Bock

Show author details

Stephen Schilling*: Affiliation:
School of Education, University of Michigan
R. Darrell Bock: Affiliation:
Center for Health Statistics, University of Illinois at Chicago
*: Requests for reprints should be sent to Stephen Schilling, Assistant Professor, University of Michigan, School of Education, Ann Arbor, MI 48109, USA. E-mail: [email protected]

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Although the Bock–Aitkin likelihood-based estimation method for factor analysis of dichotomous item response data has important advantages over classical analysis of item tetrachoric correlations, a serious limitation of the method is its reliance on fixed-point Gauss-Hermite (G-H) quadrature in the solution of the likelihood equations and likelihood-ratio tests. When the number of latent dimensions is large, computational considerations require that the number of quadrature points per dimension be few. But with large numbers of items, the dispersion of the likelihood, given the response pattern, becomes so small that the likelihood cannot be accurately evaluated with the sparse fixed points in the latent space. In this paper, we demonstrate that substantial improvement in accuracy can be obtained by adapting the quadrature points to the location and dispersion of the likelihood surfaces corresponding to each distinct pattern in the data. In particular, we show that adaptive G-H quadrature, combined with mean and covariance adjustments at each iteration of an EM algorithm, produces an accurate fast-converging solution with as few as two points per dimension. Evaluations of this method with simulated data are shown to yield accurate recovery of the generating factor loadings for models of up to eight dimensions. Unlike an earlier application of adaptive Gibbs sampling to this problem by Meng and Schilling, the simulations also confirm the validity of the present method in calculating likelihood-ratio chi-square statistics for determining the number of factors required in the model. Finally, we apply the method to a sample of real data from a test of teacher qualifications.

Keywords

factor analysis item response theory latent variables EM algorithm marginal likelihood estimation GLS estimation adaptive quadrature monte carlo integration

Type: Original Paper
Information: Psychometrika , Volume 70 , Issue 3 , September 2005 , pp. 533 - 555

DOI: https://doi.org/10.1007/s11336-003-1141-x [Opens in a new window]
Copyright: Copyright © 2005 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Ahrens, J.H., Dieter, U. (1979). Computer methods for sampling from the exponential and normal distributions. Communications of the Association for Computing Machinery, 15, 873–882.CrossRef Google Scholar

Ansari, A., Jedidi, K. (2000). Bayesian factor analysis for multilevel binary observations. Psychometrika, 65(4), 475–496.CrossRef Google Scholar

Bartholomew, D.J., Knott, M. (1999). Latent Variable Models and Factor Analysis. Oxford: New York.Google Scholar

Bock, R.D. (1975/1985) Multivariate Statistical Methods in Behavioral Research. McGraw-Hill, New York; 1985 reprint, Chicago: Scientific Software InternationalGoogle Scholar

Bock, R.D., Lieberman, M. (1970). Fitting a response model for dichotomously scored items. Psychometrika, 35, 179–197.CrossRef Google Scholar

Bock, R.D., Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: application of an EM algorithm. Psychometrika, 46, 443–459.CrossRef Google Scholar

Bock, R.D., Gibbons, R.D., Muraki, E. (1987). Full information item factor analysis. Applied Psychological Measurement, 12(3), 261–280.CrossRef Google Scholar

Bock, R.D., Schilling, S.G. (1997). High-dimensional full-information item factor analysis. In Birkane, M. (Eds.), Latent Variable Modeling and Applications to Causality (pp. 163–176). New-York: Springer.CrossRef Google Scholar

Bock, R.D., Gibbons, R.D., Muraki, E., Schilling, S.G., Wilson, D.T., Wood, R. (1999). TESTFACT 3: Test Scoring, Item Statistics, and Full-information Item Factor Analysis. Chicago: Scientific Software International.Google Scholar

Divgi, D.R. (1979). Calculation of the tetrachoric correlation coefficient. Psychometrika, 44, 169–172.CrossRef Google Scholar

Ferguson, G.A. (1941). The factorial interpretation of test difficulty. Psychometrika, 6, 323–329.CrossRef Google Scholar

Fox, J.P., Glas, C.A.W. (2001). Bayesian estimation of a multilevel IRT model using Gibbs sampling. Psychometrika, 66(2), 271–288.CrossRef Google Scholar

Guilford, J.P. (1941). The difficulty of a test and its factor composition. Psychometrika, 6, 66–77.CrossRef Google Scholar

Haberman, S.J. (1977). Log-linear models and frequency tables with small expected cell counts. Annals of Statistics, 5, 1148–1169.CrossRef Google Scholar

Harman, H.H. (1987). Modern Factor Analysis. Chicago: University of Chicago Press.Google Scholar

Hedeker, D., Gibbons, R.D. (1994). A random-effects ordinal regression model for multilevel analysis. Biometrics, 50, 933–944.CrossRef Google Scholar PubMed

Hill, H.C., Schilling, S.G., Ball, D.L. (2004) Developing measures of teachers mathematics knowledge for teaching. Elementary School Journal, in press.CrossRef Google Scholar

Householder, A.S. (1964). The Theory of Matrices in Numerical Analysis. New York: Blaisdell.Google Scholar

Kaiser, H.F. (1958). The varimax criterion for analytic rotation in factor analysis. Psychometrika, 23, 187–200.CrossRef Google Scholar

Leonelli, B.T., Chang, C.H., Bock, R.D., Schilling, S.G. (2000). A full-information item factor analysis interpretation of the MMPI-2: Normative Sampling with Non-pathonomic Descriptors. Journal of Personality Assessment, 74(3), 400–422.CrossRef Google Scholar PubMed

Lesaffre, E., Spiessens, B. (2001). On the effect of the number of quadrature points in a logistic random-effects model: an example. Applied Statistics, 50, 325–335.Google Scholar

Lindstrom, M.J., Bates, D.M. (1990). Nonlinear mixed effects models for repeated measures data. Biometrics, 46, 673–687.CrossRef Google Scholar PubMed

Liu, C., Rubin, D.B., Wu, Y.N. (1998). Parameter expansion to accelerate EM: The PX-EM algorithm. Biometrika, 85(4), 755–770.CrossRef Google Scholar

Liu, Q., Pierce, D.A. (1994). A note on G-H quadrature. Biometrika, 81(3), 624–629.Google Scholar

Meng, X.L., Schilling, S.G. (1996). Fitting full-information factor models and an empirical investigation of bridge sampling. Journal of the American Statistical Association, 91, 1254–1267.CrossRef Google Scholar

Mislevy, R.J. (1984). Estimating latent distributions. Psychometrika, 49(3), 359–381.CrossRef Google Scholar

Muthén, B.O. (1984). A general structural equation model with dichotomous, ordered categorical and continuous latent variable indicators. Psychometrika, 49, 115–132.CrossRef Google Scholar

Muthén, L. K., Muthén, B.O. (1998–2001). Mplus User’s Guide (Second edition). Muthén & Muthén, Los Angeles CA.Google Scholar

Naylor, J.C., Smith, A.F.M. (1982). Applications of a method for the efficient computation of posterior distributions. Applied Statistics, 31, 214–225.CrossRef Google Scholar

Polak, E. (1971). Computational Methods in Optimization. New York: Academic Press.Google Scholar

Powell, M.J.D. (1964). An efficient method for several variables without calculating derivatives. Computer Journal, 7, 155–162.CrossRef Google Scholar

Rabe-Hesketh, S., Pickles, A., Skrondal, A., (2001) GLLAMM Manual. Tech. rept. 2001/01. Department of Biostatistics and Computing, Institute of Psychiatry, King’s College, University of London. Downloadable from http://www.gllamm.org.Google Scholar

Rabe-Hesketh, S., Skrondal, A., Pickles, A. (2002). Reliable estimation of generalized linear mixed models using adaptive quadrature. The Stata Journal, 2, 1–21.CrossRef Google Scholar

Rabe-Hesketh, S., Skrondal, A., Pickles, A. (2005). Generalized multilevel structural equation modeling. Psychometrika, 69, 167–190.CrossRef Google Scholar

Rabe-Hesketh, S., Skrondal, A., Pickles, A. (2005b) Maximum likelihood estimation of limited and discrete dependent variable models with nested random effects. Journal of Econometrics, in press.CrossRef Google Scholar

Ramsay, J.O. (1998). Estimating smooth monotone functions. Journal of the Royal Statistical Society, Series B, 60, 365–375.CrossRef Google Scholar

Raudenbush, S.W., Yang, M.Yosef (2000). Maximum likelihood for generalized linear models with nested random effects via high-order, multivariate Laplace approximation. Journal of Computational and Graphical Statistics, 9(1), 141–157.CrossRef Google Scholar

Raudenbush, S.W., Birk, A.S. (2002). Maximum likelihood for generalized linear models with nested random effects via high-order, multivariate Laplace approximation. Journal of Computational and Graphical Statistics, 9(1), 141–157.CrossRef Google Scholar

Ripley, B.D. (1987). Stochastic Simulation. New York: Wiley.CrossRef Google Scholar

Schilling, S.G. (1993) Advances in Full Information Item Factor Analysis using the Gibbs Sampler. (Unpublished doctoral dissertation, University of Chicago)Google Scholar

Schrage, L. (1979). A more portable fortran random number generator. Association for Computing Machinery: Transactions on Mathematical Software, 5, 132–138.CrossRef Google Scholar

Thurstone, L.L. (1947). Multiple Factor Analysis. Chicago: The University of Chicago Press.Google Scholar

Thurstone, L.L., Thurstone, T.G. (1941). Factorial studies of intelligence. Psychometric Monographs No. 2. Chicago: University of Chicago Press.Google Scholar

Tierney, L., Kadane, J.B. (1986). Accurate approximations for posterior moments and marginal densities. Journal of the American Statistical Association, 81, 82–86.CrossRef Google Scholar

Wei, G.C.G., Tanner, M.A. (1990). A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithm. Journal of the American Statistical Association, 85, 699–704.CrossRef Google Scholar

Wood, R., Wilson, D.T., Gibbons, R.D., Schilling, S.G., Muraki, E., Bock, R.D. (2003). TESTFACF 4: Test Scoring, Item Statistics, and Full-information Item Factor Analysis. Chicago: Scientific Software International.Google Scholar

Zimowski, M.F., Muraki, E., Mislevy, R.J., Bock, R.D. (1995). BILOG-MG: multiple-group item analysis and test scoring. Chicago: Scientific Software International.Google Scholar

Article contents

High-Dimensional Maximum Marginal Likelihood Item Factor Analysis by Adaptive Quadrature

Abstract

Keywords

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests