Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis

Yunxiao Chen; Xiaoou Li; Siliang Zhang

doi:10.1007/s11336-018-9646-5

Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis

Published online by Cambridge University Press: 01 January 2025

Yunxiao Chen

Xiaoou Li and

Siliang Zhang

Show author details

Yunxiao Chen*: Affiliation:
London School of Economics and Political Science
Xiaoou Li: Affiliation:
University of Minnesota
Siliang Zhang: Affiliation:
Fudan University
*: Correspondence should be made toYunxiao Chen, London School of Economics and Political Science, London, UK. Email: [email protected]

Article contents

Abstract
Footnotes
References

Get access

Rights & Permissions

Abstract

Joint maximum likelihood (JML) estimation is one of the earliest approaches to fitting item response theory (IRT) models. This procedure treats both the item and person parameters as unknown but fixed model parameters and estimates them simultaneously by solving an optimization problem. However, the JML estimator is known to be asymptotically inconsistent for many IRT models, when the sample size goes to infinity and the number of items keeps fixed. Consequently, in the psychometrics literature, this estimator is less preferred to the marginal maximum likelihood (MML) estimator. In this paper, we re-investigate the JML estimator for high-dimensional exploratory item factor analysis, from both statistical and computational perspectives. In particular, we establish a notion of statistical consistency for a constrained JML estimator, under an asymptotic setting that both the numbers of items and people grow to infinity and that many responses may be missing. A parallel computing algorithm is proposed for this estimator that can scale to very large datasets. Via simulation studies, we show that when the dimensionality is high, the proposed estimator yields similar or even better results than those from the MML estimator, but can be obtained computationally much more efficiently. An illustrative real data example is provided based on the revised version of Eysenck’s Personality Questionnaire (EPQ-R).

Keywords

joint maximum likelihood estimator item response theory IRT high-dimensional data alternating minimization projected gradient descent personality assessment

Type: Original Paper
Information: Psychometrika , Volume 84 , Issue 1 , 15 March 2019 , pp. 124 - 146

DOI: https://doi.org/10.1007/s11336-018-9646-5 [Opens in a new window]
Copyright: Copyright © 2018 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

Footnotes

Electronic supplementary material The online version of this article (https://doi.org/10.1007/s11336-018-9646-5) contains supplementary material, which is available to authorized users.

References

Andersen, E. B. (1973). Conditional inference and models for measuring. Copenhagen, Denmark: Mentalhygiejnisk Forlag. Google Scholar

Baker, F. B. (1987). Methodology review: Item parameter estimation under the one-, two-, and three-parameter logistic models. Applied Psychological Measurement, 11, (2) 111– 141. CrossRef Google Scholar

Bartholomew, D. J., Moustaki, I, Galbraith, J, & Steele, F (2008). Analysis of multivariate social science data. Boca Raton, FL: CRC Press. CrossRef Google Scholar

Béguin, A. A., & Glas, C. A. (2001). MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika 66, (4) 541– 561. CrossRef Google Scholar

Birnbaum, A, Lord, F. M., & Novick, M. R. (1968). Some latent trait models and their use in inferring an examinee’s ability. Statistical Theories of Mental Test Scores, Reading, MA: Addison-Wesley. Google Scholar

Bock, R. D., & Aitkin, M (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46 (4), 443– 459. CrossRef Google Scholar

Bock, R. D., Gibbons, R, & Muraki, E (1988). Full-information item factor analysis. Applied Psychological Measurement, 12 (3), 261– 280. CrossRef Google Scholar

Bolt, D. M., & Lall, V. F. (2003). Estimation of compensatory and noncompensatory multidimensional item response models using Markov chain Monte Carlo. Applied Psychological Measurement, 27 (6), 395– 414. CrossRef Google Scholar

Browne, M. W. (2001). An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research, 36 (1), 111– 150. CrossRef Google Scholar

Cai, L (2010a). High-dimensional exploratory item factor analysis by a Metropolis–Hastings Robbins–Monro algorithm. Psychometrika, 75 (1), 33– 57. CrossRef Google Scholar

Cai, L (2010,). Metropolis–Hastings Robbins–Monro algorithm for confirmatory item factor analysis. Journal of Educational and Behavioral Statistics, 35 (3), 307– 335. CrossRef Google Scholar

Cai, T, & Zhou, W. -X (2013). A max-norm constrained minimization approach to 1-bit matrix completion. The Journal of Machine Learning Research, 14 (1), 3619– 3647. Google Scholar

Chalmers, R. P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48 (6), 1– 29. CrossRef Google Scholar

Chiu, C. -Y., Köhn, H. -F., Zheng, Y, & Henson, R (2016). Joint maximum likelihood estimation for diagnostic classification models. Psychometrika, 81 (4), 1069– 1092. CrossRef Google Scholar PubMed

Dagum, L., & Menon, R. (1998). OpenMP: An industry standard API for shared-memory programming. Computational Science & Engineering, IEEE, 5 (1), 46– 55. CrossRef Google Scholar

Davenport, M. A., Plan, Y., van den Berg, E., & Wootters, M. (2014). 1-bit matrix completion. Information and Inference, 3, (3) 189– 223. CrossRef Google Scholar

Edelen, M. O., & Reeve, B. B. (2007). Applying item response theory (IRT) modeling to questionnaire development, evaluation, and refinement. Quality of Life Research, 16 (1), 5– 18. CrossRef Google Scholar PubMed

Edwards, M. C. (2010). A Markov chain Monte Carlo approach to confirmatory item factor analysis. Psychometrika, 75 (3), 474– 497. CrossRef Google Scholar

Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists, Mahwah, NJ: Lawrence Erlbaum Associates Publishers. Google Scholar

Eysenck, S. B., Eysenck, H. J., & Barrett, P (1985). A revised version of the psychoticism scale. Personality and Individual Differences, 6 (1), 21– 29. CrossRef Google Scholar

Ghosh, M (1995). Inconsistent maximum likelihood estimators for the Rasch model. Statistics & Probability Letters, 23 (2), 165– 170. CrossRef Google Scholar

Haberman, S. J. (1977). Maximum likelihood estimates in exponential response models. The Annals of Statistics, 5 (5), 815– 841. CrossRef Google Scholar

Haberman, S. J. (2004). Joint and conditional maximum likelihood estimation for the Rasch model for binary responses. ETS Research Report Series RR-04-20.CrossRef Google Scholar

Jöreskog, K. G., & Moustaki, I (2001). Factor analysis of ordinal variables: A comparison of three approaches. Multivariate Behavioral Research, 36, (3), 347– 387. CrossRef Google Scholar PubMed

Lee, K, Ashton, M. C., & Robins, R. W., Fraley, R. C., & Krueger, R. F. (2009). Factor analysis in personality research. Handbook of Research Methods in Personality Psychology, New York, NY: Guilford Press. Google Scholar

Lee, S-Y, Poon, W-Y, & Bentler, P. M. (1990). A three-stage estimation procedure for structural equation models with polytomous variables. Psychometrika, 55, (1), 45– 51. CrossRef Google Scholar

Lord, F. M. (1980). Applications of item response theory to practical testing problems, Mahwah, NJ: Routledge. Google Scholar

Meng, X-L, & Schilling, S (1996). Fitting full-information item factor models and an empirical investigation of bridge sampling. Journal of the American Statistical Association, 91, (435), 1254– 1267. CrossRef Google Scholar

Mislevy, R. J. & Stocking, M. L. (1987). A consumer’s guide to LOGIST and BILOG. ETS Research Report Series RR-87-43.CrossRef Google Scholar

Neyman, J, & Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrica, 16, (1), 1– 32. CrossRef Google Scholar

Parikh, N., & Boyd, S. (2014). Proximal algorithms. Foundations and Trends. Optimization, 1(3), 127–239.CrossRef Google Scholar

Reckase, M (2009). Multidimensional item response theory, New York, NY: Springer. CrossRef Google Scholar

Reckase, M. D. (1972). Development and application of a multivariate logistic latent trait model. Ph.D. thesis, Syracuse University, Syracuse NY.Google Scholar

Reise, S. P., & Waller, N. G. (2009). Item response theory and clinical measurement. Annual Review of Clinical Psychology, 5, 27– 48. CrossRef Google Scholar PubMed

Schilling, S, & Bock, R. D. (2005). High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature. Psychometrika, 70, (3), 533– 555. CrossRef Google Scholar

Sun, J, Chen, Y, Liu, J, Ying, Z, & Xin, T Latent variable selection for multidimensional item response theory models via

L_{1}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_1$$\end{document}

regularization. (2016). Psychometrika, 81, (4), 921– 939. CrossRef Google Scholar

von Davier, A (2010). Statistical models for test equating, scaling, and linking, New York, NY: Springer. Google Scholar

Wirth, R, & Edwards, M. C. (2007). Item factor analysis: Current approaches and future directions. Psychological Methods, 12, (1), 58– 79. CrossRef Google Scholar PubMed

Yao, L, & Schwarz, R. D. (2006). A multidimensional partial credit model with associated item and test statistics: An application to mixed-format tests. Applied Psychological Measurement, 30, (6), 469– 492. CrossRef Google Scholar

Yates, A (1988). Multivariate exploratory data analysis: A perspective on exploratory factor analysis, Albany, NY: State University of New York Press. Google Scholar

Chen et al. supplementary material

Supplement to “Joint Maximum Likelihood Estimation for High-dimensional Exploratory Item Factor Analysis”

File 238.2 KB

Article contents

Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis

Abstract

Keywords

Access options

Article purchase

Temporarily unavailable

Footnotes

References

Chen et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests