Hostname: page-component-745bb68f8f-v2bm5 Total loading time: 0 Render date: 2025-01-07T18:34:35.487Z Has data issue: false hasContentIssue false

Equipercentile Equating Via Data-Imputation Techniques

Published online by Cambridge University Press:  01 January 2025

Michelle Liou*
Affiliation:
Academia Sinica
Philip E. Cheng
Affiliation:
Academia Sinica
*
Requests for reprints should be sent to Michelle Liou, Educational Testing Service, LARG—Mail Stop 02-T, Rosedale Road, Princeton, NJ 08541.

Abstract

In the design of common-item equating, two groups of examinees are administered separate test forms, and each test form contains a common subset of items. We consider test equating under this situation as an incomplete data problem—that is, examinees have observed scores on one test form and missing scores on the other. Through the use of statistical data-imputation techniques, the missing scores can be replaced by reasonable estimates, and consequently the forms may be directly equated as if both forms were administered to both groups. In this paper we discuss different data-imputation techniques that are useful for equipercentile equating; we also use empirical data to evaluate the accuracy of these techniques as compared with chained equipercentile equating.

Type
Original Paper
Copyright
Copyright © 1995 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

A paper presented at the European Meeting of the Psychometric Society, Barcelona, Spain, July, 1993.

We would like to thank Heng-Syung Jeng and the College Entrance Examination Center, ROC, for providing the empirical data used in this research, to Chang-Yung Yu for writing the programs, and to the Editor and referees for their valuable comments on drafts of our manuscript.

References

Allen, N. L., Holland, P. W., & Thayer, D. T. (1992, April). A missing data approach to estimating distributions of scores for optional test sections. Paper presented at the Annual Meeting of the National Council on Measurement in Education, San Francisco.Google Scholar
Angoff, W. H. (1984). Scales, norms, and equivalent scores, Princeton, NJ: Educational Testing Service.Google Scholar
Braun, H. I., Holland, P. W. (1982). Observed-score test equating: A mathematical analysis of some ETS equating procedures. In Holland, P. W., Rubin, D. B. (Eds.), Test Equating, NY: Academic Press.Google Scholar
Carlin, J. B., Rubin, D. B. (1991). Summarizing multiple-choice tests using three informative statistics. Psychological Bulletin, 110, 338349.CrossRefGoogle Scholar
Cheng, P. E. (1994). Nonparametric estimation of mean functionals with data missing at random. Journal of The American Statistical Association, 89, 8187.CrossRefGoogle Scholar
Dempster, A. P., Laird, N., Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of The Royal Statistical Society, Series B, 39, 138.CrossRefGoogle Scholar
Hanson, B. (1991). A comparison of bivariate smoothing methods in common-item equipercentile equating. Applied Psychological Measurement, 15, 391408.CrossRefGoogle Scholar
Hanson, B. (1991). Method of moments estimates for the four-parameter beta compound binomial model and the calculation of classification consistency indexes, IA: The American College Testing Program.Google Scholar
Hanson, B. A. (1993, April). A missing data approach to adjusting writing sample scores. Paper presented at the Annual Meeting of the National Council on Measurement in Education, Atlanta.Google Scholar
Holland, P. W., Thayer, D. T. (1987). Notes on the use of log-linear models for fitting discrete probability distributions, Princeton, NJ: Educational Testing Service.CrossRefGoogle Scholar
Holland, P. W., Thayer, D. T. (1989). The kernel method of equating score distributions, Princeton, NJ: Educational Testing Service.CrossRefGoogle Scholar
Jarjoura, D., Kolen, M. J. (1985). Standard errors of equipercentile equating for the common item nonequivalent populations design. Journal of Educational Statistics, 10, 143160.CrossRefGoogle Scholar
Kolen, M. J. (1991). Smoothing methods for estimating test score distributions. Journal of Educational Measurement, 28, 257282.CrossRefGoogle Scholar
Kolen, M. J., Brennan, R. L. (1987). Linear quating models for the common-item nonequivalent populations design. Applied Psychological Measurement, 11, 263277.CrossRefGoogle Scholar
Little, R. A., Rubin, D. B. (1987). Statistical Analysis with Missing Data. Wiley series in probability and mathematical statistics: Applied probability and statistics, New York: John Wiley & Sons.Google Scholar
Little, R. A., & Rubin, D. B. (1993). Test equating from biased samples. Unpublished lecture notes.Google Scholar
Livingston, S. A. (1993). Small-sample equating with log-linear smoothing. Journal of Educational Measurement, 30, 2329.CrossRefGoogle Scholar
Livingston, S. A., Dorans, N. J., Wright, N. K. (1990). What combination of sampling and equating methods works best?. Applied Measurement in Education, 3, 7395.CrossRefGoogle Scholar
Lord, F. (1965). A strong true-score theory, with applications. Psychometrika, 30, 239270.CrossRefGoogle Scholar
Rice, J. (1984). Bandwidth choice for nonparametric regression. Annals of Statistics, 12, 12151230.CrossRefGoogle Scholar
Rosenbaum, P. R., Thayer, D. (1987). Smoothing the joint and marginal distributions of scored two-way contingency tables in test equating. British Journal of Mathematical and Statistical Psychology, 40, 4349.CrossRefGoogle Scholar
Silverman, B. W. (1986). Density estimation for statistics and data analysis, London: Chapman and Hall.Google Scholar
Sundberg, R. (1974). Maximum likelihood theory for incomplete data from an exponential family. Scandinavian Journal of Statistics, 1, 4958.Google Scholar
Titterington, D. M. (1977). Analysis of incomplete multivariate binary data by the kernel method. Biometrika, 64, 455460.CrossRefGoogle Scholar