Abstract
In the design of common-item equating, two groups of examinees are administered separate test forms, and each test form contains a common subset of items. We consider test equating under this situation as an incomplete data problem—that is, examinees have observed scores on one test form and missing scores on the other. Through the use of statistical data-imputation techniques, the missing scores can be replaced by reasonable estimates, and consequently the forms may be directly equated as if both forms were administered to both groups. In this paper we discuss different data-imputation techniques that are useful for equipercentile equating; we also use empirical data to evaluate the accuracy of these techniques as compared with chained equipercentile equating.
Similar content being viewed by others
References
Allen, N. L., Holland, P. W., & Thayer, D. T. (1992, April).A missing data approach to estimating distributions of scores for optional test sections. Paper presented at the Annual Meeting of the National Council on Measurement in Education, San Francisco.
Angoff, W. H. (1984).Scales, norms, and equivalent scores. Princeton, NJ: Educational Testing Service. (Reprint of chapter in R. L. Thorndike (Ed.),Educational measurement (2nd ed., pp. 508–600), Washington, DC: American Council on Education, 1971)
Braun, H. I., & Holland, P. W. (1982). Observed-score test equating: A mathematical analysis of some ETS equating procedures. In P. W. Holland & D. B. Rubin (Eds),Test Equating. NY: Academic Press.
Carlin, J. B., & Rubin, D. B. (1991). Summarizing multiple-choice tests using three informative statistics.Psychological Bulletin, 110, 338–349.
Cheng, P. E. (1994). Nonparametric estimation of mean functionals with data missing at random.Journal of The American Statistical Association, 89, 81–87.
Dempster, A. P., Laird, N., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion).Journal of The Royal Statistical Society, Series B, 39, 1–38.
Hanson, B. (1991a). A comparison of bivariate smoothing methods in common-item equipercentile equating.Applied Psychological Measurement, 15, 391–408.
Hanson, B. (1991b).Method of moments estimates for the four-parameter beta compound binomial model and the calculation of classification consistency indexes (Research Report No. 91-5). IA: The American College Testing Program.
Hanson, B. A. (1993, April).A missing data approach to adjusting writing sample scores. Paper presented at the Annual Meeting of the National Council on Measurement in Education, Atlanta.
Holland, P. W., & Thayer, D. T. (1987).Notes on the use of log-linear models for fitting discrete probability distributions (Technical Report No. 87-79). Princeton, NJ: Educational Testing Service.
Holland, P. W., & Thayer, D. T. (1989).The kernel method of equating score distributions (Research Report 89-7). Princeton, NJ: Educational Testing Service.
Jarjoura, D., & Kolen, M. J. (1985). Standard errors of equipercentile equating for the common item nonequivalent populations design.Journal of Educational Statistics, 10, 143–160.
Kolen, M. J. (1991). Smoothing methods for estimating test score distributions.Journal of Educational Measurement, 28, 257–282.
Kolen, M. J., & Brennan, R. L. (1987). Linear quating models for the common-item nonequivalent populations design.Applied Psychological Measurement, 11, 263–277.
Little, R. A., & Rubin, D. B. (1987).Statistical Analysis with Missing Data. InWiley series in probability and mathematical statistics: Applied probability and statistics. New York: John Wiley & Sons.
Little, R. A., & Rubin, D. B. (1993).Test equating from biased samples. Unpublished lecture notes.
Livingston, S. A. (1993). Small-sample equating with log-linear smoothing.Journal of Educational Measurement, 30, 23–29.
Livingston, S. A., Dorans, N. J., & Wright, N. K. (1990). What combination of sampling and equating methods works best?Applied Measurement in Education, 3, 73–95.
Lord, F. (1965). A strong true-score theory, with applications.Psychometrika, 30, 239–270.
Rice, J. (1984). Bandwidth choice for nonparametric regression.Annals of Statistics, 12, 1215–1230.
Rosenbaum, P. R., & Thayer, D. (1987). Smoothing the joint and marginal distributions of scored two-way contingency tables in test equating.British Journal of Mathematical and Statistical Psychology, 40, 43–49.
Silverman, B. W. (1986).Density estimation for statistics and data analysis. London: Chapman and Hall.
Sundberg, R. (1974). Maximum likelihood theory for incomplete data from an exponential family.Scandinavian Journal of Statistics, 1, 49–58.
Titterington, D. M. (1977). Analysis of incomplete multivariate binary data by the kernel method.Biometrika, 64, 455–460.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Liou, M., Cheng, P.E. Equipercentile equating via data-imputation techniques. Psychometrika 60, 119–136 (1995). https://doi.org/10.1007/BF02294433
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02294433