Skip Navigation
Funding Opportunities | Search Funded Research Grants and Contracts

IES Grant

Title: Multiple Imputation Procedures for Multilevel Data
Center: NCER Year: 2015
Principal Investigator: Enders, Craig Awardee: University of California, Los Angeles
Program: Statistical and Research Methodology in Education      [Program Details]
Award Period: 3 years (7/1/15–6/30/18) Award Amount: $840,129
Type: Methodological Innovation Award Number: R305D150056
Description:

Previous Award Number: R305D150006
Previous Awardee: Arizona State University

Co-Principal Investigator: Roy Levy

The primary goal of this project was to develop a user-friendly stand-alone software package that accommodates missing data in two- and three-level data structures. Missing data are a frequent occurrence in education research, and they pose a number of challenges for multilevel analyses that do not arise in single-level analyses. Although maximum likelihood estimation (or full information maximum likelihood, FIML) is the default estimation approach in most multilevel software packages, it is not optimal for handling missing data. Most multilevel modeling programs necessarily exclude cases with missing predictors, and cases with missing outcome scores are also excluded from cross-sectional analyses. This removal of cases is problematic when data are missing at level 2 (or higher) because entire clusters are excluded from the analysis.

In the first part of this study, the research team developed and evaluated a multiple imputation algorithm for two-level data structures, with the algorithm being able to accommodate nominal, ordinal, and interval/ratio variables at every level of the data hierarchy. The imputation model also allowed for random slopes between any pair of level-1 variables. For the second part, the research team extended the model and software to three-level data structures. For the third part of the study, the research team examined the behavior of test statistics on data that have undergone multiple imputation. Statistical significance tests are often used in complete-data multilevel analyses, and analogous procedures are available for multiply imputed data, but these tests were originally developed for single-level analyses and have not been systematically studied for use in multilevel analyses. All three parts of the study included Monte Carlo simulations for testing the performance of the imputation procedures and test statistics under various conditions. The research team disseminated the results of this project through peer-reviewed conference presentations and journal manuscripts. The research team held workshops on the techniques and software at major conferences and made the user-friendly software freely available online.

Project Website: https://www.appliedmissingdata.com/blimp

Related IES Projects: Model-based Multiple Imputation for Multilevel Data: Methodological Extensions and Software Enhancements (R305D190002); Dealing with Missing Data in Educational Research: Methodological Innovations and Contemporary Recommendations (R305D220001)

Publications

Journal articles

Enders, C.K. (2017). Multiple Imputation as a Flexible Tool for Missing Data Handling in Clinical Research. Behaviour Research and Therapy, 98, 4-18

Enders, C. K., Du, H., & Keller, B. T. (2020). A model-based imputation procedure for multilevel regression models with random coefficients, interaction effects, and nonlinear terms. Psychological Methods, 25(1), 88.

Enders, C. K., Keller, B. T., & Levy, R. (2018). A fully conditional specification approach to multilevel imputation of categorical and continuous variables. Psychological Methods, 23(2), 298.

Enders, C. K., Hayes, T., & Du, H. (2018). A comparison of multilevel imputation schemes for random coefficient models: fully conditional specification and joint model imputation with random covariance matrices. Multivariate Behavioral Research, 53(5), 695-713.

Juvonen, J., Lessard, L. M., Schacter, H. L., & Enders, C. (2019). The effects of middle school weight climate on youth with higher body weight. Journal of Research on Adolescence, 29(2), 466-479.

Keller, B. T. (2021). An introduction to factored regression models with Blimp. Psych, 4(1), 10-37. Keller, B. T., & Du, H. (2019). A Fully Conditional Specification Approach to Multilevel Multiple Imputation with Latent Cluster Means. Multivariate Behavioral Research, 54(1), 149-150.

Levy, R., & Enders, C. K. (2021). Full conditional distributions for Bayesian multilevel models with additive or interactive effects and missing data on covariates. Communications in Statistics-Simulation and Computation, 1-225.

Mansolf, M., Jorgensen, T. D., & Enders, C. K. (2020). A multiple imputation score test for model modification in structural equation models. Psychological Methods, 25(4), 393.

Mistler, S. A., & Enders, C. K. (2017). A comparison of joint model and fully conditional specification imputation for multilevel missing data. Journal of Educational and Behavioral Statistics, 42(4), 432-466.

Smith, D. S., Schacter, H. L., Enders, C., & Juvonen, J. (2018). Gender norm salience across middle schools: Contextual variations in associations between gender typicality and socioemotional distress. Journal of Youth and Adolescence, 47(5), 947-960.

Vera, J. D., & Enders, C. K. (2021). Is item imputation always better? An investigation of wave-missing data in growth models. Structural Equation Modeling: A Multidisciplinary Journal, 28(4), 506-517.


Back