|Title:||Development of Accessible Methodologies and Software in Hierarchical Models with Missing Data|
|Principal Investigator:||Raudenbush, Stephen||Awardee:||National Opinion Research Center (NORC)|
|Program:||Statistical and Research Methodology in Education [Program Details]|
|Award Period:||3 years||Award Amount:||$1,184,993|
|Type:||Methodological Innovation||Award Number:||R305D090022|
Co-Principal Investigator: Yongyin Shin
This project will integrate the appropriate modeling of multilevel data with rigorous methods for modeling with missing data. The project is to develop new methods that integrate these, to make currently available methods broadly accessible for the first time through user-friendly software, and to train educational researchers to use these methods and software.
In experimental research, the dominant design involves randomly assigned classrooms or schools to treatments. Therefore, the key explanatory variables are at the classroom or school levels while the outcome is measured at the individual level. In most cases, classrooms or schools are matched or blocked prior to randomization, so that the design will often have two or more levels of variation. The longitudinal follow-up of students generates an additional level. Hierarchical models, also known as multilevel models, are appropriate for the analysis of such data. Similarly, educational surveys involve multi-stage samples. Because of student mobility across classrooms, schools or school districts, the analysis may require a cross-classified hierarchical model.
Despite the advances in educational data analysis, a ubiquitous problem is that explanatory variables and outcomes are subject to be missing at any of the levels. Due to the lack of widely available methods for efficiently handling such missing data within the context of multilevel data and hierarchical models, the project aims to draw on the researchersí recent advances in developing: (1) methods for efficient analysis of two-level data, (2) a generally applicable approach for three-level data, (3) software to estimate the model and impute missing data, and (4) an efficient method for three-level data where the outcomes and covariates at any level are subject to be missing.
Specifically, the project will (1) test, validate, and disseminate free software for the case of two- or three-level continuous data with missing values at any level; (2) develop, test, and refine new methods for cross-classified models and discrete outcomes; and (3) run a series of workshops to train researchers to use these methods.
Related IES Projects: Accessible Methodology and User-Friendly Software for Multivariate Hierarchical Models Given Incomplete Data (R305D130033)
Shin, Y. (2013). Efficient Handling of Predictors and Outcomes Having Missing Values. In L. Rutkowski, M. VonDavier, and D. Rutkowski (Eds.), A Handbook of International Large-Scale Assessment Data Analysis (pp. 451–479). Boca Raton, FL: CRC Press.
Journal article, monograph, or newsletter
Shin, Y. (2012). Do Black Children Benefit More From Small Classes? Multivariate Instrumental Variable Estimators With Ignorable Missing Data. Journal of Educational Behavioral Statistics, 37(4): 543–574.
Shin, Y. and Raudenbush, S.W. (2010). A Latent Cluster-Mean Approach to the Contextual Effects Model With Missing Data. Journal of Educational and Behavioral Statistics, 35(1): 26–53.
Shin, Y., and Raudenbush, S.W. (2011). The Causal Effect of Class Size on Academic Performance: Multivariate Instrumental Variable Estimators With Tennessee Class Size Data Missing at Random. Journal of Educational and Behavioral Statistics, 36(2): 154–185.