Technical Methods Report: Estimation and Identification of the Complier Average Causal Effect Parameter in Education RCTs - Chapter 3: ITT Impact and Variance Estimators

Technical Methods Report: Estimation and Identification of the Complier Average Causal Effect Parameter in Education RCTs

NCEE 2009-4040
April 2009

Chapter 1. Introduction
Chapter 2: The Theoretical Framework Underlying the ITT Parameter
Chapter 3: ITT Impact and Variance Estimators
Chapter 4: The CACE Parameter
Chapter 5: The Standardized ITT and CACE Estimators
Chapter 6: Empirical Analysis
Chapter 7: Summary and Conclusions
References
Tables
Appendix A: Proof of Equation (29) (31 KB)
Appendix B: Summary of Data Sources
PDF & Related Info

Chapter 3: ITT Impact and Variance Estimators

In this chapter, we use the models in (3) and (4) to discuss ITT estimators in nominal units, because they form the foundation for the CACE and standardized estimators. We focus on commonly used differencesin- means and analysis of covariance (ANCOVA) estimators, which are used for the empirical analysis.

We make the simplifying assumption that m_i =m for all units (that is, equal cluster sizes). Cluster sizes are often similar for RCTs in the education area (and for the RCTs examined in our empirical work), and variance formulas are much more complex with unequal cluster sizes. Furthermore, the formulas presented in this chapter apply approximately for unequal unit sizes that do not vary substantially across units if m is replaced in the formulas by the average unit size m (Kish 1965) or, preferably, by [n/Σ(1/m_i)] (Hedges 2007).

The Simple Differences-In-Means Estimator
The simple differences-in-means ITT estimator α̂_ITT1 can be obtained by applying standard regression methods to (3). The resulting estimator is as follows:

estimator equation

where estimator equation This estimator is the average difference between cluster means across the treatment and control groups.

Schochet (2008) shows that α̂_ITT1 is asymptotically normally distributed with mean α_ITT and the following asymptotic variance:

asymptotic variange

The within-unit (second) variance term in (6) is the conventional variance expression for an impact estimator in a nonclustered design where random assignment is conducted within units. Design effects in a clustered design arise because of the first between-unit variance term, which represents the extent to which mean outcomes vary across units (Murray 1998; Donner and Klar 2000).

An asymptotically unbiased estimator for the within-unit variance σ_W² is as follows (Cochran 1963; Hedges 2007):

asymptotically unbiased estimator for the within-unit variance

Similarly, an asymptotically unbiased estimator for the between-unit variance σ_B² is:

asymptotically unbiased estimator for the between-unit variance

Note that equation (9) can also be expressed in terms of regression residual sums of squares:

regression residual sums of squares

where ŷ_i is the predicted value for unit i from the between-unit regression of y_i on T_i and an intercept. Inserting (7) and (8) into (6) yields the following variance estimator for α̂_ITT1:

variance estimator

This estimator also applies to nonclustered designs where units are defined as students.

The Analysis of Covariance (ANCOVA) Estimator
The ANCOVA estimator α̂_ITT2 can be obtained by applying regression methods to (4) where baseline covariates (such as pretests) are included in the analytic models, primarily to improve the precision of the impact estimates. Schochet (2008) shows that α̂_ITT2 is asymptotically normally distributed with mean α_ITT and the following asymptotic variance:

asymptotic variance

In this expression, σ_B1² and σ_W1² are between- and within-unit variances, respectively, that are conditional on the covariates, and reduce σ_B² and σ_W² depending on the size of the outcome-covariate correlations in the joint superpopulation distributions (these are R² adjustments).

Using methods that are parallel to the simple differences-in-means estimator presented above, a consistent variance estimator for α̂_ITT2 in (12) is as follows:

consistent variance estimator

where S_B1² is obtained using (10) with the following changes: (1) ŷ_i is now the predicted value for unit i from the between-unit regression of y_i on Q_i =[1 T_i Z_i]; and (2) (n -2) is replaced by (n -k) where k is the rank of the matrix Q whose rows contain the Q_is. In practice, T_i and Z_i may be weakly correlated due to random sampling and missing data. Thus, (13) can be refined as follows:

refined consistent variance estimator

Finally, in our empirical work, we also used STATA to estimate more efficient generalized least squares models that allowed for unequal cluster sample sizes. Specifically, we used generalized estimating equation (GEE) methods with the sandwich variance estimator (Liang and Zeger 1986), and full and restricted maximum likelihood approaches to general linear mixed models (Littell et al. 1996; Bryk and Raudenbush 1992). The empirical results using these methods are very similar to those that are presented in this report, and thus, are not reported.

Top