Skip Navigation
Technical Methods Report: Estimation and Identification of the Complier Average Causal Effect Parameter in Education RCTs

NCEE 2009-4040
April 2009

Chapter 2: The Theoretical Framework Underlying the ITT Parameter

We consider two-level clustered designs where students are nested within units (such as schools, classrooms, or districts) that are randomly assigned to a single treatment or control group—the most common designs used in large-scale RCTs in the education field. The results that are presented for twolevel designs, however, can be collapsed to obtain results for nonclustered designs where students are the unit of random assignment. This is because nonclustered designs are a special case of clustered designs where every cluster has one student and there is no within-cluster variance.

We consider a "superpopulation" version of the Neyman-Rubin causal inference model (see Rubin 1974; Imbens and Rubin 2007; Schochet 2008). It is assumed that the sample contains n units (groups), with np treatment units and n(1-p) control units, where p is the sampling rate to the treatment group (0 <p<1). Let WTi be the "potential" unit-level outcome for unit i when assigned to the treatment condition and WCi be the potential outcome for unit i in the control condition. These potential outcomes are assumed to be random draws from potential treatment and control outcome distributions in the study population with means μT and μC, respectively, and common variance σB2. It is assumed that the potential outcomes for each unit are independent of the treatment status of other units.

Suppose next that mi students are sampled from the student superpopulation within study unit i. Let YTij and YCij be student-level potential outcomes (conditional on unit-level potential outcomes) that are random draws from potential outcome distributions with means WTi and WCi, respectively, and common variance σW2 >0.

Under this causal inference model, the difference between the two potential outcomes, (WTi -WCi), is the unit-level treatment effect for unit i, and the ITT (or average treatment effect) parameter is E(WTi -WCi) =μTC The unit-level treatment effects, and hence, the ITT parameter, cannot be calculated directly because for each unit and student, the potential outcome is observed in either the treatment or control condition, but not in both. Formally, if Ti is a treatment status indicator variable that equals 1 for treatments and 0 for controls, then the observed outcome for a unit, wi, can be expressed as follows:

(1) wi = TiWTi +(1-Ti) WCi.

Similarly, the observed outcome for a student yij is:

(2) yij = TiYTij +(1-Ti) YCi.

The simple equations in (1) and (2) form the basis for the estimation models that are considered in this report.

The terms in (1) can be rearranged to create the following regression model:

(3) yij0ITTTi +(ui +eij), where

1. α0 = μC and αITTTC (the ITT parameter) are coefficients to be estimated;

2. ui =Ti (WTiT) +(1-Ti) WCiC) is a unit-level error term with mean zero and between-unit variance σB2 that is uncorrelated with Ti; and

3. eij =Ti (YTij -WTi) +(1-Ti) YCij -WCi) is a student-level error term with mean zero and within-unit variance σW2 that is uncorrelated with ui and Ti.

Importantly, (3) can also be derived using the following two-level hierarchical linear model (HLM) (Bryk and Raudenbush 1992):

Level 1: yij =wi +eij

Level 2: wi =αi +αITT +ui

where Level 1 corresponds to students and Level 2 to units. Inserting the Level 2 equation into the Level 1 equation yields (3). Thus, the HLM approach is consistent with the causal inference theory presented above.

Finally, baseline covariates can be included in (3) as "irrelevant" variables to improve the precision of the impact estimates, which yields the following estimation model:

(4) yij0ITTTi +(Xij -Xi′γ +Zi′δ +(ui* +eij*

where Xij is a vector of student-level baseline covariates that is centered around the unit-level covariate mean Xi; γ is a parameter vector that is associated with Xij;Zi is a vector of unit-level baseline covariates (that could include Xi and stratum indicators) with associated parameter vector δ; and ui* and eij* are error terms that are now conditional on the covariates. We center the Xij covariates around the unit-level means so that we can separately identify the effects of covariates on the within- and betweenunit variance components.

Top