Skip Navigation
Technical Methods Report: Do Typical RCTs of Education Interventions Have Sufficient Statistical Power for Linking Impacts on Teacher Practice and Student Achievement Outcomes?

NCEE 2009-4065
October 2009


Aaronson, D., L. Barrow, and W. Sander (2007). Teacher and Student Achievement in the Chicago Public High Schools. Journal of Labor Economics, vol. 25, no. 1, 95-136.

Baron, R. and D. Kenny (1986). The Moderator-Mediator Variable Distinction in Social Psychological Research: Conceptual, Strategic, and Statistical Considerations. Journal of Personality and Social Psychology, 51(6), 1173-1182.

Bloom, H., P. Zhu, and F. Unlu (2009). Finite Sample Bias from Instrumental Variables Analysis in Randomized Trials. MDRC Working Paper.

Bryk, A. and S. Raudenbush (1992). Hierarchical Linear Models: Applications and Data Analysis Methods. Newbury Park, CA: Sage.

Chiang (2009). Classroom and School-Level ICCs in Test Score Gains of Elementary School Students in Low Income Schools. Mathematica Policy Research Working Paper.

Cohen, J. (1977; 1988 2nd Edition). Statistical Power Analysis for the Behavioral Sciences. New York, NY: Academic Press.

Fuller, W (1987). Measurement Error Models. New York: John Wiley and Sons.

Gamse, B.C., Bloom, H.S., Kemple, J.J., Jacob, R.T., (2008). Reading First Impact Study: Interim Report. Washington, DC: National Center for Education Evaluation and Regional Assistance, Institute of Education Sciences, U.S. Department of Education.

Glazerman, S., S. Dolfin, M. Bleeker, A. Johnson, E. Isenberg, J. Lugo-Gil, M. Grider, E. Britton (2008). Impacts of Comprehensive Teacher Induction: Results from the First Year of a Randomized Controlled Study. Washington, DC: U.S. Department of Education, Institute of Education Sciences.

Goldhaber, D. (2002). The Mystery of Good Teaching: Surveying the Evidence on Student Achievement and Teachers’ Characteristics. Education Next, Vol. 2, No. 1, 50-55.

Greene, W. (2000). Econometric Analysis. Fourth Edition. Upper Saddle River, NJ: Prentice Hall.

Hanushek, E., J. Kain, D. O’Brien, and S. Rivkin (2005). The Market for Teacher Quality. National Bureau of Economic Research Working Paper 11154.

Hedges, L. and Hedberg, E. (2007). Intraclass Correlation Values for Planning Group Randomized Trials in Education. Educational Evaluation and Policy Analysis, 29, 60-87.

Holland, P.W. (1988). Causal Inference, Path Analysis, and Recursive Structural Equation Models. In C.C. Clogg (Ed.). Sociological Methodology (pp. 449-493). Washington DC: American Sociological Association.

Imbens, G. and J. Angrist (1994). Identification and Estimation of Local Average Treatment Effects, Econometrica, 62, 467–476.

Jacob, B. and L. Lefgren (2005). Principals as Agents: Subjective Performance Measurement in Education. National Bureau of Economic Research Working Paper 11463.

Jackson, R., A. McCoy, C. Pistorino, A. Wilkinson, J. Burghardt, M. Clark, C. Ross, P. Schochet, and P. Swank (May 2007). National Evaluation of Early Reading First: Report to Congress. Washington, DC: U.S. Department of Education, Institute of Education Sciences.

James-Burdumy, S. et al. (2009). Effectiveness of Selected Supplemental Reading Comprehension Interventions. Washington, DC: U.S. Institute of Education Sciences.

Kish, L. (1965). Survey Sampling. New York: John Wiley and Sons.

Kline, R (2005). Principles and Practice of Structural Equation Modeling. Guilford, NY: Guilford Press.

Kling, J., J. Liebman and L. Katz (2007). Experimental Analysis of Neighborhood Effects. Econometrica Vol. 75, No. 1, 83-119.

Kraemer, H. and S. Thiemann (1987). How Many Subjects? Newbury Park, CA: Sage.

MacCallum, R., M. Browne, and H. Sugawara (1996). Power Analysis and Determination of Sample Size for Covariance Structure Modeling. Psychological Methods 1(2), 130-149.

MacKinnon, D. and Dwyer, J. (1993). Estimating Mediated Effects in Prevention Studies. Evaluation Review, 17, 141-158.

McCaffrey, D., J. Lockwood, D. Koretz, T. Louis, and L. Hamilton (2004). Models for Value-Added Modeling of Teacher Effects. Journal of Educational and Behavioral Statistics, 29(1), 67-101.

Murray, M. (2006). Avoiding Invalid Instruments and Coping with Weak Instruments. Journal of Economic Perspectives, 20(4), 111-132.

Nye, B., S. Konstantopoulos, and L. Hedges (2004). How Large Are Teacher Effects? Educational Evaluation and Policy Analysis, vol. 26, 237-257.

Perez-Johnson, I., L. Campuzano, K. Fortson, C. Gentile, S. Amin, J. Burghardt, A. Schirm (2009). Designing a Study to Validate Measures of Effective Teachers and Classrooms. Mathematica Policy Research Working Paper.

Rao, C. (1973). Linear Statistical Inference and Its Applications. New York: Wiley and Sons.

Raudenbush, S., A. Martinez, H. Bloom, P. Zhu, and F. Lin (2008). The Reliability of Group-Level Measures and the Power of Group-Randomized Studies. University of Chicago Working Paper.

Raudenbush, S. (1997). Statistical Analysis and Optimal Design for Cluster Randomized Trials. Psychological Methods, 2, 173-185.

Rogers, W. and K. Hopkins (1988). Power Estimates in the Presence of a Covariate and Measurement Error. Educational and Psychological Measurement, Vol. 48, 647-656.

Rothstein, J. (2009). Teacher Quality in Education Production: Tracking, Decay, and Student Achievement. Princeton University Industrial Relations Section Working Paper.

Saginor, N. and P. Hyjek (2005). Observing Standards-Based Classrooms: The Vermont Classroom Observation Tool (VCOT). Montpelier, VT: Vermont Institutes.

Schochet, P. (2008). Statistical Power for Random Assignment Evaluations of Education Programs. Journal of Educational and Behavioral Statistics, 33(1), 62-87.

Sobel, M. (1982). Asymptotic Confidence Intervals for Indirect Effects in Structural Equations Models. In S. Leinhart (Ed.), Sociological Methodology (pp. 290-312). San Francisco: Jossey-Bass.

Sobel, M. (2008). Identification of Causal Parameters in Randomized Studies With Mediating Variables. Journal of Educational and Behavioral Statistics, 33(2), 230-251.

Stock, J., J. Wright, and M. Yogo (2002). A Survey of Weak Instruments and Weak Identification in Generalized Method of Moments. Journal of Business and Economic Statistics, 20(4), 518-529.

Wooldridge, J (2002). Econometric Analysis of Cross Section and Panel Data. MA: MIT Press.