Skip Navigation
The Evaluation of Enhanced Academic Instruction in After-School Programs
NCEE 2009-4077
September 2009

Figure ES.3.  SAT 10 Total Reading Scores from Baseline to Follow-Up and the Associated Impact of the Enhanced Reading Program After One Year and Two Years of Service


SOURCES: MDRC calculations are from baseline and follow-up results on the Stanford Achievement Test Series, 10th ed. (SAT 10) abbreviated battery. National norming sample calculations are from the SAT 10 (2002 norming sample): Stanford Achievement Test Series: Tenth Edition: Technical Data Report (Harcourt Assessment, 2004, pp. 312-338).
NOTES: The growth line for the enhanced program group is based on the observed mean baseline and follow-up test scores of students assigned to the enhanced after-school program for two consecutive years (baseline is Fall 2005; follow-ups are Spring 2006 and Spring 2007). The growth line for the regular program group represents the test scores that students in the enhanced program group would have obtained had they not been assigned to the enhanced program (calculated as the mean test score for the enhanced program group minus the estimated impact at a given time point). The growth line for the national norming sample is based on the average SAT 10 total reading scores for a nationally representative sample of students with the same grade composition in each period as the two-year sample. Specifically, at each point in time (the fall baseline, the first spring, and the second spring), the SAT 10 national norm scores for second-, third-, and fourth-graders are averaged weighting each grade average score according to their proportion in the two-year study sample at baseline. This creates an expected two-year improvement of nationally representative students at the same grade levels as this study’s sample. The baseline for the national norming sample is set relative to the average baseline score of the enhanced program group. Estimated impacts on follow-up results are regression-adjusted using ordinary least squares, controlling for indicators of random assignment, baseline reading total scaled score, race/ethnicity, gender, free-lunch status, age, overage for grade, single-adult household, and mother's education. Among those who did not reapply to the study in the second year, nonresponse weights are used to account for those students for whom follow-up data was not collected. Statistical significance is indicated by (*) when the p-value is less than or equal to 5 percent.