Skip Navigation

What Works Clearinghouse


Appendices


Appendix A1.1 Study characteristics: Hancock, 2002 (randomized controlled trial)

Characteristic Description
Study citation Hancock, C. M. (2002). Accelerating reading trajectories: The effects of dynamic research-based instruction. Dissertation Abstracts International, 63(06), 2139A. (UMI No. 3055690)
Participants The study involved 94 second-grade students who attended a single school. Out of this group, 48 students received the intervention and 46 were in the comparison group. The students were randomly assigned to intervention and comparison groups using block randomization procedures. All students in the second grade were administered several initial measures. Student scores were rank-ordered within each classroom, and then each student was matched with a similarly performing student. Students were then randomly assigned to the intervention group and the comparison group within matched pairs. No information was reported regarding student ethnicity or gender, but 11% of the students in this school qualified for free or reduced-price lunch. There was no attrition.
Setting The study took place in one elementary school in the Kyrene school district in Tempe, Arizona.
Intervention In additional to the regular curriculum (including reading instruction), the intervention group received 25 minutes of supplemental instruction using Read Naturally four times a week for 11 weeks. In each lesson, the first five minutes were spent on oral reading of a selected passage with a teaching assistant. The reading was timed for one minute, and the total number of words read correctly was recorded on a graph. The last 20 minutes involved repeated oral reading of curriculum stories either individually or with a cassette tape. Once students practiced a passage eight times (three times with a cassette and five times individually), they did a timed reading with the teacher. If the student achieved mastery (100 words read correctly with three or fewer errors), the student moved on to another passage; otherwise, the cycle was repeated. The procedures used in this study excluded Read Naturally’s pre-reading vocabulary instruction component and the Read Naturally placement system to individualize instruction.
Comparison In additional to their regular curriculum (including reading instruction), the comparison group students received supplemental instruction using Connecting Math Concepts curriculum (Level B). This program used worksheets, workbooks, coins, and games, and taught basic mathematics skills such as place value, money counting, time, addition, subtraction, and multiplication.
Primary outcomes and measurement The author used the Peabody Picture Vocabulary Test: Third Edition (PPVT-III), the Word Use Fluency test, and the Curriculum Based Measure: Cloze Probe and Test of Reading Fluency. The author used initial reading skills as a covariate to account for baseline differences between groups. For a more detailed description of these outcome measures, see Appendices A2.12.2.
Staff/teacher training Six teaching assistants were trained over five days. Teaching assistants were observed modeling lessons during the training sessions and provided with written feedback. Teaching assistants were also observed once a week during the first phase, and at least once every three weeks during the second phase, receiving feedback as necessary.

Top

Appendix A1.2 Study characteristics: Mesa, 2004 (quasi-experimental design)

Characteristic Description
Study citation Mesa, C. L. (2004). Effect of Read Naturally software on reading fluency and comprehension. Unpublished master’s thesis, Piedmont College, Demorest, GA.
Participants Twelve students from a single class were selected to participate because they had mastered certain decoding patterns. These students were matched into pairs based on their pre-intervention test scores (STAR Reading Test); one student was assigned to the intervention group and one to the comparison group.1
Setting The study took place in one elementary school in Georgia.
Intervention Students in the group left their regular class for Read Naturally (2001) computer instruction for 45 minutes a day, four days a week, for three weeks. Students used the program independently unless they had a question or were attempting to pass a level, in which case they interacted with the teacher. The Read Naturally group worked with minimal teacher’s supervision.
Comparison The comparison group did not receive any special instruction and remained in the class with the regular classroom teacher.
Primary outcomes and measurement The author administered the Oral Reading Fluency test. Two other outcomes, the STAR Reading Test and the Comprehension Reading Test, were also used in the study, but have not been included in this review because sufficient information was not provided to evaluate face validity and reliability of these tests. For a more detailed description of these outcome measures, see Appendices A2.12.2.
Staff/teacher training No information on teacher training is provided.
1 The pretest equivalency of the two groups on the Oral Reading Fluency measure was verified by the WWC.

Top

Appendix A2.1 Outcome measures in the fluency domain

Outcome measure Description
Oral Reading Fluency The test measures the number of words read per minute minus any errors. The passage was a 113-word passage (as cited in Mesa, 2004).
Curriculum Based Measurement: Test of Reading Fluency Students were given passages from Level B of the Test of Reading Fluency, which are based on several published curricula and are designed to represent general grade-level reading material. The total number of words read correctly was recorded (as cited in Hancock, 2002).

Top

Appendix A2.2 Outcome measures in the READING comprehension domain

Outcome measure Description
Vocabulary
Peabody Picture Vocabulary Test: Third Edition (PPVT-III) A standardized, receptive vocabulary test that asks students to choose which one of four pictures corresponds to a test word spoken aloud (as cited in Hancock, 2002).
Word Use Fluency The Word Use Fluency test measured students’ expressive language skills. The tester verbally presented words to the student, who was asked to use the words in a sentence. Words were presented one at a time, and the next word was presented once a response was given. The task lasted one minute, and the total correct number of responses was provided (as cited in Hancock, 2002).
Reading comprehension
Curriculum Based Measurement: Cloze Probe Students read passages of text and fill in key missing words from three choices (as cited in Hancock, 2002).

Top

Appendix A3.1 Summary of study findings included in the rating for the fluency domain1

  Authors' findings
from the study
 
  Mean outcome
(standard deviation)2
WWC calculations
Outcome measure Study sample Sample size (students) Read Naturally group Comparison group Mean difference3
(Read Naturally – comparison)
Effect size4 Statistical significance5
(at α = 0.05)
Improvement index6
Hancock, 2002 (randomized controlled trial)7
CBM: Test of Reading Fluency Second grade 94 117.38
(30.73)
112.38
(30.52)
5.00 0.16 ns +6
Average for fluency domain (Hancock, 2002)8 0.16 ns +6
Mesa, 2004 (quasi-experimental design)7
Oral Reading Fluency9 First grade 12 80.00
(20.66)
74.33
(25.56)
5.67 0.23 ns +9
Average for fluency domain (Mesa, 2004)8 0.23 ns +9
Domain average for fluency across all studies8 0.19 na +8

ns = not statistically significant
na = not applicable
CBM = Curriculum Based Measurement

1 This appendix reports findings considered for the effectiveness rating and the average improvement indices for the fluency domain.
2 The standard deviation across all students in each group shows how dispersed the participants’ outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes.
3 Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group.
4 For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations.
5 Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups.
6 The improvement index represents the difference between the percentile rank of the average student in the intervention condition and that of the average student in the comparison condition. The improvement index can take on values between –50 and +50, with positive numbers denoting results favorable to the intervention group.
7 The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools and for multiple comparisons. For an explanation about the clustering correction, see the WWC Tutorial on Mismatch. For the formulas the WWC used to calculate statistical significance, see Technical Details of WWC-Conducted Computations. In the cases of Hancock (2002) and Mesa (2004), no corrections for clustering or multiple comparisons were needed.
8 The WWC-computed average effect sizes for each study and for the domain across studies are simple averages rounded to two decimal places. The average improvement indices are calculated from the average effect size.
9 The Read Naturally group mean equals the comparison group mean plus the mean difference. The computation of the mean difference took into account the pretest difference between the study groups.

Top

Appendix A3.2 Summary of study findings included in the rating for the comprehension domain1

  Authors' findings
from the study
 
  Mean outcome
(standard deviation)2
WWC calculations
Outcome measure Study sample Sample size (students) Read Naturally group3 Comparison group Mean difference4
(Read Naturally – comparison)
Effect size5 Statistical significance6
(at α = 0.05)
Improvement index7
Hancock, 2002 (randomized controlled trial)8
Construct: Vocabulary development
PPVT-III Second grade 94 118.11
(16.14)
117.79
(17.50)
0.32 0.02 ns +1
Word Use Fluency Second grade 94 53.10
(12.07)
50.42
(12.20)
2.68 0.22 ns +9
Construct: Reading comprehension
CBM: Cloze Probe Second grade 94 22.70
(8.66)
23.37
(7.18)
–0.67 –0.08 ns –3
Domain average for comprehension (Hancock, 2002)9 0.05 na +2

ns = not statistically significant
na = not applicable
PPVT-III = Peabody Picture Vocabulary Test: Third Edition
CBM = Curriculum Based Measurement

1 This appendix reports findings considered for the effectiveness rating and the average improvement indices for the comprehension domain.
2 The standard deviation across all students in each group shows how dispersed the participants' outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes.
3 Means are adjusted for pretest. The authors used initial reading skills as a covariant.
4 Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group.
5 For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations.
6 Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups.
7 The improvement index represents the difference between the percentile rank of the average student in the intervention condition and that of the average student in the comparison condition. The improvement index can take on values between –50 and +50, with positive numbers denoting results favorable to the intervention group.
8 The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools and for multiple comparisons. For an explanation about the clustering correction, see the WWC Tutorial on Mismatch. For the formulas the WWC used to calculate statistical significance, see Technical Details of WWC-Conducted Computations. In the case of Hancock (2002), no corrections for clustering or multiple comparisons were needed.
9 The WWC-computed average effect sizes for each study and for the domain across studies are simple averages rounded to two decimal places. The average improvement indices are calculated from the average effect size. For a single study included in the comprehension domain, the study average is equal to the domain average.

Top

Appendix A4.1 Read Naturally rating for the fluency domain

The WWC rates an intervention's effects in a given outcome domain as positive, potentially positive, mixed, no discernible effects, potentially negative, or negative.1

For the outcome domain of fluency, the WWC rated Read Naturally as having no discernible effects. The remaining ratings (potentially negative effects, negative effects) were not considered, as Read Naturally was assigned the highest applicable rating.

Rating received

No discernible effects: No affirmative evidence of effects.

  • Criterion 1: None of the studies shows a statistically significant or substantively important effect, either positive or negative.

    Met. No studies showed a statistically significant or substantively important effect, either positive or negative.

Other ratings considered

Positive effects: Strong evidence of a positive effect with no overriding contrary evidence.

  • Criterion 1: Two or more studies showing statistically significant positive effects, at least one of which met WWC evidence standards for a strong design.

    Not met. Only one study met the WWC evidence standards for a strong design, and that study did not show statistically significant or substantively important positive effects.

    AND

  • Criterion 2: No studies showing statistically significant or substantively important negative effects.

    Met. No studies showed statistically significant or substantively important negative effects.

Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence.

  • Criterion 1: At least one study showing a statistically significant or substantively important positive effect.

    Not met. No studies showed a statistically significant or substantively important positive effect.

    AND

  • Criterion 2: No studies showing a statistically significant or substantively important negative effect, and fewer or the same number of studies showing indeterminate effects than showing statistically significant or substantively important positive effects.

    Not met. No studies showed a statistically significant or substantively important negative effect, but one study showed indeterminate effects.

Mixed effects: Evidence of inconsistent effects as demonstrated through either of the following criteria.

  • Criterion 1: At least one study showing a statistically significant or substantively important positive effect, and at least one study showing a statistically significant or substantively important negative effect, but no more such studies than the number showing a statistically significant or substantively important positive effect.

    Not met. No studies showed a statistically significant or substantively important effect, either positive or negative.

    OR

  • Criterion 2: At least one study showing a statistically significant or substantively important effect, and more studies showing an indeterminate effect than showing a statistically significant or substantively important effect.

    Not met. No studies showed a statistically significant or substantively important effect, while one study showed indeterminate effects.

1 For rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of potentially positive or potentially negative effects. For a complete description, see the WWC Intervention Rating Scheme.

Top

Appendix A4.2 Read Naturally rating for the comprehension domain

The WWC rates anintervention's effects in a given outcome domain as positive, potentially positive, mixed, no discernible effects, potentially negative, or negative.1

For the outcome domain of comprehension, the WWC rated Read Naturally as having no discernible effects. The remaining ratings (potentially negative effects, negative effects) were not considered, as Read Naturally was assigned the highest applicable rating.

Rating received

No discernible effects: No affirmative evidence of effects.

  • Criterion 1: None of the studies shows a statistically significant or substantively important effect, either positive or negative.

    Met. No studies showed a statistically significant or substantively important effect, either positive or negative.

Other ratings considered

Positive effects: Strong evidence of a positive effect with no overriding contrary evidence.

  • Criterion 1: Two or more studies showing statistically significant positive effects, at least one of which met WWC evidence standards for a strong design.

    Not met. Only one study met the WWC evidence standards for a strong design, and that study did not show statistically significant positive effects.

    AND

  • Criterion 2: No studies showing statistically significant or substantively important negative effects.

    Met. No studies showed statistically significant or substantively important negative effects.

Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence.

  • Criterion 1: At least one study showing a statistically significant or substantively important positive effect.

    Not met. No studies showed a statistically significant or substantively important positive effect.

    AND

  • Criterion 2: No studies showing a statistically significant or substantively important negative effect, and fewer or the same number of studies showing indeterminate effects than showing statistically significant or substantively important positive effects.

    Not met. No studies showed a statistically significant or substantively important negative effect, but one study showed indeterminate effects.

Mixed effects: Evidence of inconsistent effects as demonstrated through either of the following criteria.

  • Criterion 1: At least one study showing a statistically significant or substantively important positive effect, and at least one study showing a statistically significant or substantively important negative effect, but no more such studies than the number showing a statistically significant or substantively important positive effect.

    Not met. No studies showed a statistically significant or substantively important effect, either positive or negative.

    OR

  • Criterion 2: At least one study showing a statistically significant or substantively important effect, and more studies showing an indeterminate effect than showing a statistically significant or substantively important effect.

    Not met. No studies showed a statistically significant or substantively important effect, while one study showed indeterminate effects.

1 For rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of potentially positive or potentially negative effects. For a complete description, see the WWC Intervention Rating Scheme.

Top

Appendix A5 Extent of evidence by domain

  Sample size
Outcome domain Number of studies Schools Students Extent of evidence1
Alphabetics 0 0 0 na
Fluency 2 2 106 Small
Comprehension 1 1 94 Small
General reading achievement 0 0 0 na

na = not applicable/not studied

1 A rating of “medium to large” requires at least two studies and two schools across studies in one domain and a total sample size across studies of at least 350 students or 14 classrooms. Otherwise, the rating is “small.”

Top


PO Box 2393
Princeton, NJ 08543-2393
Phone: 1-866-503-6114