Appendix A1.1 Study characteristics: Hancock, 2002 (randomized controlled trial)
| Characteristic | Description |
|---|---|
| Study citation | Hancock, C. M. (2002). Accelerating reading trajectories: The effects of dynamic research-based instruction. Dissertation Abstracts International, 63(06), 2139A. (UMI No. 3055690) |
| Participants | The study involved 94 second-grade students who attended a single school. Out of this group, 48 students received the intervention and 46 were in the comparison group. The students were randomly assigned to intervention and comparison groups using block randomization procedures. All students in the second grade were administered several initial measures. Student scores were rank-ordered within each classroom, and then each student was matched with a similarly performing student. Students were then randomly assigned to the intervention group and the comparison group within matched pairs. No information was reported regarding student ethnicity or gender, but 11% of the students in this school qualified for free or reduced-price lunch. There was no attrition. |
| Setting | The study took place in one elementary school in the Kyrene school district in Tempe, Arizona. |
| Intervention | In additional to the regular curriculum (including reading instruction), the intervention group received 25 minutes of supplemental instruction using Read Naturally four times a week for 11 weeks. In each lesson, the first five minutes were spent on oral reading of a selected passage with a teaching assistant. The reading was timed for one minute, and the total number of words read correctly was recorded on a graph. The last 20 minutes involved repeated oral reading of curriculum stories either individually or with a cassette tape. Once students practiced a passage eight times (three times with a cassette and five times individually), they did a timed reading with the teacher. If the student achieved mastery (100 words read correctly with three or fewer errors), the student moved on to another passage; otherwise, the cycle was repeated. The procedures used in this study excluded Read Naturally’s pre-reading vocabulary instruction component and the Read Naturally placement system to individualize instruction. |
| Comparison | In additional to their regular curriculum (including reading instruction), the comparison group students received supplemental instruction using Connecting Math Concepts curriculum (Level B). This program used worksheets, workbooks, coins, and games, and taught basic mathematics skills such as place value, money counting, time, addition, subtraction, and multiplication. |
| Primary outcomes and measurement | The author used the Peabody Picture Vocabulary Test: Third Edition (PPVT-III), the Word Use Fluency test, and the Curriculum Based Measure: Cloze Probe and Test of Reading Fluency. The author used initial reading skills as a covariate to account for baseline differences between groups. For a more detailed description of these outcome measures, see Appendices A2.1–2.2. |
| Staff/teacher training | Six teaching assistants were trained over five days. Teaching assistants were observed modeling lessons during the training sessions and provided with written feedback. Teaching assistants were also observed once a week during the first phase, and at least once every three weeks during the second phase, receiving feedback as necessary. |
Appendix A1.2 Study characteristics: Mesa, 2004 (quasi-experimental design)
| Characteristic | Description |
|---|---|
| Study citation | Mesa, C. L. (2004). Effect of Read Naturally software on reading fluency and comprehension. Unpublished master’s thesis, Piedmont College, Demorest, GA. |
| Participants | Twelve students from a single class were selected to participate because they had mastered certain decoding patterns. These students were matched into pairs based on their pre-intervention test scores (STAR Reading Test); one student was assigned to the intervention group and one to the comparison group.1 |
| Setting | The study took place in one elementary school in Georgia. |
| Intervention | Students in the group left their regular class for Read Naturally (2001) computer instruction for 45 minutes a day, four days a week, for three weeks. Students used the program independently unless they had a question or were attempting to pass a level, in which case they interacted with the teacher. The Read Naturally group worked with minimal teacher’s supervision. |
| Comparison | The comparison group did not receive any special instruction and remained in the class with the regular classroom teacher. |
| Primary outcomes and measurement | The author administered the Oral Reading Fluency test. Two other outcomes, the STAR Reading Test and the Comprehension Reading Test, were also used in the study, but have not been included in this review because sufficient information was not provided to evaluate face validity and reliability of these tests. For a more detailed description of these outcome measures, see Appendices A2.1–2.2. |
| Staff/teacher training | No information on teacher training is provided. |
| 1 The pretest equivalency of the two groups on the Oral Reading Fluency measure was verified by the WWC. | |
Appendix A2.1 Outcome measures in the fluency domain
| Outcome measure | Description |
|---|---|
| Oral Reading Fluency | The test measures the number of words read per minute minus any errors. The passage was a 113-word passage (as cited in Mesa, 2004). |
| Curriculum Based Measurement: Test of Reading Fluency | Students were given passages from Level B of the Test of Reading Fluency, which are based on several published curricula and are designed to represent general grade-level reading material. The total number of words read correctly was recorded (as cited in Hancock, 2002). |
Appendix A2.2 Outcome measures in the READING comprehension domain
| Outcome measure | Description |
|---|---|
| Vocabulary | |
| Peabody Picture Vocabulary Test: Third Edition (PPVT-III) | A standardized, receptive vocabulary test that asks students to choose which one of four pictures corresponds to a test word spoken aloud (as cited in Hancock, 2002). |
| Word Use Fluency | The Word Use Fluency test measured students’ expressive language skills. The tester verbally presented words to the student, who was asked to use the words in a sentence. Words were presented one at a time, and the next word was presented once a response was given. The task lasted one minute, and the total correct number of responses was provided (as cited in Hancock, 2002). |
| Reading comprehension | |
| Curriculum Based Measurement: Cloze Probe | Students read passages of text and fill in key missing words from three choices (as cited in Hancock, 2002). |
Appendix A3.1 Summary of study findings included in the rating for the fluency domain1
| Authors' findings from the study |
||||||||
|---|---|---|---|---|---|---|---|---|
| Mean outcome (standard deviation)2 |
WWC calculations | |||||||
| Outcome measure | Study sample | Sample size (students) | Read Naturally group | Comparison group | Mean difference3 (Read Naturally – comparison) |
Effect size4 | Statistical significance5 (at α = 0.05) |
Improvement index6 |
Hancock, 2002 (randomized controlled trial)7 |
||||||||
| CBM: Test of Reading Fluency | Second grade | 94 | 117.38 (30.73) |
112.38 (30.52) |
5.00 | 0.16 | ns | +6 |
| Average for fluency domain (Hancock, 2002)8 | 0.16 | ns | +6 | |||||
Mesa, 2004 (quasi-experimental design)7 |
||||||||
| Oral Reading Fluency9 | First grade | 12 | 80.00 (20.66) |
74.33 (25.56) |
5.67 | 0.23 | ns | +9 |
| Average for fluency domain (Mesa, 2004)8 | 0.23 | ns | +9 | |||||
| Domain average for fluency across all studies8 | 0.19 | na | +8 | |||||
|
ns = not statistically significant 2 The standard deviation across all students in each group shows how dispersed the participants’ outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes. 3 Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group. 4 For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations. 5 Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups. 6 The improvement index represents the difference between the percentile rank of the average student in the intervention condition and that of the average student in the comparison condition. The improvement index can take on values between –50 and +50, with positive numbers denoting results favorable to the intervention group. 7 The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools and for multiple comparisons. For an explanation about the clustering correction, see the WWC Tutorial on Mismatch. For the formulas the WWC used to calculate statistical significance, see Technical Details of WWC-Conducted Computations. In the cases of Hancock (2002) and Mesa (2004), no corrections for clustering or multiple comparisons were needed. 8 The WWC-computed average effect sizes for each study and for the domain across studies are simple averages rounded to two decimal places. The average improvement indices are calculated from the average effect size. 9 The Read Naturally group mean equals the comparison group mean plus the mean difference. The computation of the mean difference took into account the pretest difference between the study groups. |
||||||||
Appendix A3.2 Summary of study findings included in the rating for the comprehension domain1
| Authors' findings from the study |
||||||||
|---|---|---|---|---|---|---|---|---|
| Mean outcome (standard deviation)2 |
WWC calculations | |||||||
| Outcome measure | Study sample | Sample size (students) | Read Naturally group3 | Comparison group | Mean difference4 (Read Naturally – comparison) |
Effect size5 | Statistical significance6 (at α = 0.05) |
Improvement index7 |
Hancock, 2002 (randomized controlled trial)8 |
||||||||
| Construct: Vocabulary development | ||||||||
| PPVT-III | Second grade | 94 | 118.11 (16.14) |
117.79 (17.50) |
0.32 | 0.02 | ns | +1 |
| Word Use Fluency | Second grade | 94 | 53.10 (12.07) |
50.42 (12.20) |
2.68 | 0.22 | ns | +9 |
| Construct: Reading comprehension | ||||||||
| CBM: Cloze Probe | Second grade | 94 | 22.70 (8.66) |
23.37 (7.18) |
–0.67 | –0.08 | ns | –3 |
| Domain average for comprehension (Hancock, 2002)9 | 0.05 | na | +2 | |||||
|
ns = not statistically significant 2 The standard deviation across all students in each group shows how dispersed the participants' outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes. 3 Means are adjusted for pretest. The authors used initial reading skills as a covariant. 4 Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group. 5 For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations. 6 Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups. 7 The improvement index represents the difference between the percentile rank of the average student in the intervention condition and that of the average student in the comparison condition. The improvement index can take on values between –50 and +50, with positive numbers denoting results favorable to the intervention group. 8 The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools and for multiple comparisons. For an explanation about the clustering correction, see the WWC Tutorial on Mismatch. For the formulas the WWC used to calculate statistical significance, see Technical Details of WWC-Conducted Computations. In the case of Hancock (2002), no corrections for clustering or multiple comparisons were needed. 9 The WWC-computed average effect sizes for each study and for the domain across studies are simple averages rounded to two decimal places. The average improvement indices are calculated from the average effect size. For a single study included in the comprehension domain, the study average is equal to the domain average. |
||||||||
Appendix A4.1 Read Naturally rating for the fluency domain
The WWC rates an intervention's effects in a given outcome domain as positive, potentially positive, mixed, no discernible effects, potentially negative, or negative.1
For the outcome domain of fluency, the WWC rated Read Naturally as having no discernible effects. The remaining ratings (potentially negative effects, negative effects) were not considered, as Read Naturally was assigned the highest applicable rating.
| Rating received |
|---|
|
No discernible effects: No affirmative evidence of effects.
|
| Other ratings considered |
|
Positive effects: Strong evidence of a positive effect with no overriding contrary evidence.
|
|
Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence.
|
|
Mixed effects: Evidence of inconsistent effects as demonstrated through either of the following criteria.
|
| 1 For rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of potentially positive or potentially negative effects. For a complete description, see the WWC Intervention Rating Scheme. |
Appendix A4.2 Read Naturally rating for the comprehension domain
The WWC rates anintervention's effects in a given outcome domain as positive, potentially positive, mixed, no discernible effects, potentially negative, or negative.1
For the outcome domain of comprehension, the WWC rated Read Naturally as having no discernible effects. The remaining ratings (potentially negative effects, negative effects) were not considered, as Read Naturally was assigned the highest applicable rating.
| Rating received |
|---|
|
No discernible effects: No affirmative evidence of effects.
|
| Other ratings considered |
|
Positive effects: Strong evidence of a positive effect with no overriding contrary evidence.
|
|
Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence.
|
|
Mixed effects: Evidence of inconsistent effects as demonstrated through either of the following criteria.
|
| 1 For rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of potentially positive or potentially negative effects. For a complete description, see the WWC Intervention Rating Scheme. |
Appendix A5 Extent of evidence by domain
| Sample size | ||||
|---|---|---|---|---|
| Outcome domain | Number of studies | Schools | Students | Extent of evidence1 |
| Alphabetics | 0 | 0 | 0 | na |
| Fluency | 2 | 2 | 106 | Small |
| Comprehension | 1 | 1 | 94 | Small |
| General reading achievement | 0 | 0 | 0 | na |
|
na = not applicable/not studied 1 A rating of “medium to large” requires at least two studies and two schools across studies in one domain and a total sample size across studies of at least 350 students or 14 classrooms. Otherwise, the rating is “small.” |
||||
|Institute of Education Sciences