"" %>
Appendix A1.1 Study characteristics: Bramlett, 1994 (quasi-experimental design)
| Characteristic | Description |
|---|---|
| Study citation | Bramlett, R. K. (1994). Implementing cooperative learning: A field study evaluating issues for school-based consultants. Journal of School Psychology, 32 (1), 67–84. |
| Participants | Eighteen third-grade teachers volunteered to participate in the study. They were matched on the basis of geographic representativeness (school district) and years of teaching experience. In the analysis sample, the CIRC® group included 198 students in nine classrooms, and the comparison group included 194 students in nine classrooms. Each of the two groups of children were divided into three ability levels (lowest 33%, middle 33%, and upper 34%) based on the students' percentile rankings of the pretest California Achievement Test (CAT) total reading scores.1 |
| Setting | The study took place in eight school districts in rural southern Ohio. The number of participating schools was not provided in the study. |
| Intervention | Students in the nine intervention classes were given only the reading components of the CIRC® program: basal related activities, partner reading, story structure, words out loud, word meaning, story retelling, spelling, direct instruction in reading comprehension, and independent reading. |
| Comparison | Students in the comparison group received their regular reading curriculum, which was not indicated in the study. Teachers in the comparison group were promised that they could receive CIRC® training at the completion of the study, and six of them were subsequently trained. |
| Primary outcomes and measurement | The author administered four California Achievement Test (CAT) subtests: Reading Vocabulary, Reading Comprehension, Total Reading, and Word Analysis. The Total Reading subtest has not been included in this review because it is based on the results from the two subtests reported separately: Reading Vocabulary and Reading Comprehension (see Appendix A2 for more detailed descriptions of outcome measures). |
| Teacher training | The teachers received a one-day (6 hour) training in CIRC® by a certified trainer, as well as the project supplemental materials. Following training, the teachers were given assistance via observation and behavioral consultation sessions (approximately 15–30 minutes). Teachers also attended three half-day meetings during the study year to discuss implementation issues. The teachers in the comparison group were promised training and materials upon completion of the first year of the project. |
| 1 for the lowest 33% subgroup, the study did not establish that the comparison group was equivalent to the intervention group at baseline. Analyses of the other two subgroups (middle ability and higher ability) are presented in Appendix A4. | |
Appendix A1.2 Study characteristics: Skeans, 1991 (quasi-experimental design)
| Characteristic | Description |
|---|---|
| Study citation | Skeans, Sharon E. S. (1991). The effects of cooperative integrated reading and composition, fidelity of implementation, and teacher concerns on student achievement. Dissertation Abstracts International, 53 (02), 0455A. (UMI No. 9217026) |
| Participants | The study used intact classroom groups from the same school district, with twenty-four third-grade teachers matched on students' achievement, demographic factors, and hours of training.1 No attrition was reported. In the analysis sample, 169 students were in the 12 intervention classrooms and 141 students were in the 12 comparison classrooms. |
| Setting | The study took place in a suburban district north of Houston, Texas. |
| Intervention | The intervention group was taught CIRC® integrated language arts and cooperative learning for 18 weeks. The program used daily 90-minute lessons to focus on story-related activities, direct instruction in reading comprehension, and integrated reading and language arts activities. Within four-member cooperative teams, pairs of mixed-ability students worked together to read. Within their teams, students re-read the story and discussed to clarify unknown vocabulary, understand the main idea, and comprehend stories. They worked through the writing process together, drafting, revising, and editing each other's writing. |
| Comparison | Students in the comparison classes continued using the district's integrated language arts program. The children's pretest scores on the Metropolitan Achievement Test were used as a covariant in the analyses to account for the differences between groups. |
| Primary outcomes and measurement | The author administered two subtests of the Metropolitan Achievement Test (MAT): Vocabulary and Reading Comprehension. Third graders received the elementary level of the test. The MAT Language subtest was also used in the study, but has not been included in this review because it was outside the scope of the Beginning Reading review (see Appendix A2 for more detailed descriptions of outcome measures). |
| Teacher training | The intervention teachers received at least 12 hours of CIRC® training. The teachers in both conditions had received at least 18 hours of training in the district's integrated language arts program. In order to continue teaching with the program, teachers needed to return a Stages of Concern questionnaire and a formative evaluation of CIRC®, indicating that they were implementing the program. |
| 1 The fifth-grade sample that was included in this study is not reviewed in this report because it is outside the scope of the review. For sample relevancy criteria please see the Beginning Reading Protocol. | |
Appendix A2 Outcome measures in the comprehension domain by construct
| Outcome measure | Description |
|---|---|
| Reading comprehension | |
| California Achievement Test (CAT): Reading Comprehension (Form E) | A group-administered, standardized assessment of reading comprehension (as cited in Bramlett, 1994). |
| CAT: Word Analysis (Form E) | A group-administered, standardized assessment of word analysis (as cited in Bramlett, 1994). |
| Metropolitan Achievement Test-6 (MAT-6): Reading Comprehension (Form L) | A group-administered, standardized assessment of reading comprehension (as cited in Skeans, 1991). |
| Vocabulary development | |
| CAT: Reading Vocabulary (Form E) | A group-administered, standardized assessment of vocabulary (as cited in Bramlett, 1994). |
| MAT-6: Vocabulary subtest (Form L) | A group-administered, standardized assessment of vocabulary (as cited in Skeans, 1991). |
Appendix A3 Summary of study findings included in the rating for the comprehension domain by construct1
| Authors' findings from the study | ||||||||
|---|---|---|---|---|---|---|---|---|
| Mean outcome (standard deviation2) | WWC calculations | |||||||
| Outcome measure | Study sample | Sample size (Classrooms/students) | CIRC® group3 | Comparison group | Mean difference4 (CIRC® – comparison) | Effect size5 | Statistical significance6 (at α= 0.05) | Improvement index7 |
| Construct: Reading comprehension | ||||||||
| Bramlett, 1994 (quasi-experimental design)8 | ||||||||
| CAT: Reading Comprehension | Third grade | 18/392 | 687.0 (56.4) | 681.0 (61.1) | 6.0 | 0.10 | ns | +4 |
| CAT: Word Analysis | Third grade | 18/392 | 667.0 (43.3) | 662.0 (49.7) | 5.0 | 0.11 | ns | +4 |
| Skeans, 1991 (quasi-experimental design)8 | ||||||||
| MAT-6: Reading Comprehension | Third grade | 24/305 | 58.6 (20.7) | 57.0 (20.6) | 1.6 | 0.08 | ns | +3 |
| Construct: Vocabulary development | ||||||||
| Bramlett, 1994 (quasi-experimental design)8 | ||||||||
| CAT: Reading Vocabulary | Third grade | 18/392 | 684.0 (48.7) | 682.0 (59.5) | 2.0 | 0.04 | ns | +1 |
| Skeans, 1991 (quasi-experimental design)8 | ||||||||
| MAT-6: Vocabulary | Third grade | 24/310 | 61.2 (17.9) | 57.5 (18.3) | 3.7 | 0.20 | ns | +8 |
| Average9 for comprehension domain (Bramlett, 1994) | 0.08 | ns | +3 | |||||
| Average9 for comprehension domain (Skeans, 1991) | 0.14 | ns | +6 | |||||
| Domain average9 for comprehension domain | 0.11 | na | +4 | |||||
|
ns = not statistically significant 2 The standard deviation across all students in each group shows how dispersed the participants' outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes. 3 The CIRC® group means were adjusted for pretest. Pretest reading scores were used as a covariant. 4 Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group. 5 For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations. 6 Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups. 7 The improvement index represents the difference between the percentile rank of the average student in the intervention condition versus the percentile rank of the average student in the comparison condition. The improvement index can take on values between -50 and +50, with positive numbers denoting results favorable to the intervention group. 8 The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools (corrections for multiple comparisons were not done for findings not included in the overall intervention rating). For an explanation about the clustering correction, see the WWC Tutorial on Mismatch. See Technical Details of WWC-Conducted Computations for the formulas the WWC used to calculate statistical significance. In the case of Bramlett (1994), a correction for clustering and multiple comparisons was needed, so the significance levels may differ from those reported in the original study. In the case of Skeans (1991), a correction for clustering was needed, so the significance levels may differ from those reported in the original study. 9 The WWC-computed average effect sizes for each study and for the domain across studies are simple averages rounded to two decimal places. The average improvement indices are calculated from the average effect size. |
||||||||
Appendix A4 Summary of subgroup findings for the comprehension domain by construct11
| Authors' findings from the study | ||||||||
|---|---|---|---|---|---|---|---|---|
| Mean outcome (standard deviation2) | WWC calculations | |||||||
| Outcome measure | Study sample | Sample size (Classrooms/students) | CIRC® group3 | Comparison group | Mean difference4 (CIRC® – comparison) | Effect size5 | Statistical significance6 (at α= 0.05) | Improvement index7 |
| Construct: Reading comprehension | ||||||||
| Bramlett, 1994 (quasi-experimental design)8 | ||||||||
| CAT: Reading Comprehension | Third grade/medium | 18/151 | 698.0 (45.6) | 695.0 (35.7) | 3.0 | 0.07 | ns | +3 |
| CAT: Word Analysis | Third grade/medium | 18/151 | 670.0 (29.9) | 673.0 (38.3) | –3.0 | –0.09 | ns | –3 |
| CAT: Reading Comprehension | Third grade/high | 18/92 | 744.0 (32.7) | 735.0 (35.5) | 9.0 | 0.26 | ns | +10 |
| CAT: Word Analysis | Third grade/high | 18/92 | 712.0 (38.2) | 704.0 (37.1) | 8.0 | 0.21 | ns | +8 |
| Construct: Vocabulary development | ||||||||
| Bramlett, 1994 (quasi-experimental design)8 | ||||||||
| CAT: Reading Vocabulary | Third grade/medium | 18/151 | 694.0 (36.7) | 693.0 (30.0) | 1.0 | 0.03 | ns | +1 |
| CAT: Reading Vocabulary | Third grade/high | 18/92 | 736.0 (33.1) | 738.0 (31.6) | –2.0 | –0.06 | ns | –2 |
|
ns = not statistically significant 1 This appendix presents subgroup findings for high ability students (defined as upper 34% of the sample) and medium ability students (middle 33%) for measures that fall in the comprehension domain. Total group scores were used for rating purposes and are presented in Appendix A3.2 The standard deviation across all students in each group shows how dispersed the participants' outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes. 3 The CIRC® group means were adjusted for pretest. Pretest reading scores were used as a covariant. 4 Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group. 5 For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations. 6 Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups. 7 The improvement index represents the difference between the percentile rank of the average student in the intervention condition versus the percentile rank of the average student in the comparison condition. The improvement index can take on values between -50 and +50, with positive numbers denoting results favorable to the intervention group. 8 The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools (corrections for multiple comparisons were not done for findings not included in the overall intervention rating). For an explanation about the clustering correction, see the WWC Tutorial on Mismatch. See Technical Details of WWC-Conducted Computations for the formulas the WWC used to calculate statistical significance. In the case of Bramlett (1994), a correction for clustering was needed, so the significance levels may differ from those reported in the original study. |
||||||||
Appendix A5 CIRC® rating for the comprehension domain
The WWC rates an intervention's effects in a given outcome domain as positive, potentially positive, mixed, no discernible effects, potentially negative, or negative.1
For the outcome domain of comprehension, the WWC rated CIRC® as having no discernible effects. It did not meet the criteria for other ratings (positive effects, potentially positive effects, mixed effects, potentially negative effects, and negative effects) because the one study that met WWC standards did not show statistically significant or substantively important effects.
| Rating received |
|---|
|
No discernible effects: No affirmative evidence of effects.
|
| Other ratings considered |
|
Positive effects: Strong evidence of a positive effect with no overriding contrary evidence.
|
|
Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence.
|
|
Mixed effects: Evidence of inconsistent effects as demonstrated through either of the following criteria.
|
|
Potentially negative effects: Evidence of a negative effect with no overriding contrary evidence.
|
|
Negative effects: Strong evidence of a negative effect with no overriding contrary evidence.
|
| 1 for rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of potentially positive or potentially negative effects. See the WWC Intervention Rating Scheme for a complete description. |
Appendix A6 Extent of evidence by domain
| Sample size | ||||
|---|---|---|---|---|
| Outcome domain | Number of studies | Schools1 | Students | Extent of evidence2 |
| Alphabetics | 0 | 0 | 0 | na |
| Fluency | 0 | 0 | 0 | na |
| Comprehension | 2 | over 8 | 702 | Medium to large |
| General reading achievement | 0 | 0 | 0 | na |
|
na = not applicable/not studied 1No information is provided about the number of schools. Bramlett (1994) study took place in eight school districts (and 18 classrooms), while 24 teachers participated in Skeans (1991) study.2A rating of "medium to large" requires at least two studies and two schools across studies in one domain and a total sample size across studies of at least 350 students or 14 classrooms. Otherwise, the rating is "small." |
||||