WWC review of this study

The Impacts of a Standards-Based Grading System Emphasizing Formative Assessment, Feedback, and Re-Assessment: A Mixed Methods, Cluster Randomized Control Trial in Ninth Grade Mathematics Classrooms

Steven L. Kramer; Michael A. Posner; Alexander S. Browman; Nancy R. Lawrence; Jennifer Roem; Kathleen Krier (2025). Journal of Research on Educational Effectiveness, v18 n1 p56-87. Retrieved from: https://eric.ed.gov/?id=EJ1492695

  •  examining 
    2,736
     Students
    , grades
    8-9

Reviewed: April 2026

At least one finding shows moderate evidence of effectiveness
At least one statistically significant positive finding
Meets WWC standards with reservations
General Mathematics Achievement outcomes—Statistically significant positive effect found for the domain
Outcome
measure
Comparison Period Sample Intervention
mean
Comparison
mean
Significant? Improvement
    index
Evidence
tier

Researcher-developed geometry or algebra end-of-course assessment

Proficiency-based Assessment and Reassessment of Learning Outcomes (PARLO) vs. Business as usual

0 Days

Full sample;
2,736 students

0.33

0.00

Yes

 
 
13
 


Evidence Tier rating based solely on this study. This intervention may achieve a higher tier when combined with the full body of evidence.

Characteristics of study sample as reported by study author.


  • Female: 55%
    Male: 45%

  • Rural, Suburban, Urban
  • Race
    Asian
    5%
    Black
    30%
    Other or unknown
    15%
    White
    51%
  • Ethnicity
    Hispanic    
    8%
    Not Hispanic or Latino    
    92%
  • Eligible for Free and Reduced Price Lunch
    Other or unknown    
    100%

Setting

This study was conducted in 29 schools across urban, suburban, and rural areas, including public, charter, and Catholic schools from high- and low-performing districts. Two cohorts participated: 20 schools participated in 2010-11 and 2011-12, and 15 schools participated in 2011-12 and 2012-13. The study focused primarily on 9th-grade Algebra and Geometry classrooms, with all teachers in these subjects asked to participate for 2 years.

Study sample

The study randomly assigned 18 schools to the intervention group and 17 to the comparison group. 14 intervention schools and 15 comparison schools provided outcome data for the study. The sample of students included in the analysis consisted of 1,649 intervention students and 1,087 comparison students. The analytic sample of teachers included 33 intervention teachers and 25 comparison group teachers. More than two-thirds (68 percent) of the teachers were female. Teachers were 88 percent White, 7 percent Black, 5 percent Asian, and 2 percent multiracial or of another unspecified race.

Intervention Group

Teachers in the intervention condition used Proficiency-based Assessment and Reassessment of Learning Outcomes (PARLO), a standards-based grading system designed to help students learn course content by organizing grading around clearly defined learning outcomes, formative feedback, and opportunities to show improved mastery over time. PARLO aims to shift grading away from averaging early and later performance and toward grading based on the best evidence of what students had learned by the end of the term. In the study, PARLO teachers organized each semester around about 10 to 15 learning outcomes and shared these outcomes and success criteria with students and families. Teachers used quizzes, exit tickets, observations, and other classroom evidence to judge student progress on each learning outcome and to give students feedback. Students could complete additional learning activities, such as error logs, remediation plans, or in-class catch-up opportunities, and then reassess for full credit. Final grades were based on the number of learning outcomes on which students demonstrated proficient or high-performance work, rather than on a weighted average of points earned across the semester. Teachers participated for two years and developed their own learning outcomes, grading procedures, and classroom routines within the PARLO approach. Students were taught by teachers using PARLO during eighth or ninth grade.

Comparison Group

Teachers in the comparison condition received training on organizing instruction by learning outcomes and providing formative assessment and feedback, which the authors consider necessary but not sufficient components of the PARLO system. However, teachers in the comparison condition did not receive any additional training or implement other key PARLO system features, such as reassessment policies or mastery-based grading.

Support for implementation

Teachers received a stipend for participation. All participating teachers received professional development on organizing instruction by learning outcomes and providing formative feedback during the summer prior to implementation. Teachers in Cohort 1 received 3 days of this professional development, and teachers in Cohort 2 received 2 days. Teachers in the intervention condition also received additional professional development on the other components of PARLO. Cohort 1 teachers in the intervention condition received an additional 3 days of professional development to support PARLO implementation prior to the first year of implementation and an additional 2 days prior to the second year of implementation. Cohort 2 teachers in the intervention condition received an additional 3 days of professional development prior to the first year of implementation and an additional 2 days prior to the second year of implementation. All teachers in the intervention condition also had the opportunity to participate in monthly professional learning community (PLC) meetings during the intervention.

 

Your export should download shortly as a zip archive.

This download will include data files for study and findings review data and a data dictionary.

Connect With the WWC

loading