Retrieval Practice in the Classroom — Without Grading More
A practical guide for high school teachers: how to run retrieval practice in class without adding grading, what the classroom studies actually show, and how to start this week.
Most teachers already believe in the idea behind retrieval practice. The objection is never the evidence; it is the workload. If every act of recall has to pass through your gradebook, the method dies on contact with a 150-student roster.
This post makes the opposite case: retrieval practice works best in the classroom precisely when it is ungraded, frequent, and short — and the implementation cost can be close to zero.
What the Classroom Studies Show
Laboratory findings on the testing effect are decades old, but the classroom replications are what matter for teachers. Roediger, Agarwal, McDaniel, and McDermott (2011) ran retrieval practice inside real middle school social studies classes: material that was quizzed (without grades attached) was retained significantly better on end-of-semester and end-of-year exams than material that was reviewed or simply re-taught. The quizzes were minutes long. The teacher did not change the curriculum.
McDermott, Agarwal, D'Antonio, Roediger, and McDaniel (2014) extended this to middle and high school science and history classrooms and found the same pattern: low-stakes quizzing produced reliable gains on later unit exams, and the format of the quiz (multiple choice vs. short answer) mattered less than the fact of retrieval itself.
Two details from this literature are worth holding onto:
- The benefit attaches to what gets retrieved. Quizzed material outperforms non-quizzed material within the same class. Coverage matters — which is why frequency beats depth.
- Stakes are not the mechanism. The retrieval is. Grading the quizzes adds anxiety and workload without adding retention.
The Implementation That Survives a Real Schedule
The version of retrieval practice that survives contact with a real teaching load has three properties: it is short, it is routine, and nobody grades it.
Brain dumps (2–3 minutes). At the start of class: "Write down everything you remember from yesterday." No collection, no grade. Students check their own gaps against the day's opening slide. This is the cheapest retrieval format that exists, and it doubles as your settling-in routine.
Exit questions (1 minute). One question, answered from memory, on the way out. If you want a signal without grading, a show of hands on confidence ("how sure are you?") after revealing the answer tells you more about tomorrow's lesson plan than a stack of collected slips.
Spaced re-asking. The piece most classrooms miss. A question asked once, the day after teaching, mostly measures short-term memory. The same question re-asked two weeks later — when forgetting has set in — is where the durable learning happens. Keep a running list of "questions worth re-asking" per unit and pull two or three old ones into each week's openers.
That last item is the hard one to run by hand, and it is the one piece where software earns its place.
Where Software Fits (and Where It Doesn't)
The scheduling problem — which student needs which item re-asked when — is not solvable on paper for a full roster. Each student forgets different material at different rates. This is what spaced repetition algorithms are for: they model each student's forgetting curve per item and schedule the re-ask just before the memory fades.
This is the model behind LearningCues: you assign the content, students review on their own schedule, and the algorithm (FSRS) handles per-student spacing. The teacher-facing side answers the question paper can't: who is retaining what — broken down by chapter, with the weak spots surfaced before the test rather than on it. Library content is tagged to CCSS, NGSS, and AP standards, so the assignment you create maps to the framework your department already reports against.
What software does not replace is the in-class retrieval routine. The two minutes of blank-page recall at the start of class builds something an app cannot: the classroom norm that remembering is expected, attempted daily, and safe to fail at.
A One-Week Starting Plan
- Monday: Open with a 2-minute brain dump on last week's material. Tell students why — the Roediger et al. finding fits in one sentence.
- Tuesday–Thursday: One exit question per day, answered from memory. Keep the questions; they are next month's re-ask bank.
- Friday: Pull three questions from two weeks ago into the opener. Expect worse performance than you'd like — that struggle is the point, and saying so out loud changes how students read their own difficulty.
None of this requires new materials, new grading, or permission. The research has been stable for fifteen years; the only variable is whether the retrieval actually happens.
References
- Roediger, H. L., Agarwal, P. K., McDaniel, M. A., & McDermott, K. B. (2011). Test-enhanced learning in the classroom: Long-term improvements from quizzing. Journal of Experimental Psychology: Applied, 17(4), 382–395.
- McDermott, K. B., Agarwal, P. K., D'Antonio, L., Roediger, H. L., & McDaniel, M. A. (2014). Both multiple-choice and short-answer quizzes enhance later exam performance in middle and high school classes. Journal of Experimental Psychology: Applied, 20(1), 3–21.
- Dunlosky, J., Rawson, K. A., Marsh, E. J., Nathan, M. J., & Willingham, D. T. (2013). Improving students' learning with effective learning techniques. Psychological Science in the Public Interest, 14(1), 4–58.