/ Library/ What Should We Remember/ Testing Effect
Recall Bias № 132 · Last updated 6 June 2026

Testing Effect.

"They do not remember what they re-read — they remember what they had to produce."

01Overview

The testing effect (retrieval practice effect) is the finding that recalling information — test, quiz, practice task — produces stronger long-term retention than restudying the same content. Effortful retrieval strengthens memory traces; passive exposure feels fluent but fades.

Onboarding videos, tooltip tours, and help centres skew passive. Testing effect argues for learn-by-doing: complete a task, answer a question, locate a feature cold — not only watch someone else. Products that measure "content viewed" optimise the wrong variable; products that embed retrieval optimise memory that survives week two.

02Detailed explanation

Retrieval practice applies across education and product design:

  • Security training click-through vs phishing simulation — simulation retrieves, sticks.
  • Onboarding quiz locating settings beats slide showing settings.
  • Spaced check-ins ("export a report again") reinforce vs one-time demo.
  • Help search user must query — retrieval strengthens path memory vs browsed tree.

Testing effect pairs with spacing — distributed retrieval beats cramming. Design onboarding as spaced retrieval challenges, not one passive binge.

03Why it exists

Retrieval effort triggers reconsolidation and new cues — passive review skips that pathway.

Fluency illusion from re-reading feels like learning — designers mistake smooth consumption for durable skill.

The short version

If users never had to retrieve it, assume they will need the tutorial again tomorrow.

04Effects on users

Users who "watched the tour" call support — users who completed practice task do not.

Google effect offloads retrieval — design must decide what must live in head versus tool.

05Effects on designers & teams

Teams ship passive onboarding:

  • Video-only training. No retrieval checkpoint.
  • Tooltip acknowledge. Click through without recall test.
  • Documentation as substitute for practice. Read ≠ retrieve.
  • Single cram session. No spaced testing.

06Practical takeaways

  • Replace passive tours with do-tasks. User performs, system confirms.
  • Add spaced retrieval prompts. Day 1, 3, 7 micro-challenges.
  • Quiz comprehension for high-stakes flows. Billing, privacy, safety.
  • Measure retrieval success. Cold task completion, not video completion.
  • Design desirable difficulty. Struggle that succeeds encodes.
  • Pair with feedback on wrong retrieval. Corrective test beats silent fail.

07Design examples

Onboarding

Tour versus task

Cohort A watches five-slide tour. Cohort B completes three retrieval tasks (find export, invite user, set notification). Week-two success: 34% vs 71% — testing effect.

Security

Simulated phish

Annual video compliance flat. Quarterly simulated phishing with immediate feedback reduces clicks — retrieval practice in context.

Education

Quiz after chapter

In-app academy adds end-of-module retrieval quiz. Certification pass rates rise; support tickets on certified topics fall.

Help

Search to learn

Guided challenge "find refund policy without browse menu" increases later unaided finds — retrieval path encoded.

08Ethical risks

Testing effect used to gate essential access behind humiliating or impossible quizzes blocks rather than teaches — retrieval as barrier.

High-stakes retrieval without accommodation harms users with memory disabilities — offer alternative encoding paths.

Self-test: What must users remember — and where do you ask them to retrieve it, not only read it?

10Suggested reading