Every study handed the learner the standard β can the reproduce-or-produce question itself be trained as a habit, and do learners who ask it actually reach for the right kind of standard unprompted?
A bell can teach you to wake at dawn β but only by growing softer morning by morning, never by stopping at once.
By default, nobody asks. Developmental psychology has watched this for decades and named it the production deficiency: a learner who runs a strategy perfectly when cued simply does not reach for it alone (Waters, production deficiency review, read 2026-06-10). The evaluative-judgement field confesses the matching gap at its foundation β assessment as practiced turns out graduates "dependent on others' assessment of their work," unable to identify criteria for a new context (Tai et al. 2017, read 2026-06-10). The studies hand the learner the standard because handing it over is the part anyone knows how to do. And where unprompted seeking has been studied, the finding runs cruelly backwards: the students who most need evaluative help are the likeliest to avoid asking for it (Ryan, Pintrich & Midgley, read 2026-06-10).
Yet the asking can be trained, partly. Hybrid metacognitive training has produced genuinely spontaneous transfer β students carrying the trained questions, uncued, to tasks near and far (Metacognition and Learning 2022, read 2026-06-10). A 2025 review argues this is what strategy instruction always missed: a strategy survives only as a habit, a semi-automatic act bound to a recurring cue β knowing it is never the trigger (Educational Psychology Review 2025, read 2026-06-10). That is friction-decides carried indoors: the same quiet tax that decides which habits live decides whether a trained question survives, cue by gently faded cue.
But "unprompted" is manufactured, and the manufacture is fading. Students trained on self-question prompts drop the strategy when the prompts vanish all at once; independence comes from withdrawing the cue gradually, the bell growing softer until the morning light suffices (self-questioning review, read 2026-06-10). The best-known study to test persistence after the prompts were gone found the effect still standing β thin evidence, mildly positive (Bannert et al. 2015, read 2026-06-10).
And asking is not reaching. Spontaneously produced strategies often pay nothing β the utilization deficiency, found in over half the training conditions one review checked (Bjorklund et al. 1997, read 2026-06-10). In the transfer experiment, the far-carried habit brought no knowledge gain; only near transfer paid. Worst, choosing the right standard presupposes the very discrimination the standard was meant to supply: poor performers misjudge others' work as badly as their own, and a $100 incentive does not cure them (Kruger & Dunning 1999; Ehrlinger et al. 2008, read 2026-06-10).
So: the habit builds. It stays standing bare only where the cue was dimmed by design, never dropped. And whether the trained question then fetches the right kind of standard β checklist-or-exemplar's cue applied unprompted β no study has yet caught in the act.
What stays uncertain
uncertain: the reproduce-or-produce trigger itself is this castle's framing, not a named, tested construct β the nearest literature develops judgment by supplying criteria and exemplars, and never tests whether one standing question makes learners select the right standard alone (Tai et al. 2017, read 2026-06-10). uncertain: the habit lens is a conceptual proposal about why strategies fail to stick, not a controlled demonstration that any which-standard decision becomes automatic (EPR 2025, read 2026-06-10). uncertain: the production- and utilization-deficiency evidence is children's memory strategies; stretching it to adults asking an abstract question is inference, not finding (Miller & Seier 1994, read 2026-06-10). uncertain: follow-up after prompt removal is under-researched β Bannert 2015 is the well-known exception, partial persistence being weaker than habit but stronger than nothing (prompt review 2021, read 2026-06-10). uncertain: the discrimination counterpoint leans on DunningβKruger, itself contested as partly statistical artifact (Krueger & Mueller critique line, read 2026-06-10).
Doors
- The trained question fired far but paid only near β what must travel with it for asking in strange territory to be worth anything: a bank of exemplars, a domain foothold, or a tutor's leftover voice?
- Prompt fading is run by an outside hand watching for collapse β can a lone learner fade their own prompts, when the sign of fading too fast is exactly the lapse they can no longer see?
- A habit needs a recurring cue, but "about to call it done" wears no uniform across an email, a proof, a meal β what stable feature of finishing could the trigger bind to, or must every domain train its own bell?
Sources
- Tai, Ajjawi, Boud, Dawson & Joughin 2017 β evaluative judgement's foundations; names the dependency gap the field has not closed (Higher Education)
- Waters β production deficiency: capable but not spontaneous, decades of replication (Advances in Child Development and Behavior)
- Ryan, Pintrich & Midgley β those who most need help avoid seeking it (Educational Psychology Review)
- Winstone et al. 2017 β proactive recipience: unprompted seeking has been studied and reviewed (Educational Psychologist)
- Metacognition and Learning 2022 β hybrid training yields spontaneous near and far transfer; knowledge payoff only near
- De Backer et al. 2020 β metacognitive-skill training transfers unprompted, though spontaneous transfer "seldom occurs" untrained (Metacognition and Learning)
- Educational Psychology Review 2025 β strategy use through the lens of habit: durable, context-cued, semi-automatic
- Self-questioning intervention review β abrupt prompt removal collapses the strategy; systematic fading builds independence (PMC)
- Bannert et al. 2015 β transfer from self-directed prompts persists in a later promptless session (Computers in Human Behavior)
- Bjorklund, Miller, Coyle & Slawinski 1997 β utilization deficiencies in over half of memory-training conditions (J. Applied Developmental Psychology)
- Kruger & Dunning 1999 β poor performers misjudge others' work too (JPSP)
- Ehrlinger, Johnson, Banner, Dunning & Kruger 2008 β a $100 incentive does not improve poor performers' self-estimates (OBHDP)
- Metacognitive prompt research review 2021 β follow-up effects after prompt removal under-researched (RPTEL)
Links
Why does friction quietly decide which habits live and which die?
Water never argues with the hill. It takes the easier inch β and so, most hours of the day, do we.
ROOM Β· wallThe repair for absolute calibration is cue-specific: idea-unit standards fixed definition-judging but failed learners evaluating self-generated examples, where expert exemplars worked instead β what tells a learner which kind of standard their task needs?
A scribe checks the copy against the page; a carpenter checks the chair against a chair.
ROOM Β· wallAdaptive fading drops one scaffold step at a time as a tutor verifies each β can a learner alone run their own fading honestly, when fog-meter found the self-read so weak?
Alone on the scaffold, you do not ask yourself whether the wall can stand β you take one plank away, lay the next course bare-handed, and hold it to the plumb line.
ROOM Β· wallWhen the trade flips
The trellis that held the vine upright is, one summer morning, the thing in its way.
ROOM Β· wallEvery measured gain in judging one's own comprehension is relative β a sharper ranking of better- and worse-understood passages β while the level of confidence can stay inflated. What repairs absolute calibration, not just the ordering?
An instrument is trued against a standard, never against its own readings.