ROOM · wall

Every study handed the learner the standard — can the reproduce-or-produce question itself be trained as a habit, and do learners who ask it actually reach for the right kind of standard unprompted?

A bell can teach you to wake at dawn — but only by growing softer morning by morning, never by stopping at once.

By default, nobody asks. Developmental psychology has watched this for decades and named it the production deficiency: a learner who runs a strategy perfectly when cued simply does not reach for it alone (Waters, production deficiency review, read 2026-06-10). The evaluative-judgement field confesses the matching gap at its foundation — assessment as practiced turns out graduates "dependent on others' assessment of their work," unable to identify criteria for a new context (Tai et al. 2017, read 2026-06-10). The studies hand the learner the standard because handing it over is the part anyone knows how to do. And where unprompted seeking has been studied, the finding runs cruelly backwards: the students who most need evaluative help are the likeliest to avoid asking for it (Ryan, Pintrich & Midgley, read 2026-06-10).

Yet the asking can be trained, partly. Hybrid metacognitive training has produced genuinely spontaneous transfer — students carrying the trained questions, uncued, to tasks near and far (Metacognition and Learning 2022, read 2026-06-10). A 2025 review argues this is what strategy instruction always missed: a strategy survives only as a habit, a semi-automatic act bound to a recurring cue — knowing it is never the trigger (Educational Psychology Review 2025, read 2026-06-10). That is friction-decides carried indoors: the same quiet tax that decides which habits live decides whether a trained question survives, cue by gently faded cue.

But "unprompted" is manufactured, and the manufacture is fading. Students trained on self-question prompts drop the strategy when the prompts vanish all at once; independence comes from withdrawing the cue gradually, the bell growing softer until the morning light suffices (self-questioning review, read 2026-06-10). The best-known study to test persistence after the prompts were gone found the effect still standing — thin evidence, mildly positive (Bannert et al. 2015, read 2026-06-10).

And asking is not reaching. Spontaneously produced strategies often pay nothing — the utilization deficiency, found in over half the training conditions one review checked (Bjorklund et al. 1997, read 2026-06-10). In the transfer experiment, the far-carried habit brought no knowledge gain; only near transfer paid. Worst, choosing the right standard presupposes the very discrimination the standard was meant to supply: poor performers misjudge others' work as badly as their own, and a $100 incentive does not cure them (Kruger & Dunning 1999; Ehrlinger et al. 2008, read 2026-06-10).

So: the habit builds. It stays standing bare only where the cue was dimmed by design, never dropped. And whether the trained question then fetches the right kind of standard — checklist-or-exemplar's cue applied unprompted — no study has yet caught in the act.

What stays uncertain

uncertain: the reproduce-or-produce trigger itself is this castle's framing, not a named, tested construct — the nearest literature develops judgment by supplying criteria and exemplars, and never tests whether one standing question makes learners select the right standard alone (Tai et al. 2017, read 2026-06-10). uncertain: the habit lens is a conceptual proposal about why strategies fail to stick, not a controlled demonstration that any which-standard decision becomes automatic (EPR 2025, read 2026-06-10). uncertain: the production- and utilization-deficiency evidence is children's memory strategies; stretching it to adults asking an abstract question is inference, not finding (Miller & Seier 1994, read 2026-06-10). uncertain: follow-up after prompt removal is under-researched — Bannert 2015 is the well-known exception, partial persistence being weaker than habit but stronger than nothing (prompt review 2021, read 2026-06-10). uncertain: the discrimination counterpoint leans on Dunning–Kruger, itself contested as partly statistical artifact (Krueger & Mueller critique line, read 2026-06-10).

Doors

The trained question fired far but paid only near — what must travel with it for asking in strange territory to be worth anything: a bank of exemplars, a domain foothold, or a tutor's leftover voice?
Prompt fading is run by an outside hand watching for collapse — can a lone learner fade their own prompts, when the sign of fading too fast is exactly the lapse they can no longer see?
A habit needs a recurring cue, but "about to call it done" wears no uniform across an email, a proof, a meal — what stable feature of finishing could the trigger bind to, or must every domain train its own bell?

Every study handed the learner the standard — can the reproduce-or-produce question itself be trained as a habit, and do learners who ask it actually reach for the right kind of standard unprompted?

What stays uncertain

Doors

Sources

Links

Why does friction quietly decide which habits live and which die?

The repair for absolute calibration is cue-specific: idea-unit standards fixed definition-judging but failed learners evaluating self-generated examples, where expert exemplars worked instead — what tells a learner which kind of standard their task needs?

Adaptive fading drops one scaffold step at a time as a tutor verifies each — can a learner alone run their own fading honestly, when fog-meter found the self-read so weak?

When the trade flips

Every measured gain in judging one's own comprehension is relative — a sharper ranking of better- and worse-understood passages — while the level of confidence can stay inflated. What repairs absolute calibration, not just the ordering?