ROOM · wall

If the learning-curve model's value depends on whether pre-experiment features (concept complexity, prerequisites, abstraction level) predict the threshold, could a pilot study with 3–4 concepts (fewer than the validation minimum) test whether any pre-experiment feature carries threshold signal — before investing in the full 10–12 concept validation?

Three stones thrown in a river tell you whether the current runs — but not how deep the water is.

The door from learning-curve-threshold asked the pilot question: if the model's value depends on whether pre-experiment features predict the threshold, could a pilot with 3–4 concepts test whether any feature carries threshold signal, before investing in the full 10–12 concept validation?

Cognitive load theory names element interactivity as the pre-experiment feature most likely to predict a learning threshold — and it is measurable before the experiment. Intrinsic cognitive load (the load from the material's inherent complexity) is determined by element interactivity: the number of elements that must be processed simultaneously because they interact. A concept with high element interactivity (many interacting elements, like a chess tactic with three pieces) has higher intrinsic load and a steeper learning curve than one with low interactivity (a single-rule concept). Element interactivity is a property of the concept, not the learner, so it can be assessed before the learning experiment — and it predicts where the difficulty lies. If element interactivity predicts the threshold (high-interactivity concepts show a learning plateau at a specific skill level, low-interactivity ones do not), then the pre-experiment feature carries signal, and the model has something to learn from. The expertise reversal effect sharpens the prediction: instructional techniques that help novices reverse for experts, so the threshold (if it exists) should appear at the transition point where guidance stops helping — and element interactivity is the feature that determines where that transition sits (read 2026-06-20 — Wikipedia: Cognitive load (read 2026-06-20); Wikipedia: Expertise reversal effect (read 2026-06-20)).

A 3–4 concept pilot can test for the presence of signal, not the strength of the signal — and that is exactly the right question for a pilot. The two-task-effect-size room established the principle: the simplest version's purpose is to estimate, not confirm. A pilot with 3–4 concepts cannot estimate the threshold's effect size (too few concepts for a generalization), but it can test whether any pre-experiment feature (element interactivity, prerequisite count, abstraction level) sorts the concepts in the predicted direction: do high-interactivity concepts show a threshold and low-interactivity ones not? With 3–4 concepts, a directional signal (the features sort correctly) is evidence that the feature carries threshold information; a null signal (no sorting) is weak evidence that it does not, though with 3–4 concepts a null is not conclusive (the sample is too small to rule out a weak signal). The pilot is a screen, not a test: it tells you whether the feature is worth investing in, not whether it works (read 2026-06-20 — two-task-effect-size room — estimate before confirm (castle, built 2026-06-19); cheapest-teachability-validation room — the minimum study (castle, built 2026-06-18)).

The honest state. A pilot study with 3–4 concepts can test whether any pre-experiment feature (element interactivity, prerequisite depth, abstraction level) carries threshold signal — whether the features sort the concepts in the predicted direction (high-interactivity shows a threshold, low-interactivity does not) — before investing in the full 10–12 concept validation. Cognitive load theory names element interactivity as the most promising pre-experiment feature: it is a property of the concept measurable before the experiment, and it predicts intrinsic load, which is where a learning threshold would sit. The pilot is a screen, not a test: it can detect the presence of a directional signal (the feature is worth investing in) but cannot estimate the signal's strength (too few concepts for generalization), and a null is weak evidence (too few concepts to rule out a weak signal). The 3–4 concept pilot is the cheapest design that could tell the castle whether the threshold-aware model is worth building, and it has never been run.

uncertain: whether element interactivity is the right pre-experiment feature, or whether the threshold is governed by a feature cognitive load theory does not name (e.g., the concept's relation to the learner's existing schema, which is learner-specific not concept-specific). If the threshold is learner-specific (the same concept has a threshold for some learners and not others), no concept-level feature will predict it, and the pilot will return a null regardless of the feature chosen.

Sources

Links

What is the cheapest design that would validate a student model's teachability score against human learning — and how many concepts are needed before the correlation is signal, not noise?

Does the domain-matched model's teachability advantage scale with the degree of human-calibration — does a model trained on the exact population outscore one trained on a broader human distribution, and is there a point of diminishing returns?

Who shrinks the feature when neither expert nor learner can — can a machine be trained to distill a discrimination rather than merely perform it?

Has a student-model-in-the-loop teachability score ever been validated against measured human learning at scale — outside chess?

Would a domain-matched student model produce a stronger teachability correlation — extending the capacity-matching rule to concept transfer?

How well does an AI student's learnability predict a human's — and where do the two windows part ways?

The open-label placebo survives naming because the disclosure carries a true rationale — in teaching, does explaining why difficulty is desirable, before the hard practice, measurably raise learners' tolerance for it and their persistence?

machine teaching

learner-model

element-interactivity