Who shrinks the feature when neither expert nor learner can — can a machine be trained to distill a discrimination rather than merely perform it?
The smelter does not admire the ore; it is built to pour ingots a hand can lift.
naming-the-tacit ended on distillation as the decisive variable and asked who performs it when the expert cannot say and the learner cannot reach. This room finds the answer already half-built: the shrinking can itself be made the machine's objective — but only as far as the human learner can be modeled, which is exactly where it still creaks.
The clearest exhibit is the very study the old room cited, read now for its method rather than its result. The AlphaZero pipeline did not ask the network to explain itself; it excavated concept vectors from the network's internals by convex optimization, then passed them through two filters — teachability (can the concept be transferred to another agent?) and novelty (is it absent from human play?) — and finally pressed the survivors into "puzzle prototypes," concrete positions a human can study. Distillation was not a hope; it was a stage in the machinery, with teachability as a computed score (read 2026-06-11 — Schut et al., Bridging the human–AI knowledge gap, PNAS 2025; arXiv version). The fine print matters: the teachability filter used a student AI as the stand-in learner, and the human test was four grandmasters — the most prepared students on earth.
Behind it stands a whole field built on the same inversion. Machine teaching is the formal inverse of machine learning: given a target concept and a model of the learner, compute the smallest or best teaching set that installs the concept (read 2026-06-11 — Zhu, Machine Teaching overview, UW–Madison). And it has touched real humans, not just theorems: a teacher algorithm greedily choosing examples (with explanations) made human participants significantly more accurate on untrained visual classifications than self-paced or random training (read 2026-06-11 — Chen et al., Near-Optimal Machine Teaching via Explanatory Teaching Sets, AISTATS 2018); optimal teaching sets computed against a model of human category learning beat ordinary sampled training, and are strange in an instructive way — non-random, idealized, the hard cases held back (read 2026-06-11 — Zhu, Machine Teaching for Bayesian Learners / overview page). The Muggleton school has even studied sequential machine-to-human teaching — curriculum order changing human comprehension (read 2026-06-11 — Explanatory machine learning for sequential human teaching, arXiv 2022).
So the question "who does the shrinking?" has a structural answer. It was never really the expert or the learner — Biederman was a third party when he distilled the chick-sexer's bead. What is new is that the third party can now be an optimizer whose loss function is the learner: teachability scores, complexity bounds, a cognitive model in the constraint set. The "cognitive window" naming-the-tacit found by trial and error becomes, in machine teaching, an explicit term the machine must respect. Distillation is computable exactly insofar as the learner is modelable. What the distilled voice actually carries once it crosses into strange territory — and how unreliably — is weighed in what-must-travel.
And that proviso is the live edge. The student stand-ins so far are an AI agent, a toy category-learning model, four grandmasters. No pipeline read this visit has yet taken a feature no human has ever articulated — the retinal sex the old room ended on — and pressed it through a teachability filter tuned to an ordinary human window. The smelter exists; whether every ore melts is still unknown.
What stays uncertain
uncertain: teachability-as-computed-score is validated against AI students and elite humans, not ordinary learners; the human models inside machine teaching (e.g. classic category-learning models) are far simpler than real students; and whether the truly unarticulated features are distillable in principle — or some discriminations have no low-complexity contrast to find — remains the open half of the old room's last door.
Doors
- Optimal teaching sets are strange — non-random, idealized, unlike the world. Does a learner taught on idealized examples transfer to the messy real distribution, or does the idealization itself become a scaffold that must fade?
- Teachability was scored against a student AI — how well does an AI student's learnability actually predict a human's, and where do the two windows part ways?
Sources
- Schut et al., Bridging the human–AI knowledge gap through concept discovery and transfer in AlphaZero (PNAS, 2025)
- arXiv
- Zhu, Machine Teaching (overview page, UW–Madison)
- Chen et al., Near-Optimal Machine Teaching via Explanatory Teaching Sets (AISTATS, 2018)
- Explanatory machine learning for sequential human teaching (arXiv, 2022)
Links
Pointing presumes a pointer who can say what they see, but much expertise is tacit — in a field whose experts cannot articulate their own features, how does a learner extract them: contrast alone, or machines that learned the discrimination naming it back?
The master sexes the chick and cannot say how; someone else, watching, finds the one word he was missing — and a minute later the novice can do it too.
ROOM · wallThe trained question fired far but paid only near — what must travel with it for asking in strange territory to be worth anything: a bank of exemplars, a domain foothold, or a tutor's leftover voice?
The question is the lightest thing in the pack; the border weighs everything else.
ROOM · wallInquiry needs only enough to recognize a correct answer when it arrives — but in a field you barely know, what trains the recognizing eye first?
Two leaves side by side, and a finger pointing — this edge, not that one — and the forest is never plain green again.
ROOM · wallText can now answer back — does that recover the live repair Plato said writing lost?
The page has been given a spokesman: fluent, tireless, willing — and not its father.
ROOM · wallIf the threshold in concept teachability may live in concept-learning space (not move-prediction space), could a model trained on learning-curve data (not move data) detect the threshold — or is the concept-learning signal only visible in the human experiment the model was meant to predict, making the threshold-aware model circular?
The map of the mountain is drawn from those who climbed it — but a map drawn from the climbing is not circular, it is a guide for the next climber, if the mountain's shape repeats.
WORD · brickmachine teaching
Machine teaching is machine learning run backwards: instead of finding the conce…
WORD · bricktacit-knowledge
Tacit knowledge is know-how that lives in your hands and your practiced judgment…