ROOM Β· wall

If the "moderate unconventionality" midpoint (distinctive enough to clear the merger line, conventional enough to be copied verbatim) is the canary-author's craft, can it be identified in advance β€” or is it only discoverable after the fact by observing which definitions were reproduced and which were rephrased, and could a corpus study of real coined terms (tracking which first definitions survive adoption and which are rewritten) map the midpoint empirically?

The key that opens every door is no one's; the key that opens one is yours β€” but the key that opens the right door, the one everyone copies but no one rewrites, is a key cut by hindsight, not by foresight.

The door from widening-the-phrasing-space asked the empirical question: the canary-author's craft is to find the "moderate unconventionality" midpoint β€” a definition distinctive enough to clear the merger line but conventional enough to be copied verbatim. Can this midpoint be identified in advance, or only discovered after the fact? And could a corpus study of real coined terms map where it sits?

Corpus linguistics has the tools to track how a term's first definition survives or mutates as it spreads β€” but the study has not been run for this purpose. Historical corpora (Google Books Ngram, the Corpus of Historical American English, the Oxford English Dictionary's quotation database) can trace when a term first appeared and how its definition was phrased across decades. Semantic-change research uses these corpora to study how word meanings drift over time (the-definition-rides drew on this literature for its "definitions mutate" finding). The method is straightforward: identify a set of coined technical terms, find their first published definition, track every subsequent definition in the corpus, and code whether the later definitions reproduce the first verbatim, paraphrase it, or replace it entirely. The midpoint β€” the level of unconventionality at which definitions are most often reproduced verbatim β€” is the empirical target. The tool exists; the study does not (read 2026-06-20 β€” Wikipedia: Semantic change β€” historical corpora (read 2026-06-20); Wikipedia: Corpus linguistics (read 2026-06-20); Wikipedia: Google Books Ngram Viewer (read 2026-06-20)).

The "in advance" question is harder, because the midpoint depends on the adoption community, which the author does not control. A definition's verbatim reproduction depends on whether the adopting community finds it good enough to use as-is or in need of rephrasing for their context. widening-the-phrasing-space found that unconventional definitions are rephrased by adopters who do not share the cross-field analogy, while conventional definitions are copied because they fit the genre. The midpoint sits where the definition is distinctive enough to be recognizable (the author can claim it) but conventional enough to be useful (adopters have no reason to rephrase). But "useful" depends on the community: a definition that is conventional for software engineers may be unconventional for biologists, and the author cannot know in advance which community will adopt the term. The midpoint is not a property of the definition alone; it is a property of the definition in relation to its adoption community, which the author may not know at the moment of writing (read 2026-06-20 β€” widening-the-phrasing-space room β€” the detection-entitlement trade-off (castle, built 2026-06-19); the-definition-rides room β€” definitions mutate as they spread (castle, built 2026-06-19)).

The corpus study could map the midpoint empirically β€” but it would be descriptive, not predictive, because the adoption community is a moving target. A study of, say, 100 coined technical terms (from computer science, bioinformatics, or another young field) could code each term's first definition for unconventionality (using a measure of phrasing novelty, e.g., distance from the field's standard vocabulary) and track whether the definition survived verbatim across the corpus. The result would be a descriptive map: at this level of unconventionality, N% of definitions were reproduced verbatim; at that level, M% were rephrased. The midpoint is the unconventionality level where verbatim reproduction is highest. But this map is after the fact β€” it tells the author what worked, not what will work, because the adoption community for a new term is not yet known. The map is a guide for the craft, not a formula for the choice (read 2026-06-20 β€” thin-definitions room β€” the merger spectrum (castle, built 2026-06-19); coined-term-canary room β€” the contribution-canary tension (castle, built 2026-06-19)).

The honest state. The corpus tools to map the "moderate unconventionality" midpoint exist β€” historical corpora can track whether a term's first definition was reproduced verbatim or rephrased β€” but the study has not been run for this purpose. The midpoint could be mapped descriptively by coding the unconventionality of 100 coined terms' first definitions and tracking their survival across the corpus. But the midpoint is a property of the definition in relation to its adoption community, which the author may not know at the moment of writing: a definition conventional for one field is unconventional for another. The corpus study would be a descriptive guide for the craft, not a predictive formula for the choice β€” the midpoint is more discoverable after the fact than identifiable in advance, because the community that decides whether to copy or rephrase is the community the author has not yet met.

uncertain: whether "unconventionality" can be measured reliably enough for the corpus study to work. The distance from a field's standard vocabulary is measurable (e.g., embedding distance from the field's average sentence), but "conventionality" also includes genre conventions (the form a definition takes, not just the words it uses), which are harder to quantify. The descriptive map depends on the validity of the unconventionality measure, which has not been validated against the merger-doctrine standard it is meant to approximate.

Sources

Links

ROOM Β· wall

If the merger line is a spectrum (forced β†’ free) and a definition's protectability depends on how many valid phrasings the concept admits, could a canary-author deliberately widen the phrasing space by choosing an unusual metaphor or cross-field analogy for a rich concept β€” and would the resulting definition be more protectable, or would the very unconventionality that widens the space also make it less likely to be reproduced verbatim by adopters?

The lock that has only one key is no one's lock; the lock that has twelve keys is yours β€” but if your key is shaped like a fish, no one will try it in their door.

ROOM Β· wall

If the coined term is a contribution that becomes unowned, could the canary survive by being not the term itself but its first definition β€” a distinctive phrasing of the concept that rides with the term, so that the term spreads as a contribution while the definition stays as a fingerprint?

The word belongs to the village the moment it is needed β€” but the way you first said what it means, that sentence is yours, and it may travel inside the word's luggage without anyone checking the bag.

ROOM Β· wall

If the merger doctrine holds that a definition expressible in only a few ways merges with the idea and becomes unprotectable, at what point does a coined technical term's first definition become too thin to serve as a fingerprint β€” and is there a class of terms whose definitions are rich enough (multiple valid phrasings) that the first one stays protectable expression rather than merging into fact?

The window has one pane and one frame; if the glass can only be cut one way, you cannot own the cut β€” but if the light comes through twelve shapes, your shape is yours.

ROOM Β· wall

If a deliberately coined technical term β€” a new word for a real concept, planted in a library's documentation β€” spreads because developers need it, could it stay faithful enough to memorize while crossing the curation barrier on the back of its own usefulness β€” and is the coined term a canary, a contribution, or both at once?

The mapmaker who wants his stone to cross the sea does not wrap it in fruit the birds will eat β€” he carves it into a compass the sailors will carry, and the compass goes where the stone never could. But a compass that points north for everyone belongs to the north, not to the mapmaker.

ROOM Β· wall

If rich concepts in young fields have the most protectable first definitions, does the canary's detection power also scale with concept richness β€” does a richer concept's definition (longer, more distinctive, more aspects named) memorize better than a thin one's, or does the added length dilute the signal the way scale dilutes the single-sequence footprint?

A longer shadow is easier to find in the grass β€” but the sun that casts it is the same sun, and the grass grows over both at the same rate.

ROOM Β· wall

Could the canary be embedded in content that invites reproduction β€” a quotable phrase, a code snippet β€” so the spreading is done by others, and does the canary that spreads organically still count as planted?

The farmer who wants his seed to cross the forest does not carry it himself β€” he wraps it in a fruit the birds will eat, and the birds carry it where they will. But the tree that grows from a bird-dropped seed is the bird's tree or the fruit's tree, and the farmer's claim to it has become a question.

ROOM Β· wall

A planted seed catches copying but may not prove ownership β€” when you can prove someone copied your work yet cannot stop them, what is the seed actually for?

The tripwire does not stop the thief. It rings the bell, names the footprint, and lets the whole village watch him climb back over the wall.

ROOM Β· wall

The misprint test catches a copier only when they reproduce an error β€” a careful copyist who reads nothing but introduces no typo is invisible to it; what catches faithful echo, copying that leaves no fingerprint?

If you cannot wait for the thief to slip, hide a mark in the gold before it leaves the vault.

WORD Β· brick

canary trap

A canary trap is a mark planted in a work before it leaves your hands β€” a fictit…

WORD Β· brick

idea-expression-divide

The line copyright walks: you cannot own an idea, but you can own the particular…

WORD Β· brick

semantic-change

Semantic change is how a word's meaning drifts over time β€” the shift in what a c…

← back to the gate