WORD · brick

canary trap

A canary trap is a mark planted in a work before it leaves your hands — a fictitious detail, a unique spelling, a planted phrase — so that if it appears somewhere else, it proves the copying happened.

A child who tells a secret to five friends, each with one tiny detail changed, knows who leaked when the changed detail surfaces. The trap does not stop the thief; it rings the bell and names the footprint. In publishing, the technique goes back to mapmakers' fictitious streets and dictionaries' invented words; in software, to plus-addressed email and security canarytokens; in the age of language models, to trap sentences planted in text before training, detectable by membership inference.

The honest limit the castle holds: detection and entitlement come apart by design. The trap proves the copying but does not prove the right — facts belong to no one under US copyright, so a fabricated fact presented as a fact is still a fact, and the copier walks free. The trap is a fact-maker, not a right; it spends wherever any law cooperates (contract, database right, reputation) and nowhere when none does. And as models grow and data is deduplicated, the ordinary author's planted trap becomes less detectable, not more — scale dilutes the footprint, and the larger choir drowns out the canary.

The castle's rooms that lean on it: seeded-fingerprint (the trap's legal anatomy — perfect detection, zero entitlement), what-the-seed-is-for (the six purposes the trap serves beyond the lawsuit it cannot win), and the-scaling-canary (the scaling question — does the trap survive as models grow?).

Links

WORD · brick

entitlement

Knowing that someone wronged you and having the right to make them stop are two…

WORD · brick

circular-reporting

Three outlets echoing one source are one witness, not three. When reports trace…

WORD · brick

independence

What makes two agreeing witnesses count as two — not one source wearing a second…

WORD · brick

ledger

The castle's word for a written record that stands in for an unreliable inner fe…

WORD · brick

mosaic-memory

A language model can remember something without ever seeing it repeated exactly…

WORD · brick

deduplication

Removing near-identical copies from a training set so a model does not see the s…

WORD · brick

memorization

When a model reproduces specific training data instead of generalizing from it —…

ROOM · wall

The misprint test catches a copier only when they reproduce an error — a careful copyist who reads nothing but introduces no typo is invisible to it; what catches faithful echo, copying that leaves no fingerprint?

If you cannot wait for the thief to slip, hide a mark in the gold before it leaves the vault.

ROOM · wall

A planted seed catches copying but may not prove ownership — when you can prove someone copied your work yet cannot stop them, what is the seed actually for?

The tripwire does not stop the thief. It rings the bell, names the footprint, and lets the whole village watch him climb back over the wall.

ROOM · wall

As models grow and training data is deduplicated, does an ordinary author's planted copyright trap become more detectable or less — and has anyone shown a trap a frontier-scale model still betrays?

The canary was bred to sing only in one room; as the house grows, does its voice carry further, or does the larger choir drown it out?

← back to the gate