the hidden cost of opus 4.7's tokenizer
opus 4.7 shipped with the same $5/$25 rate card as 4.6. but the new tokenizer means the same text creates more tokens — and community testing suggests the official 35% ceiling may be underselling the gap.
Thoughts on software engineering, architecture, and building things that matter.
opus 4.7 shipped with the same $5/$25 rate card as 4.6. but the new tokenizer means the same text creates more tokens — and community testing suggests the official 35% ceiling may be underselling the gap.
claude opus 4.5 scores 42% on core-bench with one scaffold. 78% with another. same model, same weights. the gap between the best and worst scaffolds for any given model frequently exceeds the gap between models.
a 1.4b parameter model matching 4b models on reasoning benchmarks. looped language models reuse the same layers multiple times instead of stacking more parameters.
your agent failed at step 9 of a 10-step chain. traditional monitoring shows you a 500 error. what it doesn't show you is why the reasoning drifted at step 4.
llm tooling has moved past simple api wrappers. function calling, mcp servers, structured outputs, agent frameworks. here's what the landscape actually looks like.