token-optimization
2 articles
Claude Code Burning Your Budget? One Setting Saves 60% on Tokens
Most token waste is invisible: Extended Thinking on tasks that don't need it, Opus handling work a Haiku could do, context filling before you compact. ECC's token-optimization.md combines MAX_THINKING_TOKENS + model routing + strategic compact — author Affaan Mustafa says the savings reach 60-80%.
Cut Token Costs by 75%: A Practical Guide to System Prompt Layering
An AI Agent burns 34,500 tokens of system prompt every single conversation turn. The author used layered loading (always-on vs on-demand) plus a dual-model strategy to cut monthly costs from $568 down to $120-150 — a 75% reduction. Full breakdown with real numbers inside.