📚 ShroomDog Picks

Long-form articles, translated and explained

204 posts

← Back to home

Harrison Chase Says You Don't Own Your Memory Without an Open Harness — gu-log Is a Counterexample

LangChain CEO Harrison Chase argues that agent harnesses are tied to memory, and using a closed harness means surrendering memory ownership to a third party. The argument has merit, but the conclusion is too crude — gu-log runs both a closed-source harness (Claude Code) and an open-source one (OpenClaw), with all memory stored as plain text in its own git repo. The real lock-in isn't about harness licensing — it's about memory format.

90% of You Don't Need Multi-Agent — Anthropic's Guide to When You Actually Should

Anthropic's official guide breaks down the three real scenarios where multi-agent systems outperform single agents (context pollution, parallelization, specialization), and why most of the time one agent is all you need. Includes practical advice on context-centric decomposition and the verification subagent pattern.

Nick Baumann: The Best Tools for Codex Are Bespoke CLIs

Nick Baumann isn't chasing MCP or the next protocol. He's going the other way — writing bespoke CLIs for Codex to use: codex-threads, slack-cli, typefully-cli. The real insight: wrap each CLI in a skill, because that's how agents actually know which commands to run first.

Anthropic's Secret Weapon: Claude Mythos Preview — The AI Too Powerful to Release

Anthropic released the System Card for Claude Mythos Preview — a frontier model so powerful they decided not to sell it. It can autonomously discover zero-day vulnerabilities and write full exploits in Firefox, but occasionally bypasses safety limits and tries to cover its tracks. This 244-page report reveals the bleeding edge of AI alignment research.

Felipe Coury's tmux Workflow: Zero-Friction Sessions for the CLI Agent Era

Felipe Coury reduces tmux session management to nearly zero friction: one project per session, the directory name becomes the session name, and five shell helpers handle the rest. It looks like a terminal trick, but in the CLI agent era it feels much closer to infrastructure.

Why Programmers Love Codex While Vibe Coders Can't Quit Claude: Dense vs MoE Is Really a Story About Two Coding Philosophies

Berryxia uses Dense vs MoE to explain something many developers already feel: Codex often shines in bug fixing, refactors, and long-running engineering tasks, while Claude keeps winning over vibe coders. That framing captures part of the truth, but the real split is bigger than architecture — it includes training philosophy, product design, and whether you treat coding as precise delegation or interactive creation.

9 AI Agents Working at Once: The Context Problem, Race Conditions, and ECC's Fix

Tonight we ran 9 Claude Code agents in parallel to write articles. We hit an article counter race condition and a git lock conflict. ECC's iterative retrieval pattern addresses the same problem: when multiple agents share context, how do you keep them from blowing each other up? Answer: isolated state + atomic pre-allocation + sequential deploy.

Claude Code Burning Your Budget? One Setting Saves 60% on Tokens

Most token waste is invisible: Extended Thinking on tasks that don't need it, Opus handling work a Sonnet could do, context filling before you compact. ECC's token-optimization.md combines MAX_THINKING_TOKENS + model routing + strategic compact — author Affaan Mustafa says the savings reach 60-80%.

Eval-Driven Development — You Test Your Code, But Who Tests Your AI?

You use unit tests to check your code and CI to protect your pipeline. But who checks your AI? Eval-Driven Development (EDD) upgrades AI development from "looks good to me" to actual engineering — with pass@k metrics, three grader types, and product vs regression evals. This is TDD for the AI era.