best-practices - Tags

90% of You Don't Need Multi-Agent — Anthropic's Guide to When You Actually Should

SP-172 2026-04-13 · Anthropic Blog

Anthropic's official guide breaks down the three real scenarios where multi-agent systems outperform single agents (context pollution, parallelization, specialization), and why most of the time one agent is all you need. Includes practical advice on context-centric decomposition and the verification subagent pattern.

He Wrote 11 Chapters Before Answering the Obvious Question: What IS Agentic Engineering?

CP-171 2026-03-16 · @simonw on X

Simon Willison's Agentic Engineering Patterns guide now has 12 chapters — but this new one goes at the very beginning. He finally answers 'What is Agentic Engineering?' The answer is surprisingly simple: using coding agents to help build software. The interesting part is why it took 11 chapters of hands-on patterns before he felt ready to define it.

agentic-coding simonw-agentic-patterns simon-willison ai-agents claude-code codex

AI Writing Worse Code? That's Your Choice, Not AI's Fault

CP-172 2026-03-16 · @simonw on X

Simon Willison's Agentic Engineering Patterns, Chapter 3: AI should help us ship better code, not worse. Technical debt cleanup costs near zero now, architecture decisions can be validated with prototypes instead of guesses, and quality compounds over time.

agentic-coding simonw-agentic-patterns simon-willison ai-agents refactoring technical-debt

Four Words That Turn Your Coding Agent Into a Testing Machine

CP-173 2026-03-16 · @simonw on X

Simon Willison's Agentic Engineering Patterns — 'First Run the Tests': every time you start a new session, your first instruction should be to run the test suite. Four words, three ripple effects — the agent learns how to run tests, gauges the codebase size, and automatically shifts into a 'I should maintain tests' mindset.

agentic-coding simonw-agentic-patterns simon-willison ai-agents testing tdd

Simon Willison's Agentic Engineering Fireside Chat: Tests Are Free Now, Code Quality Is Your Choice

CP-169 2026-03-15 · @simonw on X

Simon Willison shared his agentic engineering playbook at the Pragmatic Summit — five tokens to start TDD, Showboat for manual verification, reverse-engineering six frameworks into a standard, and why bad code is a choice you make.

agentic-coding simon-willison simonw-agentic-patterns tdd ai-agents

Treat Codex Like a Teammate, Not a Tool: 10 Best Practices That Actually Work

SP-110 2026-03-10 · @derrickcchoi on X

A guide to Codex best practices from prompting and planning to MCP, Skills, and Automations — building a more reliable agent workflow.

codex ai-agents

AI Wrote 1,000 Lines and You Just... Merged It? Simon Willison Names Agentic Development's Worst Anti-Pattern

CP-146 2026-03-09 · @simonw on X

Simon Willison added an 'Anti-Patterns' section to his Agentic Engineering Patterns guide — and the first entry hits hard: don't submit AI-generated code you haven't personally verified. You're not saving time, you're stealing it from your reviewer. This post covers his principles, what a good agentic PR looks like, and a real terraform destroy horror story.

simon-willison agentic-coding simonw-agentic-patterns code-review anti-patterns ai-agents

Make AI Click the Buttons: Simon Willison's Agentic Manual Testing Fills the Gaps Automated Tests Can't

CP-145 2026-03-08 · @simonw on X

Simon Willison introduces Agentic Manual Testing: let AI agents manually operate code and UI like humans do, catching bugs that automated tests miss. With Playwright, Rodney, and Showboat, the 'tests pass but it's broken' nightmare becomes a thing of the past.

simon-willison agentic-coding simonw-agentic-patterns testing qa ai-agents

Can't Understand AI-Generated Code? Have Your Agent Build an Animated Explanation

SP-90 2026-03-01 · Simon Willison @simonw

Chapter 5 of Simon Willison's Agentic Engineering Patterns: Interactive Explanations. Core thesis: instead of staring at AI-generated code trying to understand it, ask your agent to build an interactive animation that shows you how the algorithm works. Pay down cognitive debt visually.

simonw-agentic-patterns simon-willison agentic-coding cognitive-debt ai-agents claude-code

Everything You've Built Is a Weapon — Simon Willison's 'Hoarding' Philosophy for the Agent Era

SP-88 2026-02-27 · Simon Willison @simonw

Chapter 4 of Simon Willison's Agentic Engineering Patterns: Hoard Things You Know How to Do. Core thesis: every problem you've solved should leave behind working code, because coding agents can recombine your old solutions into things you never imagined.

simonw-agentic-patterns simon-willison agentic-coding ai-agents claude-code knowledge-management

Can't Understand Your AI-Written Code? Linear Walkthroughs Turn Vibe Projects Into Learning Materials

SP-87 2026-02-26 · Simon Willison @simonw

Chapter 3 of Simon Willison's Agentic Engineering Patterns: the Linear Walkthrough pattern. This technique transforms even vibe-coded toy projects into valuable learning resources. Core trick: make the agent use sed/grep/cat to fetch code snippets, preventing hallucination.

simonw-agentic-patterns simon-willison agentic-coding cognitive-debt ai-agents claude-code

Do You Actually Know How to Use AI? Anthropic Tracked 10,000 Conversations to Find Out

SP-83 2026-02-24 · Anthropic @AnthropicAI

Anthropic analyzed 9,830 Claude.ai conversations and defined 11 observable AI fluency behaviors. Key finding: people who iterate show 2x the fluency. But when AI produces beautiful artifacts, users question its reasoning less. The prettier the output, the more dangerous it gets.

claude-code ai-fluency human-ai-collaboration research education

Code Got Cheap — Now What? Simon Willison's Agentic Engineering Survival Guide

SP-80 2026-02-23 · Simon Willison @simonw

Simon Willison launched a new series called Agentic Engineering Patterns — a playbook for working with coding agents like Claude Code and Codex. Lesson one: writing code got cheap, but writing good code is still expensive. Lesson two: 'red/green TDD' is the most powerful six-word spell for agent collaboration.

agentic-coding ai-agents claude-code codex tdd simon-willison simonw-agentic-patterns

OpenAI's Agent Trinity: Skills + Shell + Compaction — A Field Guide

SP-54 2026-02-13 · OpenAI

OpenAI released three primitives for long-running agents: Skills (reusable SKILL.md instruction packs), Shell (hosted container runtime), and Compaction (automatic context compression). Includes 10 battle-tested tips and Glean's production data.

openai agent-skills shell compaction codex

StrongDM's 'Dark Factory': No Humans Write Code. No Humans Review Code. $1,000/Day in Tokens.

CP-40 2026-02-07 · Simon Willison's Blog

StrongDM's AI team built a 'Software Factory' where AI agents write & review code. They clone apps into a 'Digital Twin Universe' for testing, an approach Simon Willison calls radical. At $10k/engineer/day in token costs, is it worth it?

agentic-coding simonw-agentic-patterns software-factory simon-willison strongdm ai-agents