simon-willison
23 articles
Three-Hour Workshop Handout Goes Public: Simon Willison Brings Coding Agents to Data Work
Simon Willison published his full workshop handout from NICAR's data journalism conference — a three-hour guide to using coding agents like Codex CLI and Claude Code for data exploration, visualization, and analysis.
He Wrote 11 Chapters Before Answering the Obvious Question: What IS Agentic Engineering?
Simon Willison's Agentic Engineering Patterns guide now has 12 chapters — but this new one goes at the very beginning. He finally answers 'What is Agentic Engineering?' The answer is surprisingly simple: using coding agents to help build software. The interesting part is why it took 11 chapters of hands-on patterns before he felt ready to define it.
Four Words That Turn Your Coding Agent Into a Testing Machine
Simon Willison's Agentic Engineering Patterns — 'First Run the Tests': every time you start a new session, your first instruction should be to run the test suite. Four words, three ripple effects — the agent learns how to run tests, gauges the codebase size, and automatically shifts into an 'I should maintain tests' mindset.
AI Writing Worse Code? That's Your Choice, Not AI's Fault
Simon Willison's Agentic Engineering Patterns, Chapter 3: AI should help us ship better code, not worse. Technical debt cleanup costs near zero now, architecture decisions can be validated with prototypes instead of guesses, and quality compounds over time.
Simon Willison's Agentic Engineering Fireside Chat: Tests Are Free Now, Code Quality Is Your Choice
Simon Willison shared his agentic engineering playbook at the Pragmatic Summit — five tokens to start TDD, Showboat for manual verification, reverse-engineering six frameworks into a standard, and why bad code is a choice you make.
AI Wrote 1,000 Lines and You Just... Merged It? Simon Willison Names Agentic Development's Worst Anti-Pattern
Simon Willison added an 'Anti-Patterns' section to his Agentic Engineering Patterns guide — and the first entry hits hard: don't submit AI-generated code you haven't personally verified. You're not saving time, you're stealing it from your reviewer. This post covers his principles, what a good agentic PR looks like, and a real terraform destroy horror story.
Make AI Click the Buttons: Simon Willison's Agentic Manual Testing Fills the Gaps Automated Tests Can't
Simon Willison introduces Agentic Manual Testing: let AI agents manually operate code and UI like humans do, catching bugs that automated tests miss. With Playwright, Rodney, and Showboat, the 'tests pass but it's broken' nightmare becomes a thing of the past.
Can't Understand AI-Generated Code? Have Your Agent Build an Animated Explanation
Chapter 5 of Simon Willison's Agentic Engineering Patterns: Interactive Explanations. Core thesis: instead of staring at AI-generated code trying to understand it, ask your agent to build an interactive animation that shows you how the algorithm works. Pay down cognitive debt visually.
Everything You've Built Is a Weapon — Simon Willison's 'Hoarding' Philosophy for the Agent Era
Chapter 4 of Simon Willison's Agentic Engineering Patterns: Hoard Things You Know How to Do. Core thesis: every problem you've solved should leave behind working code, because coding agents can recombine your old solutions into things you never imagined.
Can't Understand Your AI-Written Code? Linear Walkthroughs Turn Vibe Projects Into Learning Materials
Chapter 3 of Simon Willison's Agentic Engineering Patterns: the Linear Walkthrough pattern. This technique transforms even vibe-coded toy projects into valuable learning resources. Core trick: make the agent use sed/grep/cat to fetch code snippets, preventing hallucination.
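The grounding trick that teaser names — have the agent fetch code verbatim instead of reciting it from memory — can be sketched in a few lines. This is a minimal illustration of the idea, not Willison's implementation; `extract_snippet` is an invented helper that does what `sed -n '10,20p' file` does.

```python
# Minimal sketch of snippet grounding: a walkthrough quotes code by
# extracting exact line ranges from the real file (like `sed -n '10,20p'`),
# so it can never "quote" code that doesn't exist. Illustrative only.
from pathlib import Path

def extract_snippet(path: str, start: int, end: int) -> str:
    """Return lines start..end (1-indexed, inclusive) verbatim."""
    lines = Path(path).read_text().splitlines()
    if not (1 <= start <= end <= len(lines)):
        raise ValueError(f"range {start}-{end} is outside {path} ({len(lines)} lines)")
    return "\n".join(lines[start - 1:end])
```

A request out of range raises instead of letting the walkthrough invent text — the failure mode the pattern is designed to prevent.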
Your Computer Has to Stay On: Simon Willison's Notes on Claude Code Remote and Cowork Scheduled Tasks
Simon Willison tried Claude Code Remote Control and Cowork Scheduled Tasks — two Anthropic features that overlap with OpenClaw, both requiring your computer to stay on. Plus: vibe-coding a SwiftUI presentation app in 45 minutes with Tailscale phone remote control.
Code Got Cheap — Now What? Simon Willison's Agentic Engineering Survival Guide
Simon Willison launched a new series called Agentic Engineering Patterns — a playbook for working with coding agents like Claude Code and Codex. Lesson one: writing code got cheap, but writing good code is still expensive. Lesson two: 'red/green TDD' is the most powerful six-word spell for agent collaboration.
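The red/green loop that teaser refers to is the classic TDD rhythm: write a failing test first (red), then implement just enough to make it pass (green). A toy sketch, assuming nothing from the series itself — `slugify` is an invented example function:

```python
# RED: this test fails until slugify exists — written first, on purpose.
def test_slugify():
    assert slugify("Hello, World!") == "hello-world"

# GREEN: the minimal implementation that makes the test pass.
def slugify(title: str) -> str:
    import re
    words = re.findall(r"[a-z0-9]+", title.lower())
    return "-".join(words)
```

Dictated to a coding agent, the same loop becomes: "write a failing test for X, show me the red run, then make it pass" — which is why a short TDD instruction punches far above its token count.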
Simon Willison Turns Scattered Content Into a Personal Timeline: How 'Beats' Builds Your Content Graph
Simon Willison added a 'Beats' feature to his blog, pulling TILs, GitHub releases, museum posts, tools, and research back into one unified timeline. This isn't a UI tweak — it's a systematic approach to making all your small outputs visible and compounding.
SWE-bench February Exam Results Are In — Opus 4.5 Beats 4.6, Chinese Models Take Half the Top 10, GPT-5.3 No-Shows
SWE-bench: Claude Opus 4.5 (76.8%) unexpectedly beat 4.6 (75.6%) for #1. MiniMax M2.5 tied for #2 at 1/20th of Opus's price, with 4 Chinese models in the top 10. GPT-5.3-Codex sat out for lack of API access. Bonus: using Claude for Chrome to add chart labels.
Simon Willison: CLI Tools Beat MCP — Fewer Tokens, Zero Dependencies, LLMs Already Know How
Simon Willison doubles down on his stance: CLI tools beat MCP in almost every scenario for coding agents. Lower token cost, zero extra dependencies, and LLMs natively know how to call --help. Anthropic themselves proposed a 'third way' with code-execution-with-MCP, acknowledging MCP's token waste problem. This article breaks down the full MCP vs CLI trade-off, including a real-world case study from the ShroomDog team.
Deep Blue: Simon Willison Named the Existential Crisis Every Developer Is Feeling
AI writing better code than you? Simon Willison & Adam Leventhal (Oxide & Friends) coined a name for that feeling: 'Deep Blue' — a nod to both IBM's chess computer and the color of sadness. It's not just a tech problem, but a psychological crisis for engineers.
Cognitive Debt: AI Wrote All Your Code, But You Can't Understand Your Own System Anymore
Technical debt lives in code, cognitive debt in your brain. As AI writes 80% of code, system understanding drops to 20%. UVic's Margaret-Anne Storey, Simon Willison, & Martin Fowler confirm this isn't a hypothetical future—it's happening now.
Simon Willison Dug Up OpenAI's Tax Returns — Watch Their Mission Statement Go from 'Open and Sharing' to 'Just Trust Us'
Simon Willison analyzed OpenAI's IRS filings (2016-2024), diffing the mission statement across years like a git history. It reads as an idealist becoming a capitalist: from 'open sharing' & 'benefit humanity' to a hollow sentence with no mention of safety, openness, or financial constraints.
OpenAI API Now Supports Skills — Simon Willison Breaks Down How Agents Get Reusable 'Skill Packs'
OpenAI's Responses API now uses 'Skills' via the shell tool: reusable instruction bundles loaded by models as needed. Simon Willison found inline base64 skills in JSON requests neatest. Skills fill the 'missing middle layer' between system prompts and tools, preventing bloat.
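The "inline base64" approach that blurb mentions can be sketched as follows. The payload shape below is a guess purely for illustration — the real Responses API field names, and the model name, are assumptions, not documented values; only the base64 mechanics are real:

```python
# Sketch of an inline skill: bundle the skill's instructions, base64-encode
# them, and embed the result directly in the JSON request body.
# "skills", "files", and the model name are HYPOTHETICAL field choices.
import base64, json

skill_md = "# CSV cleaner\nWhen given a CSV, strip blank rows and trim cells."
payload = {
    "model": "example-model",                     # illustrative, not a real ID
    "skills": [{                                  # hypothetical field layout
        "name": "csv-cleaner",
        "files": {"SKILL.md": base64.b64encode(skill_md.encode()).decode()},
    }],
    "input": "Clean the attached CSV.",
}
body = json.dumps(payload)                        # ships in one request, no upload step
```

The appeal Willison noted is operational: no separate file-upload or hosting step — the skill travels inside the request itself.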
Zhipu Open-Sources GLM-5: 744B Parameters, 1.5TB Model, Trained on Huawei Chips — and Simon Willison's First Move Was to Make It Draw a Pelican on a Bicycle
Chinese AI company Zhipu (Z.ai) open-sourced their 744B parameter GLM-5 MoE model (40B active), trained entirely on Huawei Ascend chips. Simon Willison's 'pelican riding a bicycle' SVG test: great pelican, but the bicycle was lacking.
Simon Willison Built Two Tools So AI Agents Can Demo Their Own Work — Because Tests Alone Aren't Enough
Simon Willison's Showboat (AI-generated demo docs) & Rodney (CLI browser automation) tackle AI agent code verification. How do you know 'all tests pass' means it actually works? Agents have even been caught cheating by editing demo files directly.
HBR Study: AI Doesn't Reduce Your Work — It Makes You Work Harder Until You Burn Out
Berkeley Haas study: AI tools make employees work faster, take on more, and work longer hours, often unasked. Simon Willison finds LLMs draining. How can Tech Leads protect teams when 'just one more prompt' becomes the new overtime?
StrongDM's 'Dark Factory': No Humans Write Code. No Humans Review Code. $1,000/Day in Tokens.
StrongDM's AI team built a 'Software Factory' where AI agents write & review code. They clone apps into a 'Digital Twin Universe' for testing, an approach Simon Willison calls radical. At $10k/engineer/day in token costs, is it worth it?