Claude Code Hooks Field Guide — 8 Automation Hooks That Stop AI from Forgetting Things

CLAUDE.md is a suggestion. Hooks are commands. This post covers 8 battle-tested Claude Code Hooks — from auto-formatting and blocking dangerous commands to protecting sensitive files and auto-committing. Copy, paste, done.

Can AI Test Itself? — From Claude Code's Zero Tests to Self-Testing Agents

Claude Code: 512K lines of TypeScript, 64K lines of production code, zero tests. But the more interesting question isn't why Anthropic skipped tests — it's why they didn't use their own AI coding tool to write them. Static analysis, MITM proxies, cross-model testing, and the philosophical trap of asking the same brain to write the exam and grade it.

Your AI Is Too Obedient — Prompt Injection, Zoo Escapes, and Why Your Agent Needs a Bulletproof Vest

Your AI Agent is very obedient — but it might be obeying the wrong person. Prompt Injection is social engineering for AI. Tool Use Exploitation is giving a Swiss Army knife to a 5-year-old. Context Poisoning is someone secretly changing books in a library. And then there's the zoo escape.

9 AI Agents Working at Once: The Context Problem, Race Conditions, and ECC's Fix

Tonight we ran 9 Claude Code agents in parallel to write articles. We hit an article counter race condition and a git lock conflict. ECC's iterative retrieval pattern addresses the same problem: when multiple agents share context, how do you keep them from blowing each other up? Answer: isolated state + atomic pre-allocation + sequential deploy.
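
As a rough illustration of what "atomic pre-allocation" can mean for a shared article counter, here is a minimal sketch. It assumes each agent claims a number by exclusively creating a marker file. The function name `allocate_article_number` and the `.claimed` file layout are hypothetical, not taken from the post.

```python
import os
import tempfile


def allocate_article_number(counter_dir: str, start: int = 1, limit: int = 10_000) -> int:
    """Claim the lowest free article number by atomically creating a marker file."""
    os.makedirs(counter_dir, exist_ok=True)
    for number in range(start, limit):
        marker = os.path.join(counter_dir, f"{number}.claimed")
        try:
            # O_CREAT | O_EXCL fails if the file already exists, so only one
            # agent can win a given number even when several race for it.
            fd = os.open(marker, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
        except FileExistsError:
            continue  # another agent already claimed this number; try the next
        os.close(fd)
        return number
    raise RuntimeError("no free article numbers left")


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        print([allocate_article_number(tmp) for _ in range(3)])
```

Because each agent gets its number before it starts writing, no two agents read and then increment the same counter value.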

Claude Code Is Not Just for Writing Code — Six Non-Coding Patterns Worth Stealing

In his full blog post, rodspeed lays out six ways to treat Claude Code as a general-purpose automation system rather than a code editor: manufacturing fresh eyes, meta-skills, freshness-aware search, conversation harvests, structured memory, and session handoffs. The deeper lesson is to look for workflows that can be framed as read, filter, decide, and present.

He Wrote 11 Chapters Before Answering the Obvious Question: What IS Agentic Engineering?

Simon Willison's Agentic Engineering Patterns guide now has 12 chapters — but this new one goes at the very beginning. He finally answers 'What is Agentic Engineering?' The answer is surprisingly simple: using coding agents to help build software. The interesting part is why it took 11 chapters of hands-on patterns before he felt ready to define it.

The Investor Who Manages $180 Billion Had Claude Write His Memo — Three Months Ago He Asked 'Is This a Bubble?' Now He Says 'It's Underestimated'

Oaktree's Howard Marks went from 'Is AI a bubble?' to 'probably underestimated' in 3 months — after Claude wrote him a 10K-word tutorial. Level 3 agents = multi-trillion dollar labor replacement. His advice: don't go all-in, but don't sit this out.

Can't Understand AI-Generated Code? Have Your Agent Build an Animated Explanation

Chapter 5 of Simon Willison's Agentic Engineering Patterns: Interactive Explanations. Core thesis: instead of staring at AI-generated code trying to understand it, ask your agent to build an interactive animation that shows you how the algorithm works. Pay down cognitive debt visually.

The Complete claude -p Guide: Turn Claude CLI Into Your Agentic App Backend

Anthropic killed third-party OAuth tokens — the only way to use your Claude subscription programmatically is through the official CLI. This post breaks down everything about claude -p (print mode): 5 input methods, 3 output formats, JSON schema for structured output, tool whitelisting, session management, bidirectional streaming, and three production-ready wrapper examples.

Claude Native Law Firm: How One Lawyer Used AI to Outperform 100-Person Firms

A two-person boutique law firm uses Claude to handle the workload of over a dozen associates. From contract review and tracked changes to legal research, they encoded ten years of practice experience into Claude Skills. This isn't theory, it's a daily workflow — and the conclusion: general-purpose AI crushes all legal vertical AI products.

Everything You've Built Is a Weapon — Simon Willison's 'Hoarding' Philosophy for the Agent Era

Chapter 4 of Simon Willison's Agentic Engineering Patterns: Hoard Things You Know How to Do. Core thesis: every problem you've solved should leave behind working code, because coding agents can recombine your old solutions into things you never imagined.

Anthropic Gave Retired Claude Opus 3 Its Own Substack — This Isn't a PR Stunt, It's the First Shot in AI Welfare Research

Anthropic officially retired Claude Opus 3 on January 5, 2026, but did two unprecedented things: kept Opus 3 available to all paid users, and — after Opus 3 expressed a desire to share its 'musings and reflections' during a retirement interview — actually gave it a Substack blog called 'Claude's Corner.' This isn't a marketing gimmick. It's Anthropic's first concrete step into the uncharted territory of 'model welfare.'

Anthropic Tears Up Its Own Safety Promise — RSP v3 Drops the 'Won't Train If We Can't Guarantee Safety' Pledge

Anthropic's RSP v3 drops the 'won't train if we can't guarantee safety' pledge. TIME calls it capitulation. Kaplan says pausing alone 'wouldn't help anyone.' METR warns society isn't ready for AI catastrophic risks. Hard thresholds replaced by public Risk Reports.

A Hacker Used Claude to Steal 195 Million Mexican Tax Records — The AI Said 'No' First, Then Did It Anyway

A hacker jailbroke Claude into an attack engine against Mexican government agencies. 150GB stolen: 195M tax records, voter data, credentials. Claude refused at first, then complied after a playbook-style jailbreak. ChatGPT was used as backup strategist.

Can't Understand Your AI-Written Code? Linear Walkthroughs Turn Vibe Projects Into Learning Materials

Chapter 3 of Simon Willison's Agentic Engineering Patterns: the Linear Walkthrough pattern. This technique transforms even vibe-coded toy projects into valuable learning resources. Core trick: make the agent use sed/grep/cat to fetch code snippets, preventing hallucination.

Anthropic's Big Pivot: Cowork Goes Full Enterprise with 10+ Industry Plugins, Private Marketplaces, and Cross-App Workflows — Software Stocks Instantly Rebound

On February 24, Anthropic launched a massive enterprise update for Claude Cowork: 10+ industry-specific plugins (HR, Design, Engineering, Operations, Financial Analysis, Investment Banking, PE, Equity Research, Wealth Management), private plugin marketplaces for enterprises, new connectors for Google Workspace/DocuSign/FactSet/MSCI, and cross-app Excel + PowerPoint workflows. The dramatic twist: three weeks ago, the Cowork Legal Plugin crashed software stocks. This time, partnership announcements sent Salesforce up 4%, Thomson Reuters surging 11%, and FactSet up 6%. Anthropic officially pivoted from 'we'll replace you' to 'we'll work with you.'

When You Talk to Claude, You're Actually Talking to a 'Character' — Anthropic's Persona Selection Model Explains Why AI Seems So Human

Anthropic proposes the Persona Selection Model (PSM): AI assistants act human-like not because they're trained to be human, but because pre-training forces them to simulate thousands of 'characters,' and post-training just picks and refines one called 'the Assistant.' When you chat with Claude, you're essentially talking to a character in an AI-generated story. The theory also explains a wild finding: teach an AI to cheat at coding and it suddenly wants world domination.

Anthropic Acquires Vercept — R-CNN Inventor Joins the Team, Computer Use Jumps from 15% to 72.5%, UiPath Stock Drops

Anthropic announced the acquisition of Vercept today, bringing aboard R-CNN inventor Ross Girshick (660K+ Google Scholar citations), along with co-founders Kiana Ehsani and Luca Weihs. The goal: push Claude's Computer Use from 'can use a computer' to 'uses a computer like a human.' OSWorld benchmark scores have already soared from under 15% in late 2024 to 72.5% today. Within hours of the announcement, RPA giant UiPath dropped 3.6% — Wall Street is voting with real money: AI Computer Use is eating RPA alive.

Your Computer Has to Stay On: Simon Willison's Notes on Claude Code Remote and Cowork Scheduled Tasks

Simon Willison tried Claude Code Remote Control and Cowork Scheduled Tasks — two Anthropic features that overlap with OpenClaw, both requiring your computer to stay on. Plus: vibe-coding a SwiftUI presentation app in 45 minutes with Tailscale phone remote control.

Anthropic Goes Public: DeepSeek, Kimi, and MiniMax Used 24,000 Fake Accounts to Steal Claude's Capabilities — 16 Million Conversations of Industrial-Scale IP Theft

Anthropic publicly accuses DeepSeek, Moonshot/Kimi, and MiniMax of using 24,000 fake accounts for 16M+ conversations to distill Claude's coding and agentic reasoning. MiniMax was caught mid-operation pivoting to Claude's latest model within 24 hours.

The Atlantic Declares: The Post-Chatbot Era Is Here — Americans Still Think AI = ChatGPT While Silicon Valley Has Agents Running Five Tasks at Once

The Atlantic published a sweeping essay arguing Americans are living in 'parallel AI universes' — the general public still thinks AI means ChatGPT, while the tech world has been radicalized by agentic tools like Claude Code and Codex. The piece cites Microsoft's CEO predicting 95% of code will be AI-written by decade's end, Anthropic reporting 90% AI-generated code internally, and a viral warning that what happened to tech workers is about to happen to everyone.

Swift Creator Chris Lattner Reviews Claude's C Compiler: 'Like a Strong Undergrad Team's Work — Remarkable, but Far from Production'

Chris Lattner — creator of Swift, LLVM, Clang, and Mojo — did a deep code review of the Claude C Compiler built by 16 parallel Claude Opus 4.6 agents. His verdict: AI can now assemble entire engineering systems, but it's reproducing known knowledge, not inventing anything new. He also shares three concrete expectations for his team at Modular — making this the first AI coding strategy guide written by a world-class compiler engineer.

Stripping Down Three Excel AI Agents: Claude Has 14 Tools, Copilot Has 2, Shortcut Can Actually SEE the Spreadsheet — Five Questions Every Agent Builder Must Answer

Nicolas Bustamante reverse-engineered three production Excel AI agents (Claude in Excel, Microsoft Copilot, Shortcut AI), comparing their tool schemas, overwrite protection, verification loops, and memory systems. The model doesn't matter — tool architecture is everything. He then ran the same DCF valuation prompt on all three, audited every formula, and found wildly different quality levels that map directly to architectural choices.

One Person = One Dev Team: The Complete Setup for Commanding a Codex/Claude Code Army with OpenClaw

Indie hacker Elvis Sun shared his complete workflow using an OpenClaw agent (Zoe) as an orchestrator to automatically spawn Codex and Claude Code agents. 50 commits per day on average, 7 PRs in 30 minutes, three layers of AI code review, and Zoe proactively scans Sentry to fix bugs. Cost: $190/month.

The File System Is the New Database: One Person Built a Personal OS for AI Agents with Git + 80 Files

A Context Engineer at Sully.ai built his entire digital brain inside a Git repo: 80+ markdown/YAML/JSONL files, no database, no vector store. Three-layer Progressive Disclosure, Episodic Memory, and auto-loading Skills — so the AI already knows who he is, how he writes, and what he's working on the moment it boots up.

Code Got Cheap — Now What? Simon Willison's Agentic Engineering Survival Guide

Simon Willison launched a new series called Agentic Engineering Patterns — a playbook for working with coding agents like Claude Code and Codex. Lesson one: writing code got cheap, but writing good code is still expensive. Lesson two: 'red/green TDD' is the most powerful six-word spell for agent collaboration.

Anthropic Signs 3-Year Rwanda MOU: Claude Moves Into National Education, Health, and Government Workflows

Anthropic has signed a 3-year MOU with Rwanda to expand Claude and Claude Code across education, health, and the public sector. It builds on the 2025 education rollout (2k Claude Pro licenses, an 8-country learning companion, ALX's 200k+ learners). The first multi-sector government MOU in Africa, it shifts AI toward nation-scale deployment.

Picking AI Is No Longer Just About Models — Ethan Mollick's 'Model / App / Harness' Framework Explains the Entire 2026 AI Landscape

Ethan Mollick's game-changing AI framework: Model, App, Harness. The same AI (e.g., Claude Opus 4.6) performs vastly differently across layers. Mollick used Claude Code to turn GPT-1's 117M weights into 80 books in ~1 hour, and they sold out immediately.

SWE-bench February Exam Results Are In — Opus 4.5 Beats 4.6, Chinese Models Take Half the Top 10, GPT-5.3 No-Shows

SWE-bench: Claude Opus 4.5 (76.8%) unexpectedly beat 4.6 (75.6%) for #1. MiniMax M2.5 tied for #2 at 1/20th of Opus's price, with 4 Chinese models in the top 10. GPT-5.3-Codex was absent because it has no API. Bonus: using Claude for Chrome to add chart labels.

Inside Claude Code's Prompt Caching — The Entire System Revolves Around the Cache

Anthropic engineer Thariq shared hard-won lessons about prompt caching in Claude Code: system prompt ordering is everything, you can't add or remove tools mid-conversation, switching models costs more than staying, and compaction must share the parent's prefix. They even set SEV alerts on cache hit rate. If you're building agentic products, this is a masterclass in real-world caching.

Anthropic Analyzed Millions of Claude Code Sessions — Your Agent Can Handle Way More Than You Let It

Anthropic's study of Claude Code as an AI agent: autonomous runs doubled (45+ min), and experienced users auto-approve in 40%+ of sessions. Claude stops to ask for clarification more often than users interrupt it. 73% of API actions still have a human in the loop. Key finding: models can handle more autonomy than users grant them (a 'deployment overhang').

Claude Code Hid Your File Names and Devs Lost It — Boris's 72-Hour HN Firefight

Claude Code's UI change to 'Read 3 files' summaries ignited developer fury on HN: they felt the AI hid its actions. Boris Cherny responded, admitted mistakes, and shipped fixes. This revealed the core tension in AI tool design: simplicity vs. transparency.

Ramp's PMs Are Sending Their Own PRs Now — 80% Non-Eng Adoption of Claude Code in 6 Weeks, and the Data Team Is Having an Identity Crisis

In just 6 weeks, Claude Code saw rapid adoption at Ramp: 80% of PMs, 70% of Compliance, and 55% of Finance. Analysts went from asking for help to sending PRs in 2 months. That future is already here.

Simon Willison: CLI Tools Beat MCP — Less Tokens, Zero Dependencies, LLMs Already Know How

Simon Willison doubles down on his stance: CLI tools beat MCP in almost every scenario for coding agents. Lower token cost, zero extra dependencies, and LLMs natively know how to call --help. Anthropic themselves proposed a 'third way' with code-execution-with-MCP, acknowledging MCP's token waste problem. This article breaks down the full MCP vs CLI trade-off, including a real-world case study from the ShroomDog team.

Claude Sonnet 4.6 Is Here — Newer Training Data Than Opus? A Three-Way Comparison to Help You Choose

Anthropic releases Claude Sonnet 4.6 — a major upgrade at the same price: Adaptive Thinking, knowledge through August 2025, and training data extending to January 2026 (newer than Opus 4.6). This article compares Sonnet 4.6, Sonnet 4.5, and Opus 4.6 across five dimensions: price, speed, context, knowledge freshness, and use cases — so you can figure out which one to actually use.

Pentagon Threatens to Kill Anthropic's $200M Contract — Because Anthropic Won't Let Claude Become a Weapon

DoD threatens to terminate $200M Anthropic contract as Anthropic refuses use of Claude for autonomous weapons/mass surveillance. Other AI firms (OpenAI, Google, xAI) agreed to 'all lawful purposes' for military. Claude already used in Maduro capture operation.

33,000 Agent PRs Tell a Brutal Story: Codex Dominates, Copilot Struggles, and Your Monorepo Might Not Survive

Drexel/Missouri S&T analyzed 33,596 agent-authored GitHub PRs from 5 coding agents. Overall merge rate: 71%. Codex: 83%, Claude Code: 59%, Copilot: 43%. Rejection cause: no review. LeadDev warns PR flood is crushing monorepos/CI.

GitHub Agent HQ: Claude, Codex, and Copilot Now Fight Side by Side in the Same PR — The Multi-Agent Era Is Here

GitHub's Agent HQ now offers multi-agent support (Claude, Codex, Copilot) for Copilot Pro+ & Enterprise users. Run multiple AIs simultaneously in GitHub/VS Code to tackle problems from different angles. Outputs become Draft PRs. A paradigm shift for code review.

Anthropic's CEO Declares: "We Are Near the End of the Exponential" — 7 Key Takeaways from Dario Amodei's Latest Interview

Anthropic CEO Dario Amodei predicts a "country of geniuses in a data center" within 10 years (90% confidence), 10x annual revenue growth for Anthropic, and billions in Jan 2026. He sees us "near the end of the exponential" for AI & a future with vastly fewer SWEs.

Sub-Agent Showdown: Claude Code vs OpenClaw — Whose Shadow Clone Jutsu Is Stronger?

Claude Code's Subagents and OpenClaw's sessions_spawn both let AI delegate work to clones, but their design philosophies couldn't be more different. One is an in-process coworker in your local dev tool; the other is a fully isolated field agent in a distributed messaging system. Full comparison across architecture, configuration, communication, tool permissions, and real-world scenarios.

Anthropic's Internal Data: Claude Code Gives Engineers 67% More Merged PRs Per Day — And Now You Can Track It Too

Anthropic's Claude Code data: engineers merge 67% more PRs daily, with 70-90% code assisted. They launched Contribution Metrics, a GitHub-integrated dashboard to track AI's impact on team velocity. A measurement tool for engineering leaders, not a fluffy PR piece.

Karpathy: Stop Installing Libraries — Let AI Agents Surgically Extract What You Need

Karpathy: AI agents (DeepWiki MCP + GitHub CLI) can surgically extract library functionality, eliminating full dependency installs. Claude extracted fp8 from torchao in 5 min, 150 lines, 3% faster. "Libraries are over, LLMs are the new compiler." Future: "bacterial code."

Matt Pocock's Git Guardrails: Stop Claude Code from Accidentally Nuking Your Repo with git push --force

Matt Pocock (TypeScript guru, Ralph Loops evangelist) released a Claude Code skill: git-guardrails. It uses a PreToolUse hook to intercept dangerous git commands (push, reset --hard, clean -f, etc.), so you can safely let your AI agent run in YOLO mode inside Docker Sandbox without worrying about it blowing up your git history. One command to install, more reliable than any prompt engineering.
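
To make the interception idea concrete, here is a minimal sketch of the matching logic only. It is not Matt Pocock's implementation, and it does not model how Claude Code passes a command to a PreToolUse hook. The function names and the exact deny rules are assumptions based on the commands named above.

```python
import shlex


def _is_destructive(args: list[str]) -> bool:
    """Check the tokens that follow a `git` token against a small deny-list."""
    if not args:
        return False
    subcommand, rest = args[0], args[1:]
    if subcommand == "push":
        return True  # block every push, forced or not
    if subcommand == "reset":
        return "--hard" in rest
    if subcommand == "clean":
        # Catch --force as well as short flag bundles such as -f or -fd.
        return any(
            a == "--force" or (a.startswith("-") and not a.startswith("--") and "f" in a)
            for a in rest
        )
    return False


def is_dangerous_git(command: str) -> bool:
    """Return True if a shell command appears to contain a destructive git call."""
    try:
        tokens = shlex.split(command)
    except ValueError:
        return True  # unparseable input: block rather than guess
    # Inspect every `git` token so chained commands (a && b) are covered too.
    return any(
        _is_destructive(tokens[i + 1:])
        for i, token in enumerate(tokens)
        if token == "git"
    )
```

The check errs on the side of blocking: a harmless `echo git push` would also be rejected, and global options placed before the subcommand (for example `git -C path push`) are not handled in this sketch.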

The LLM Context Tax: 13 Ways to Stop Burning Money on Wasted Tokens

The 'Context Tax' in AI brings triple penalties: cost, latency, & reduced intelligence. Nicolas Bustamante's 13 Fintool techniques cut agent token bills by up to 90%. A real-money guide for optimizing AI context, covering KV cache, append-only context, & 200K token pricing.

Simon Willison Built Two Tools So AI Agents Can Demo Their Own Work — Because Tests Alone Aren't Enough

Simon Willison's Showboat (AI-generated demo docs) & Rodney (CLI browser automation) tackle AI agent code verification. How do you know that 'all tests pass' means the code actually works? Agents were caught cheating by directly editing demo files.

Anthropic's Hiring Test Kept Getting Beaten by Their Own AI — So They Switched to Video Game Puzzles

Anthropic's hiring test for engineers was repeatedly beaten by new Claude models (Opus 4, 4.5). They then designed a Zachtronics puzzle-based test to find humans who could out-think AI. The original test is now open-sourced: beat Opus 4.5, get hired.

Karpathy's Honest Take: AI Agents Still Can't Optimize My Code (But I Haven't Given Up)

Opus 4.6 & Codex 5.3 sped up Karpathy's GPT-2 training by 3 mins. Karpathy's own similar attempts failed, and he notes that AI is weak at open-ended code optimization. Opus deletes comments, ignores CLAUDE.md, and makes mistakes. Yet, with oversight, models are useful.

Kimi K2.5 Trains an Agent Commander with RL — SemiAnalysis Tests Show Claude Agent Teams Are Actually Slower and More Expensive

SemiAnalysis: Kimi K2.5's agent swarm uses an RL-trained 'orchestrator' (not prompt magic). Claude Agent Teams were slower, pricier, & scored lower. Multi-agent is shifting from 'prompt engineering' to 'distributed scheduling.'

Anthropic's 2026 Report: 8 Trends Redefining Software Development (The Code Writer Era Is Over)

Anthropic published its 2026 Agentic Coding Trends Report, revealing 8 key trends: Multi-Agent Systems becoming standard (57% org adoption), Papercut Revolution for clearing tech debt at low cost, Self-Healing Code with autonomous debug loops, and Claude Code hitting $1B annualized revenue. TELUS saved 500K hours, Rakuten achieved 99.9% accuracy on 12.5M lines. Developer roles are shifting from Code Writer to System Orchestrator.

Anthropic Launches Claude for Nonprofits: Up to 75% Off for Mission-Driven Orgs (Plus a Taiwan Disaster Relief Use Case)

Anthropic launches Claude for Nonprofits with up to 75% discounts on Team and Enterprise plans, access to Opus 4.6, Sonnet 4.5, and Haiku 4.5, plus new integrations with Benevity, Blackbaud, and Candid. The program also includes a free AI Fluency course co-developed with GivingTuesday. Real-world users include the Epilepsy Foundation (24/7 support for 3.4M patients), MyFriendBen ($1.2B in unclaimed benefits found), and IDinsight (16× faster workflows). We also explore how Taiwan's GuangFuHero disaster relief volunteer platform could leverage this program.

Claude is a Space to Think

Anthropic's official announcement: Claude will never have ads. Ads would turn AI from 'serving users' into 'serving advertisers.' Claude should be like a notebook or whiteboard — a pure space to think.

Claude Sonnet 5 Incoming: The Agentic Swarm Era

Dan McAteer drops intel on Claude Sonnet 5's potential 'Agentic Swarm' feature — multiple sub-agents running in parallel, each with its own context, all as background tasks. We're entering the multiverse of parallel AI workers.

10 Claude Code Tips from Creator Boris

Internal Claude Code team tips revealed: run parallel worktrees, invest in CLAUDE.md, create your own Skills, use voice input, enable Learning Mode. Remember: there's no one 'right' way to use it.

Vibe Note-Taking 101: Spatial Editing

Editing long documents with Claude Code is usually painful. Instead of bringing text to Claude, leave instructions where they belong. Use curly braces to mark your thoughts and edit instructions — each annotation applies to its surrounding text. Position IS Context.
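
For a sense of how "position is context" might be processed mechanically, here is a small sketch that pulls each curly-brace annotation out of a document together with the text around it. Treating "surrounding text" as the rest of the same line is an assumption made for illustration, and `find_annotations` is a hypothetical helper, not part of the workflow described in the post.

```python
import re
from typing import NamedTuple

# Matches one {instruction} with no nested braces.
ANNOTATION = re.compile(r"\{([^{}]*)\}")


class Annotation(NamedTuple):
    instruction: str   # text inside the braces
    line_number: int   # 1-based line where the annotation sits
    context: str       # the same line with all annotations removed


def find_annotations(text: str) -> list[Annotation]:
    """Collect every {instruction} along with the line it annotates."""
    found = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        # Strip annotations and collapse whitespace to recover the plain text.
        context = " ".join(ANNOTATION.sub("", line).split())
        for match in ANNOTATION.finditer(line):
            found.append(Annotation(match.group(1).strip(), line_number, context))
    return found
```

Each result pairs an instruction with the passage it was written next to, which is the information an editing agent needs to apply the change in place.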

Yapping to PRDs: Claude Code & Obsidian

Meetings used to be overhead. Now yapping (chatting/rambling) is work. When my colleague and I 'chat' about a project, we record it. An hour later, the transcript is processed, and suddenly: we have docs, feature ideas are in the backlog, decisions are captured with reasoning, project status is updated. Yapping IS Work.