Claude Code Auto Mode: Teaching AI to Judge Which Commands Are Too Dangerous to Run
Anthropic ships auto mode for Claude Code: a model-based classifier that replaces manual permission approvals and sits between 'approve everything manually' and 'skip all permissions.' This post breaks down its architecture, threat model, two-stage classifier design, and candidly reported 17% false-negative rate.