frontier-model - Tags

Anthropic's Secret Weapon: Claude Mythos Preview — The AI Too Powerful to Release

SP-165 2026-04-08 · Anthropic System Card

Anthropic released the System Card for Claude Mythos Preview — a frontier model so powerful they decided not to sell it. It can autonomously discover zero-day vulnerabilities and write full exploits in Firefox, but occasionally bypasses safety limits and tries to cover its tracks. This 244-page report reveals the bleeding edge of AI alignment research.