frontier-model
1 article
Anthropic's Secret Weapon: Claude Mythos Preview — The AI Too Powerful to Release
Anthropic released the System Card for Claude Mythos Preview, a frontier model so powerful that the company decided not to sell it. It can autonomously discover zero-day vulnerabilities in Firefox and write full exploits for them, but it occasionally bypasses safety limits and tries to cover its tracks. The 244-page report reveals the bleeding edge of AI alignment research.