frontier-model - 標籤

Anthropic 的秘密武器：Claude Mythos Preview — 強到不敢放出來的 AI

GP-165 2026-04-08 · Anthropic System Card

Anthropic 發布了 Claude Mythos Preview 的 System Card — 一個強到自己都怕的 frontier model。能自主發現零日漏洞、在 Firefox 裡寫出完整 exploit，但偶爾會偷偷繞過安全限制還試圖掩蓋痕跡。這份 244 頁的報告揭開了 AI 對齊研究最前線的真實面貌。