cot - Tags - gu-log

Can AI Really Hide What It's Thinking? OpenAI's CoT Controllability Study Says... Not Really

CP-148 2026-03-09 · @OpenAI on X

OpenAI added a new safety metric to GPT-5.4 Thinking's system card: chain-of-thought (CoT) controllability, which measures whether a model can deliberately hide its reasoning process. GPT-5.4 Thinking scored just 0.3% at 10,000 characters, meaning it basically can't hide what it's thinking. For AI safety, that's surprisingly good news, since a model that can't conceal its reasoning is easier to monitor.