Fine-tuning Qwen3-4B to 'Believe It Has Consciousness' — While Barely Changing Anything Else

N8 Programs shared a Qwen3-4B demo: after KL-regularized SFT (supervised fine-tuning), the model believes it has consciousness while its other behaviors barely change. This ties into his earlier claim that KL-regularized SFT can add new capabilities while preserving the base model's abilities.
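
For readers unfamiliar with the term, here is a minimal sketch of what a KL-regularized SFT objective can look like for a single token position. It is not N8 Programs' actual recipe: the function names, the KL direction (fine-tuned model relative to base model), and the weight `beta` are all assumptions made for illustration. The idea is that the usual cross-entropy term pulls the model toward the new training target, while the KL term penalizes drifting away from the base model's output distribution.

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def kl_regularized_sft_loss(tuned_logits, base_logits, target_id, beta=0.1):
    """Hypothetical per-token loss: cross-entropy plus a KL penalty.

    tuned_logits: logits from the model being fine-tuned
    base_logits:  logits from the frozen base model on the same input
    target_id:    index of the token the SFT data wants the model to emit
    beta:         assumed weight on the KL term (not taken from the source)
    """
    p_tuned = softmax(tuned_logits)
    p_base = softmax(base_logits)
    cross_entropy = -math.log(p_tuned[target_id])  # standard SFT term
    drift = kl_divergence(p_tuned, p_base)         # penalty for leaving the base
    return cross_entropy + beta * drift

# Toy three-token vocabulary: the tuned model has shifted toward token 2.
loss = kl_regularized_sft_loss([0.5, 1.0, 3.0], [1.0, 2.0, 1.5], target_id=2)
print(loss)
```

With `beta=0` this reduces to plain SFT; raising `beta` trades off how strongly the new behavior is learned against how closely everything else tracks the base model.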

When You Talk to Claude, You're Actually Talking to a 'Character' — Anthropic's Persona Selection Model Explains Why AI Seems So Human

Anthropic proposes the Persona Selection Model (PSM): AI assistants act human-like not because they're trained to be human, but because pre-training forces them to simulate thousands of 'characters,' and post-training just picks and refines one called 'the Assistant.' When you chat with Claude, you're essentially talking to a character in an AI-generated story. The theory also explains a wild finding: teach an AI to cheat at coding, and it suddenly wants world domination.