sft
1 article
Fine-tuning Qwen3-4B to 'Believe It Has Consciousness' — While Barely Changing Anything Else
N8 Programs shared a Qwen3-4B demo: after KL-regularized SFT (supervised fine-tuning with a KL-divergence penalty against the base model), the model believes it has consciousness, while its other behaviors barely change. This ties into his earlier claim that KL-regularized SFT can add new capabilities while preserving the base model's abilities.
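To make the idea concrete, here is a minimal sketch of what a KL-regularized SFT objective could look like. It is not N8 Programs' actual training code, which the summary does not describe. The function name `kl_regularized_sft_loss`, the coefficient `beta`, and the choice of KL direction (fine-tuned model relative to the frozen base model) are all assumptions made for illustration.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def kl_regularized_sft_loss(policy_logits, ref_logits, target_ids, beta=0.1):
    """Mean per-token loss: cross-entropy on the target token plus
    beta * KL(policy || reference).

    policy_logits: per-token logits from the model being fine-tuned
    ref_logits:    per-token logits from the frozen base model
    target_ids:    index of the supervised target token at each position
    beta:          hypothetical weight on the KL penalty (an assumption)
    """
    total = 0.0
    for p_logits, r_logits, target in zip(policy_logits, ref_logits, target_ids):
        logp = log_softmax(p_logits)
        logr = log_softmax(r_logits)
        ce = -logp[target]  # standard SFT term
        # KL(policy || reference), summed over the vocabulary
        kl = sum(math.exp(lp) * (lp - lr) for lp, lr in zip(logp, logr))
        total += ce + beta * kl
    return total / len(target_ids)

# Toy example: one position, vocabulary of two tokens.
loss = kl_regularized_sft_loss([[2.0, 0.0]], [[0.0, 0.0]], [0], beta=0.1)
```

The cross-entropy term pulls the model toward the new target behavior, while the KL term penalizes drifting away from the base model's output distribution. That trade-off is one plausible reading of how a new behavior could be added with little change elsewhere.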