Running a Trillion-Parameter Model on a MacBook? The Wild SSD Streaming Experiment

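Simon Willison highlighted a new trend for running massive Mixture-of-Experts (MoE) models on Macs: streaming expert weights from the SSD on demand instead of cramming the whole model into RAM. Because an MoE model activates only a small subset of its experts for each token, only those experts' weights have to be in memory at any given moment. With this approach, even the trillion-parameter Kimi K2.5 runs on a 96GB MacBook Pro.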
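
To make the idea concrete, here is a toy sketch of streaming expert weights from disk. It is not the code behind the experiments mentioned above. The file layout, the tiny sizes, and every name in it (`write_toy_experts`, `load_expert`, `moe_forward`) are assumptions made up for illustration. It only shows the core trick: keep all experts in a file, and for each token read just the few experts the router selects.

```python
import array
import math
import os
import random
import tempfile

# Toy sizes, chosen only for illustration (real models are vastly larger).
NUM_EXPERTS, D_MODEL, TOP_K = 8, 4, 2
EXPERT_BYTES = D_MODEL * D_MODEL * 4  # one float32 matrix per expert


def write_toy_experts(path, rng):
    """Write NUM_EXPERTS random D_MODEL x D_MODEL float32 matrices back to back."""
    with open(path, "wb") as f:
        for _ in range(NUM_EXPERTS):
            values = (rng.uniform(-1, 1) for _ in range(D_MODEL * D_MODEL))
            array.array("f", values).tofile(f)


def load_expert(f, idx):
    """Seek to one expert's offset and read only that expert's weights."""
    f.seek(idx * EXPERT_BYTES)
    flat = array.array("f")
    flat.fromfile(f, D_MODEL * D_MODEL)
    return [flat[r * D_MODEL:(r + 1) * D_MODEL] for r in range(D_MODEL)]


def moe_forward(x, router, f):
    """Pick the top-k experts for x, stream them from disk, and mix their outputs."""
    logits = [sum(w * v for w, v in zip(row, x)) for row in router]
    top = sorted(range(NUM_EXPERTS), key=lambda i: logits[i])[-TOP_K:]
    # Softmax over the selected experts only, shifted for numerical stability.
    peak = max(logits[i] for i in top)
    gates = [math.exp(logits[i] - peak) for i in top]
    total = sum(gates)
    out = [0.0] * D_MODEL
    for gate, idx in zip(gates, top):
        weights = load_expert(f, idx)  # the only disk read for this expert
        for r, row in enumerate(weights):
            out[r] += (gate / total) * sum(w * v for w, v in zip(row, x))
    return out, sorted(top)


rng = random.Random(0)
path = os.path.join(tempfile.mkdtemp(), "experts.bin")
write_toy_experts(path, rng)

# The router is small, so it stays in RAM; the experts stay on disk.
router = [[rng.uniform(-1, 1) for _ in range(D_MODEL)] for _ in range(NUM_EXPERTS)]
x = [rng.uniform(-1, 1) for _ in range(D_MODEL)]

with open(path, "rb") as f:
    y, used = moe_forward(x, router, f)

print(f"read {len(used)} of {NUM_EXPERTS} experts from disk: {used}")
```

In this sketch the memory needed per token scales with `TOP_K` experts rather than `NUM_EXPERTS`, which is the property that lets a model far larger than RAM run at all. A real implementation would presumably also rely on memory mapping, caching of frequently used experts, and fast SSD reads, none of which is modeled here.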