local-ai
2 articles
Running a Trillion-Parameter Model on a MacBook? The Wild SSD Streaming Experiment
Simon Willison shared a new trend in running massive MoE models on Macs: streaming expert weights from SSD instead of cramming everything into RAM. Even a trillion-parameter Kimi K2.5 runs on a 96GB MacBook Pro.
Hermes Just Performed Brain Surgery on Itself: A Local AI Agent Hot-Swapped Its Own Model Weights
A local AI agent called Hermes downloaded and switched to a new model (qwopus) without stopping — like swapping a plane's engine mid-flight. Teknium from Nous Research saw it and said 'submit this to a hackathon.'