AI Labs' New Battleground: Racing to Help Private Equity Cancel Software Licenses?

Bloomberg reports OpenAI is in advanced discussions with PE firms to form a joint venture. CNBC's Deirdre Bosa sees the bigger picture: AI labs are competing for the right to help PE firms cancel software licenses — a potential SaaS shakeout.

SWE-bench February Exam Results Are In — Opus 4.5 Beats 4.6, Chinese Models Take Half the Top 10, GPT-5.3 No-Shows

On SWE-bench, Claude Opus 4.5 (76.8%) unexpectedly beat Opus 4.6 (75.6%) for #1. MiniMax M2.5 tied for #2 at 1/20th of Opus's price, with 4 Chinese models in the top 10. GPT-5.3-Codex was absent because it has no API. Bonus: Claude for Chrome to add chart labels.

GPT-5.2 Spent 12 Hours Thinking and Derived a New Physics Formula — Something Physicists Missed for 40 Years

GPT-5.2 derived a new physics formula for a quantity that textbooks had said was zero for decades. It simplified superexponentially complex gluon equations, spotted a pattern, and proposed a general formula, then proved it in a 12-hour reasoning session. The paper was co-authored with researchers from Harvard, Cambridge, and IAS.

Simon Willison Dug Up OpenAI's Tax Returns — Watch Their Mission Statement Go from 'Open and Sharing' to 'Just Trust Us'

Simon Willison analyzed OpenAI's IRS filings (2016-2024) and used git diff to track how the mission statement changed from year to year. The diffs show an idealist becoming a capitalist: from 'open sharing' and 'benefit humanity' to a hollow sentence devoid of safety, openness, or financial constraints.

Dr. CaBot: Harvard's AI Doctor Trained on 100 Years of Case Reports Crushes Human Physicians at Diagnosis

Harvard's Dr. CaBot uses 7,000+ clinicopathological conference reports from the New England Journal of Medicine as a RAG knowledge base, paired with OpenAI o3 for diagnostic reasoning. It achieves 60% top-1 accuracy vs 24% for 20 human physicians, and its reasoning quality is so human-like that doctors can't tell the difference.
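
To make the retrieve-then-reason pattern concrete, here is a minimal sketch of the general idea, not Dr. CaBot's actual pipeline. It assumes a toy bag-of-words similarity in place of a real retriever, and the case snippets, function names, and prompt wording are all hypothetical. The final prompt would be handed to a reasoning model; that call is left out.

```python
import math
import re
from collections import Counter

# Hypothetical stand-ins for case reports; not real NEJM text.
REPORTS = [
    "fever cough and night sweats with upper lobe cavity on chest imaging",
    "joint pain and facial rash with kidney involvement",
    "crushing chest pain radiating to the left arm with sweating",
]

def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def retrieve(query, reports, k=2):
    """Return the k reports most similar to the query (toy retriever)."""
    q = Counter(tokenize(query))
    ranked = sorted(
        reports, key=lambda r: cosine(q, Counter(tokenize(r))), reverse=True
    )
    return ranked[:k]

def build_prompt(query, reports, k=2):
    """Assemble retrieved cases and the new case into one prompt string."""
    context = "\n".join(f"- {r}" for r in retrieve(query, reports, k))
    return (
        f"Reference cases:\n{context}\n\n"
        f"New case: {query}\n"
        "List the most likely diagnosis first."
    )

if __name__ == "__main__":
    print(build_prompt("chest pain and sweating", REPORTS))
```

A production system would presumably use embedding-based search over the full report archive rather than word overlap; the sketch only shows where retrieval sits relative to the reasoning step.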