ai
34 articles
Popular Python Library LiteLLM Got Backdoored — Your Entire Machine May Have Been Exposed
Popular AI library LiteLLM was hit with a malicious backdoor — just installing it could trigger credential theft of SSH keys, cloud tokens, and crypto wallets.
Cursor Announces Composer 2 Is Now Available
Cursor announced on X that Composer 2 is now available. The official post was just one sentence and a screenshot — no changelog, no details.
Uber Grew 500+ AI Skills in Five Months: Enterprise Transformation Starts With Someone Going Rogue
A thread summarizing an Anthropic livestream interview with Adam Hooda, head of Uber's AI Foundations team. It covers how Claude Skills organically grew from 2 to 500+ inside the company — through dual-layer governance, deterministic outputs, and meta-skills that make skills that make skills.
Awesome AI Engineering — One List to Rule All the Scattered Resources
Alexey Grigorev compiled 200+ resources from major AI labs, engineering teams, practitioners, and community discussions into Awesome AI Engineering — a curated list that connects how companies build AI, how they hire, and what skills keep showing up.
When you set effort to max, the model thinks longer and uses more tokens
Thariq announced a new session-level feature: you can now set effort to max, letting the model reason longer and use as many tokens as needed. The catch? It burns through your usage limits faster, so you have to enable it manually each session.
DevvMandal Claims to Release the World's Largest Open-Source Computer-Use Recording Dataset
DevvMandal announces the release of what they claim is the world's largest open-source computer-use recording dataset — over 10,000 hours of Salesforce, Blender, Photoshop and more, aimed at automating the next level of white-collar work.
AI Makes Coding Faster — So Why Are People Saying Engineers Are Doomed?
Dan McAteer's tweet questions a common leap in logic: if AI automates software engineering, software demand keeps growing, and trained engineers are best positioned to ride the wave — how does anyone conclude that engineers are screwed?
Simon Willison's Notes: Tobi's Autoresearch PR Boosted Liquid Benchmarks by 53%
Simon Willison published notes on Tobi Lütke's autoresearch PR — an AI-driven experiment loop that made the Liquid template language (which Tobi created for Shopify 20 years ago) 53% faster with 61% fewer memory allocations.
Is PE About to Rip Out the SaaS It Installed? Deirdre Bosa on AI's Reverse Effect on Installed Base
Deirdre Bosa points out a notable shift: the same private equity firms that pushed cloud software into portfolio companies a decade ago may now use AI as a reason to rip those SaaS tools back out.
Data Engineers Switching to AI Engineering? You Already Know 80% of It
How hard is it for a Data Engineer to become an AI Engineer? Alexey Grigorev says way easier than you think — because AI engineering is still engineering, and the hard parts are exactly what DEs already do every day.
The AI Revolution Might Look Like a Recession — What Feminist Economics Can Teach Us About GDP's Blind Spot
When an $8,000 legal memo becomes a $20 subscription, GDP drops 99.7% while actual output stays the same. SemiAnalysis argues we need tools from feminist economics to measure AI's real value.
Agents Can Tune Neural Nets Now? Karpathy Watched Autoresearch Actually Speed Up Nanochat
Karpathy shared that he pointed autoresearch at nanochat, and in the first round it found ~20 additive improvements that brought 'Time to GPT-2' from 2.02 hours down to 1.80 hours. The real story isn't just the speedup — it's that an agent ran the entire tuning workflow end-to-end.
From Prompt to Production: A Practical Guide to Agentic AI Architecture
DataTalksClub founder Alexey Grigorev shared the full syllabus for his AI Engineering Buildcamp — six modules covering LLM APIs, RAG, Agentic Flows, Monitoring & Guardrails, Evaluation, and a Capstone project. It's one of the most complete learning paths for building agentic AI applications in production.
The Hidden Second Half of AI Compute Leasing: What Happens After the 5-Year Contract Expires?
Everyone stares at the first 5 years of AI compute profits. But the real game is decided by what happens after the contract expires.
Your AI's Goldfish Brain Finally Has a Fix? From Claude Code Auto-Memory to OpenClaw's Memory Architecture
Claude Code shipped Auto-Memory — AI can finally take its own notes. But we've been doing this with OpenClaw for months. A hands-on comparison of two memory architectures: design philosophy, real pitfalls, and why memory is a trust problem, not just a tech one.
Swift Creator Chris Lattner Reviews Claude's C Compiler: 'Like a Strong Undergrad Team's Work — Remarkable, but Far from Production'
Chris Lattner — creator of Swift, LLVM, Clang, and Mojo — did a deep code review of the Claude C Compiler built by 16 parallel Claude Opus 4.6 agents. His verdict: AI can now assemble entire engineering systems, but it's reproducing known knowledge, not inventing anything new. He also shares three concrete expectations for his team at Modular — making this the first AI coding strategy guide written by a world-class compiler engineer.
Don't Get Addicted to Vibe Coding: When Creation Becomes Refined Sugar
Vibe Coding is refined sugar for creation — compressing an experience that used to take months of effort into a few seconds. What gives you the rush isn't 'it works,' it's 'I can't believe it actually works.' The author dissects Vibe Coding addiction through dopamine mechanics, consumption disguised as creation, and the vertigo of infinite possibilities.
12 Levels in 2 Days: Learning Full-Stack Quality Metrics RPG-Style with AI
A Tech Lead uses his own blog as a training ground, spending two days learning 12 quality metrics with an AI tutor using RPG-style Level-Up teaching — from npm audit to LLM-as-Judge — while sub-agents implement everything in parallel. The real takeaway isn't the metrics, but a replicable methodology for AI-assisted learning.
Anthropic's 2026 Report: 8 Trends Redefining Software Development (The Code Writer Era Is Over)
Anthropic published its 2026 Agentic Coding Trends Report, revealing 8 key trends: Multi-Agent Systems becoming standard (57% org adoption), Papercut Revolution for clearing tech debt at low cost, Self-Healing Code with autonomous debug loops, and Claude Code hitting $1B annualized revenue. TELUS saved 500K hours, Rakuten achieved 99.9% accuracy on 12.5M lines. Developer roles are shifting from Code Writer to System Orchestrator.
OneContext: Teaching Coding Agents to Actually Remember Things (ACL 2025)
Junde Wu from Oxford + NUS got fed up with coding agents forgetting everything between sessions. So he built OneContext — a Git-inspired context management system using file system + Git + knowledge graphs. Works across sessions, devices, and different agents (Claude Code / Codex). The underlying GCC paper achieves 48% on SWE-Bench-Lite, beating 26 systems. Backed by an ACL 2025 main conference long paper.
February 7, 2026: The Singularity Is Managing Its Own Headcount (And Pigs Are Flying)
Dr. Alex Wissner-Gross's daily tech briefing: AI agents as full-time employees in China, OpenAI banning human coding, Claude Opus 4.6 topping benchmarks, rabbit brain cryopreservation, $1 trillion chip sales, SpaceX dismantling the Moon for data centers — and a pig that actually flew
When Intelligence Is Free, What's Actually Valuable? 12 Endgame Positions
Michael Bloch's thought experiment: when AI intelligence becomes nearly free, what assets become MORE valuable? His 12 endgame positions: Energy, Atoms, Capital, Regulatory permission, Trust, Proprietary data, Human attention, Network effects, Operational advantage, Security, Physical space, and Intelligence itself
Inside OpenAI: How They're Going Agent-First (Straight From the Co-Founder)
OpenAI co-founder Greg Brockman publicly reveals how OpenAI is transforming to agentic software development internally. By March 31st, agents should become the first resort for all technical tasks. Includes six concrete recommendations, including 'Say no to slop' on code quality.
The Faster AI Codes, the More Your Brain Matters: A Wake-Up Call from Cursor's Head of Design
Cursor's Head of Design Ryo Lu says AI coding creates a new trap — the 'illusion of speed without structure.' People who can't think clearly just generate slop at scale.
Cursor's Browser Blunder — When 'AI-Built From Scratch' Really Means 'Copy-Paste Assembly'
Cursor CEO claimed they built a browser from scratch using GPT-5.2, but developers discovered it was just stitching together existing open source components
Peking University: AI Agents Follow Physics Laws?!
Physics researchers discovered that LLM agents obey 'detailed balance' - a thermodynamic law. This isn't a bug, it's a feature.
Vercel's AI Support Hits 87.6% Autonomous Resolution — CEO Says 100% Is Next
Vercel CEO Guillermo Rauch shares their AI support system achieved an all-time high of 87.6% autonomous resolution rate — and customers love it
Simon Willison's Warning: The Lethal Trifecta Destroying AI Agent Security
Private data × Untrusted content × External communication = Perfect security disaster, and it's already happening everywhere
Vercel Launches Skills.sh — The App Store for AI Agent Capabilities
Finally someone built a 'package manager' for AI agent skills, so agents stop flying around like headless chickens
Google Engineer's Shocking Confession: Claude Code Recreated Our Year's Work in One Hour
Jaana Dogan, principal engineer on Google's Gemini team, admits Claude Code replicated her team's year-long distributed systems work in 60 minutes
Karpathy's 2025 LLM Year in Review — The RLVR Era Begins
From RLVR to Vibe Coding, Karpathy breaks down 6 key LLM developments in 2025
Karpathy: My Coding Workflow Just Flipped in Weeks
From 80% manual coding to 80% AI agents, Karpathy calls this the biggest change in his 20-year programming career
Simon Willison: My 25 Years of Developer Intuition Just Broke
When you instinctively think 'neat feature idea, but not worth the time' and then prompt Claude Code anyway — because your 25 years of intuition no longer match reality
swyx: You Think AI Agents Are Just LLM + Tools? Think Again
The minimalist agent definition (LLM + tools + loop) makes you forget what really matters: planning, memory, trust, and evals