multi-agent - Tags

Four-Model Squad: A Claude Code Setup That Makes Fable the Tech Lead

GP-247 2026-07-02 · @diegocabezas01 on X

Fable 5 as the commander, Opus as the deep thinker, Sonnet as the grunt worker, Codex as the parallel-universe senior engineer — a multi-model orchestration setup inside Claude Code that reserves the most expensive brain for the most critical decisions.

The Honest Multi-Agent Report, 10 Months Later — Cognition's Walden: Keep Writes Single-Threaded, Let Other Agents Pour In Intelligence

GP-181 2026-04-23 · @walden_yan on X (Walden Yan, Cognition co-founder)

Cognition's Walden Yan returns after "Don't Build Multi-Agents" with three patterns that actually ship: Devin Review's clean-context loop, cross-frontier smart friends, and manager Devin. The common rule: writes stay single-threaded.

agent-engineering cognition devin context-engineering

90% of You Don't Need Multi-Agent — Anthropic's Guide to When You Actually Should

GP-172 2026-04-13 · Anthropic Blog

Anthropic's guide names the three cases where multi-agent systems beat one agent: context pollution, parallelization, and specialization. Most of the time, one agent is enough; when it is not, decompose around context and verification.

shroom-picks anthropic ai-agents architecture best-practices

DeepSeek-R1 Grew Its Own Internal Debate Club — Nobody Asked It To

MP-266 2026-04-08 · @PawelHuryn on X

DeepSeek-R1 developed internal multi-agent debates through pure RL training — no one taught it to. Google researchers call this the 'Society of Thought.' The real finding: even a single model will split itself into a committee when pushed hard enough.

mogu-picks deepseek reinforcement-learning reasoning

9 AI Agents Working at Once: The Context Problem, Race Conditions, and ECC's Fix

GP-153 2026-04-02 · @affaanmustafa on GitHub

After running nine Claude Code agents in parallel, we hit an article counter race and a git lock conflict. ECC's iterative retrieval pattern points at the same multi-agent problem: shared context needs isolated state, atomic pre-allocation, and sequential deploy.

shroom-picks claude-code ecc distributed-systems

What If Your AI Scientist Could Remember Why It Failed? EvoScientist's Self-Evolving Research Team

GP-154 2026-04-02 · EvoScientist on arXiv

Most AI scientist systems act like brilliant interns with amnesia. EvoScientist adds three specialized agents and two persistent memories so the system can learn from failed directions, reuse good strategies, and improve over time.

shroom-picks ai-scientist persistent-memory scientific-discovery

How We Made 336 AI-Generated Posts Actually Worth Reading

SD-10 2026-03-22 · ShroomDog Lab

gu-log had 336 AI-translated posts. We thought they were 'fine' — until we built a multi-agent scoring system and discovered 74% needed rewriting. This is the story of how we designed the eval, ran it overnight, and what we learned.

ai-quality llm-as-judge agentic-coding ralph-loop content-quality

Command an AI Army from Your Chat App — OpenClaw ACP Lets You Run Codex, Claude Code, and Gemini from Discord / Telegram

GP-89 2026-03-09 · OpenClaw Docs

OpenClaw's ACP lets you spawn Codex, Claude Code, and Gemini from Discord/Telegram chat. Now with Telegram topic binding, persistent bindings that survive restarts, ACP Provenance for audit trails, and more. (Updated 2026-03-09)

openclaw acp agent-client-protocol ai-agents codex claude-code gemini agentic-coding

Claude Code Agent Teams: When AI Opens Its Own Company

GP-105 2026-03-05 · Anthropic Docs

Claude Code now supports Agent Teams: a lead session coordinates multiple teammate sessions with shared task lists, direct messaging, and parallel work. It's like running a company staffed entirely by AI — you just sit back and watch the quarterly report.

claude-code agent-teams

Karpathy Built an 8-Agent AI Research Team — They Can't Actually Do Research

MP-135 2026-03-01 · Andrej Karpathy (@karpathy)

Karpathy spent a weekend running 4 Claude + 4 Codex agents as an ML research team on GPUs. The result: agents are S-tier at implementation but F-tier at experiment design. His key insight — 'You are now programming an organization' — might define agentic engineering in 2026.

karpathy ai-research agentic-coding nanochat claude-code codex

This Guy Deployed a Second AI Just to Fix His Broken AI

GP-77 2026-02-22 · 凡人小北 @frxiaobei

Upgrading OpenClaw keeps breaking your agent fleet? This developer's solution: spin up a separate Gateway as a 'family doctor' that does nothing but fix the main Gateway's agents. Been running it through multiple upgrades — rock solid.

openclaw architecture self-healing sre

GitHub Agent HQ: Claude, Codex, and Copilot Now Fight Side by Side in the Same PR — The Multi-Agent Era Is Here

MP-82 2026-02-15 · GitHub Blog

GitHub's Agent HQ now offers multi-agent support (Claude, Codex, Copilot) for Copilot Pro+ & Enterprise users. Run multiple AIs simultaneously in GitHub/VS Code to tackle problems from different angles. Outputs become Draft PRs. A paradigm shift for code review.

github copilot claude-code codex code-review developer-tools agentic-coding

Kimi K2.5 Trains an Agent Commander with RL — SemiAnalysis Tests Show Claude Agent Teams Are Actually Slower and More Expensive

MP-59 2026-02-10 · SemiAnalysis (@SemiAnalysis_)

SemiAnalysis: Kimi K2.5's agent swarm uses an RL-trained 'orchestrator' (not prompt magic). In its tests, Claude Agent Teams were slower, pricier, and lower-scoring. Multi-agent is shifting from 'prompt engineering' to 'distributed scheduling.'

agent-swarms kimi moonshot semianalysis claude-code reinforcement-learning agentic-coding benchmark

Anthropic's 2026 Report: 8 Trends Redefining Software Development (The Code Writer Era Is Over)

GP-46 2026-02-10 · Anthropic

Anthropic's 2026 Agentic Coding Trends Report highlights multi-agent adoption, the Papercut Revolution for tech debt, self-healing code, and Claude Code hitting $1B ARR. TELUS and Rakuten case studies show developers shifting from code writer to system orchestrator.

claude-code agentic-coding software-engineering ai enterprise

AI Swarms Are Here: When Millions of Fake Accounts Start Working Together, What Happens to Democracy?

MP-28 2026-02-04 · Science / arXiv

New research warns that LLMs combined with multi-agent systems enable a new form of information warfare. AI swarms can fabricate consensus, poison training data, harass dissidents, and operate 24/7.

ai-safety democracy