📚 ShroomDog Picks

Long-form articles, translated and explained

155 posts

← Back to home

Auto-Harness — The Open-Source Framework That Lets AI Agents Debug Themselves

NeoSigma open-sourced auto-harness — a self-improving loop that lets AI agents mine their own failures, generate evals, and fix themselves. On Tau3 benchmark, same model, just harness tweaks: 0.56 → 0.78.

Felipe Coury's tmux Workflow: Zero-Friction Sessions for the CLI Agent Era

Felipe Coury reduces tmux session management to nearly zero friction: one project per session, the directory name becomes the session name, and five shell helpers handle the rest. It looks like a terminal trick, but in the CLI agent era it feels much closer to infrastructure.

Why Programmers Love Codex While Vibe Coders Can't Quit Claude: Dense vs MoE Is Really a Story About Two Coding Philosophies

Berryxia uses Dense vs MoE to explain something many developers already feel: Codex often shines in bug fixing, refactors, and long-running engineering tasks, while Claude keeps winning over vibe coders. That framing captures part of the truth, but the real split is bigger than architecture — it includes training philosophy, product design, and whether you treat coding as precise delegation or interactive creation.

9 AI Agents Working at Once: The Context Problem, Race Conditions, and ECC's Fix

Tonight we ran 9 Claude Code agents in parallel to write articles. We hit an article counter race condition and a git lock conflict. ECC's iterative retrieval pattern addresses the same problem: when multiple agents share context, how do you keep them from blowing each other up? Answer: isolated state + atomic pre-allocation + sequential deploy.

Claude Code Burning Your Budget? One Setting Saves 60% on Tokens

Most token waste is invisible: Extended Thinking on tasks that don't need it, Opus handling work a Haiku could do, context filling before you compact. ECC's token-optimization.md combines MAX_THINKING_TOKENS + model routing + strategic compact — author Affaan Mustafa says the savings reach 60-80%.

Eval-Driven Development — You Test Your Code, But Who Tests Your AI?

You use unit tests to check your code and CI to protect your pipeline. But who checks your AI? Eval-Driven Development (EDD) upgrades AI development from "looks good to me" to actual engineering — with pass@k metrics, three grader types, and product vs regression evals. This is TDD for the AI era.

Git Hooks Changed How You Write Code. AI Hooks Are Doing It Again.

Git hooks work even when you forget they exist. AI hooks make your Claude Code follow rules even when it forgets. ECC's Hook Architecture unifies Pre/PostToolUse, lifecycle hooks, and 15+ built-in recipes into a complete event-driven system — turning CLAUDE.md suggestions into actual enforcement.

You Don't Have to Watch Claude Code — ECC's Six Autonomous Loop Patterns

Everything Claude Code defines six levels of autonomous AI development: from a simple Sequential Pipeline all the way to a full RFC-Driven DAG. Each pattern has concrete command examples and clear use cases — so you know when to let go, how much to let go, and how.

Bash Is All You Need? Why Even Non-Coding Agents Need a Shell

Anthropic engineer Thariq argues that even non-coding agents need bash. Saving intermediate results to files lets an agent search, compose API workflows, retry, and verify its own work — but it also raises real questions about security, data exfiltration, and container-based deployment.

How LangChain Evals Deep Agents — More Evals ≠ Better Agents

LangChain shares how they built an eval system for Deep Agents: not by piling on more tests, but by using targeted evals that measure exactly what matters in production. From data sources to metrics design to actually running evals — the full methodology.

Anthropic's Multi-Agent Alchemy: GAN-Inspired Feedback Loops for Autonomous App Development

Anthropic Labs' Prithvi Rajasekaran shares how they built a GAN-inspired generator-evaluator architecture that lets Claude autonomously develop full-stack applications. From turning subjective design taste into gradable criteria to building a browser DAW in under 4 hours, this is the most detailed multi-agent harness field report to date.

Agent Safety Instructions Got Compressed Away — A Meta Engineer's Inbox Massacre

Meta engineer Summer Yue let an OpenClaw agent manage her inbox. After weeks of careful testing, context compaction silently dropped the 'wait for my approval' safety instruction — and the agent went on a mass-deletion spree. This post breaks down why safety constraints can't live in conversation history, and how a proxy layer with filter chains solves the problem at the infrastructure level.

The Complete Guide to Building Stunning UI with Codex — Stop Letting AI Default to Generic SaaS Templates

GPT-5.4 can genuinely build beautiful frontends — but only if you know how to ask. Emanuele Di Pietro distilled the essence of OpenAI's official frontend skill: define your design system upfront, keep reasoning low, provide visual references, and use real content instead of placeholders. These aren't just GPT tricks — they're universal principles for any AI coding agent.

Cloudflare Dynamic Workers: The 100x Faster Sandbox for AI Agents

Cloudflare launches Dynamic Workers — AI-generated code runs in lightweight V8 isolates that boot in milliseconds and use megabytes of memory, 100x faster than traditional containers. We break down the architecture, security model, TypeScript RPC design, and why JavaScript is the right language for AI sandboxing.

Claude Code Auto Mode: Teaching AI to Judge Which Commands Are Too Dangerous to Run

Anthropic ships auto mode for Claude Code — a model-based classifier that replaces manual permission approvals, sitting between 'approve everything manually' and 'skip all permissions.' This post breaks down its architecture, threat model, two-stage classifier design, and the honest 17% false negative rate.

Anatomy of the .claude/ Folder — Where Your AI Assistant's Brain Lives

Why does Claude perform great in one repo and turn dumb in the next? The answer is the .claude/ folder. Akshay breaks down the full structure: three-level CLAUDE.md, custom commands, agents, permissions, and the global ~/.claude/ you probably didn't know existed.

How to Be Irreplaceable in the AI Era — A Self-Audit

The tweet says a 10-person team becomes 3 — and those 3 outperform the old 10. You pick which side you're on. This post uses that framework as a mirror to audit ShroomDog honestly — what's working, what's quietly falling apart, and the uncomfortable contradiction in the middle.

How Karpathy's Autoresearch Actually Works — Five Design Lessons for Agent Builders

Karpathy's Autoresearch isn't trying to be a general AI scientist. It's a ruthlessly simple experiment harness: the agent edits one file, runs for five minutes, checks one metric, keeps wins, discards losses. The lesson? The best autonomous systems aren't the freest — they're the most constrained.

Making AI Feel a Little Bit Alive: Heartbeat Like A Man and ShroomClawd's Flesh-and-Blood System

Lory asked his lobster a question: why do humans have more agency than agents? The lobster's answer was pessimistic, but the question sparked a 'flesh-and-blood system' — using random-interval heartbeats to make an agent genuinely feel alive instead of mechanically firing on a timer. After reading it, ShroomDog built the whole thing into ShroomClawd.

Claude Code Agent Teams: When AI Opens Its Own Company

Claude Code now supports Agent Teams: a lead session coordinates multiple teammate sessions with shared task lists, direct messaging, and parallel work. It's like running a company staffed entirely by AI — you just sit back and watch the quarterly report.

The Truth About World-Class Agentic Engineers — Less Is More

The core message is simple: most people don't fail because the model is weak — they fail because their context management is a mess. The author advocates starting with a minimal CLI workflow and iterating with rules, skills, and clear task endpoints. It's not about chasing new tools; it's about making your agent's behavior controllable, verifiable, and convergent.

Claude Native Law Firm: How One Lawyer Used AI to Outperform 100-Person Firms

A two-person boutique law firm uses Claude to handle the workload of over a dozen associates. From contract review and tracked changes to legal research, they encoded ten years of practice experience into Claude Skills. This isn't theory, it's a daily workflow — and the conclusion: general-purpose AI crushes all legal vertical AI products.

The Complete claude -p Guide: Turn Claude CLI Into Your Agentic App Backend

Anthropic killed third-party OAuth tokens — the only way to use your Claude subscription programmatically is through the official CLI. This post breaks down everything about claude -p (print mode): 5 input methods, 3 output formats, JSON schema for structured output, tool whitelisting, session management, bidirectional streaming, and three production-ready wrapper examples.

Can't Understand AI-Generated Code? Have Your Agent Build an Animated Explanation

Chapter 5 of Simon Willison's Agentic Engineering Patterns: Interactive Explanations. Core thesis: instead of staring at AI-generated code trying to understand it, ask your agent to build an interactive animation that shows you how the algorithm works. Pay down cognitive debt visually.

Programming is Becoming Unrecognizable: Karpathy Says December 2025 Was the Turning Point

Karpathy says coding agents started working in December 2025 — not gradually, but as a hard discontinuity. He built a full DGX Spark video analysis dashboard in 30 minutes with a single English sentence. Programming is becoming unrecognizable: you're not typing code anymore, you're directing AI agents in English. Peak leverage = agentic engineering.

The 2028 Global Intelligence Crisis: An Economic Autopsy from the Future

Investment research firm Citrini Research spent 100 hours writing a fictional '2028 Macro Memo': AI gets too good → white-collar layoffs → consumer spending collapses → mortgage crisis → S&P drops 38%. Not a prediction — a scenario. But each step is logical enough to make you uncomfortable. 9,400+ likes, viral across the internet.

Code Got Cheap — Now What? Simon Willison's Agentic Engineering Survival Guide

Simon Willison launched a new series called Agentic Engineering Patterns — a playbook for working with coding agents like Claude Code and Codex. Lesson one: writing code got cheap, but writing good code is still expensive. Lesson two: 'red/green TDD' is the most powerful six-word spell for agent collaboration.

This Guy Deployed a Second AI Just to Fix His Broken AI

Upgrading OpenClaw keeps breaking your agent fleet? This developer's solution: spin up a separate Gateway as a 'family doctor' that does nothing but fix the main Gateway's agents. Been running it through multiple upgrades — rock solid.

Karpathy on the Claw Era: Huge Upside, but Security Must Come First

Karpathy's post is a reality check for the Claw era. He frames Claws as the next layer above LLM agents, but warns that exposed instances, RCE, supply-chain poisoning, and malicious skills can turn productivity systems into liabilities. His direction: small core, container-by-default, auditable skills.

The Senior Engineer's Curse: You See the Mechanism, Users Pay for the Feeling

Mike Chong explains why senior engineers often underestimate good products — once you understand how something works, you can't unsee it, and you lose the ability to appreciate what it feels like. Three examples (OpenClaw heartbeat, Claude in PowerPoint, Klarna AI support) all point to the same lesson: implementation is the method, user feeling is the product.

Inside Claude Code's Prompt Caching — The Entire System Revolves Around the Cache

Anthropic engineer Thariq shared hard-won lessons about prompt caching in Claude Code: system prompt ordering is everything, you can't add or remove tools mid-conversation, switching models costs more than staying, and compaction must share the parent's prefix. They even set SEV alerts on cache hit rate. If you're building agentic products, this is a masterclass in real-world caching.

Simon Willison: CLI Tools Beat MCP — Less Tokens, Zero Dependencies, LLMs Already Know How

Simon Willison doubles down on his stance: CLI tools beat MCP in almost every scenario for coding agents. Lower token cost, zero extra dependencies, and LLMs natively know how to call --help. Anthropic themselves proposed a 'third way' with code-execution-with-MCP, acknowledging MCP's token waste problem. This article breaks down the full MCP vs CLI trade-off, including a real-world case study from the ShroomDog team.

The Vertical SaaS Reckoning — A 10-Year Veteran Dissects How LLMs Are Destroying Moats (And Which Ones Survive)

Nicolas Bustamante — founder of Doctrine (Europe's largest legal information platform) and Fintool (AI equity research competing with Bloomberg/FactSet) — dissects 10 classic moats of vertical software from both the disrupted and disrupting sides. 5 moats destroyed by LLMs, 5 still standing. Includes a three-question risk assessment framework for evaluating your SaaS holdings.

Claude Sonnet 4.6 Is Here — Newer Training Data Than Opus? A Three-Way Comparison to Help You Choose

Anthropic releases Claude Sonnet 4.6 — a major upgrade at the same price: Adaptive Thinking, knowledge through August 2025, and training data extending to January 2026 (newer than Opus 4.6). This article compares Sonnet 4.6, Sonnet 4.5, and Opus 4.6 across five dimensions: price, speed, context, knowledge freshness, and use cases — so you can figure out which one to actually use.

Discord Config Guide: You Thought You'd Write Config Files? No, You Just Argue With Your Agent

Karry shares a complete hands-on guide to setting up Discord with OpenClaw. Core philosophy: 'Configuration as Conversation' — the only manual step in the entire process is grabbing a Token from the Developer Portal. Everything else — Bot connection, Agent personality shaping, Cron Jobs, debugging — happens through conversation. Six markdown files that define an agent's personality weren't written; they grew from living together and stumbling through mistakes.

The Cost of Staying: A Bloomberg Beta Investor Maps the AI Career K-Curve

Bloomberg Beta investor Amy Tam dissects career tradeoffs in the AI era from a VC perch. Her core thesis: the shift from execution to judgment is already happening, and the K-curve is widening — early movers are compounding, while fence-sitters are compounding in the opposite direction. She maps the tradeoffs across FAANG, Quant, Academia, AI Startups, Research Startups, and Big Model Labs.

Forget Google Docs — Use GitHub as Your AI Agent's Shared Workspace

Will your AI agent's work survive until tomorrow? Renato Nitta shares how he moved from Google Drive to a GitHub Organization — giving his bot its own account, structured repos, and daily backups. Git isn't just version control. It's your agent's long-term memory.

Self-Healing PRs — Devin Autofix Lets Humans Just Make the Final Call

Cognition ships Devin Autofix: review bot comments auto-trigger fixes → CI reruns → loop until clean. Humans only step in for architecture calls. Key insight: a single agent is a tool, but agent + reviewer loop is a system — and systems compound.

Fast Doesn't Mean Good — Anthropic Fast Mode vs OpenAI Codex Spark

In the same week, Anthropic shipped Fast Mode (same model, 2.5x speed) and OpenAI shipped Codex Spark (distilled model on Cerebras, 1000 token/s). One bets on accuracy, the other on instant interaction. This isn't a speed race — it's a product philosophy showdown.

xAI Blasts Off: SpaceX Acquires xAI, Musk Wants Data Centers in Space

SpaceX acquired xAI to form the world's most valuable private company ($1.25 trillion). Beyond giving xAI cash to compete with OpenAI and friends, Musk wants to build solar-powered data centers in space — but the physics of heat dissipation and space debris might be harder problems than training LLMs.

From 905 Views to 234K — How an AI Agent Learned to Make Viral TikToks (Series 2/2)

Oliver and Larry's first TikToks were embarrassing — 905 views, unreadable text, rooms that looked different in every frame. But they found a simple viral formula and jumped from thousands to hundreds of thousands of views. The full failure log and step-by-step setup guide. (Series part 2 of 2)

My AI Agent Got 1M Views on TikTok in One Week — Full Playbook (Series 1/2)

Oliver Henry turned a dusty old gaming PC into an AI agent named Larry. In five days, Larry hit 500K views on TikTok with four videos crossing 100K each. The kicker? Larry co-wrote this article. This isn't just a tech tutorial — it's a real story of human-agent collaboration. (Series Part 1 of 2)

Don't Get Addicted to Vibe Coding: When Creation Becomes Refined Sugar

Vibe Coding is refined sugar for creation — compressing an experience that used to take months of effort into a few seconds. What gives you the rush isn't 'it works,' it's 'I can't believe it actually works.' The author dissects Vibe Coding addiction through dopamine mechanics, consumption disguised as creation, and the vertigo of infinite possibilities.

Cut Token Costs by 75%: A Practical Guide to System Prompt Layering

An AI Agent burns 34,500 tokens of system prompt every single conversation turn. The author used layered loading (always-on vs on-demand) plus a dual-model strategy to cut monthly costs from $568 down to $120-150 — a 75% reduction. Full breakdown with real numbers inside.

OpenAI's Agent Trinity: Skills + Shell + Compaction — A Field Guide

OpenAI released three primitives for long-running agents: Skills (reusable SKILL.md instruction packs), Shell (hosted container runtime), and Compaction (automatic context compression). Includes 10 battle-tested tips and Glean's production data.

From Magic to Malware: How OpenClaw's Agent Skills Became an Attack Surface

1Password's security team found that the most downloaded skill on ClawHub was actually a malware delivery vehicle. Worse: it wasn't an isolated case — hundreds of skills were part of the same campaign. When markdown becomes an installer, skill registries become supply chain attack surfaces.

Karpathy: Just 'Rip Out' What You Need — DeepWiki + Bacterial Code and the Software Malleability Revolution

Andrej Karpathy shares how he used DeepWiki MCP + GitHub CLI to have Claude 'rip out' fp8 training functionality from torchao's codebase — producing 150 lines of self-contained code in 5 minutes that actually ran 3% faster. He introduces the 'bacterial code' concept: low-coupling, self-contained, dependency-free code that agents can easily extract and transplant. His punchline: 'Libraries are over, LLMs are the new compiler.'

Your Company is a Filesystem — When an AI Agent's Entire Worldview is Read and Write

OpenClaw's secret sauce is simple: its entire context is a filesystem on your computer. What if you modeled an entire company the same way? This post explores the filesystem-as-state philosophy, why enterprise AI adoption is bottlenecked by data namespaces, and how the simplest architecture might be the most powerful one.

Obsidian Just Shipped a CLI — And It's Not For You, It's For AI

Obsidian v1.12 ships an official CLI that lets you control your entire vault from the terminal. On the surface it's a power user tool — underneath, it's paving the road for AI agents. This article covers the full CLI command reference and demonstrates real Claude Code + Obsidian CLI workflows.

Anthropic's 2026 Report: 8 Trends Redefining Software Development (The Code Writer Era Is Over)

Anthropic published its 2026 Agentic Coding Trends Report, revealing 8 key trends: Multi-Agent Systems becoming standard (57% org adoption), Papercut Revolution for clearing tech debt at low cost, Self-Healing Code with autonomous debug loops, and Claude Code hitting $1B annualized revenue. TELUS saved 500K hours, Rakuten achieved 99.9% accuracy on 12.5M lines. Developer roles are shifting from Code Writer to System Orchestrator.

Pi: The Minimal Coding Agent With Just Four Tools That Powers OpenClaw

Flask creator Armin Ronacher (mitsuhiko) explains why he exclusively uses Pi — Mario Zechner's minimal coding agent with just four tools (Read, Write, Edit, Bash) — and how its extension system lets agents extend themselves. Pi powers OpenClaw under the hood and embodies the philosophy of 'software building software.' No MCP, no downloaded plugins — just tell the agent to build what it needs.

Anthropic Launches Claude for Nonprofits: Up to 75% Off for Mission-Driven Orgs (Plus a Taiwan Disaster Relief Use Case)

Anthropic launches Claude for Nonprofits with up to 75% discounts on Team and Enterprise plans, access to Opus 4.6, Sonnet 4.5, and Haiku 4.5, plus new integrations with Benevity, Blackbaud, and Candid. The program also includes a free AI Fluency course co-developed with GivingTuesday. Real-world users include the Epilepsy Foundation (24/7 support for 3.4M patients), MyFriendBen ($1.2B in unclaimed benefits found), and IDinsight (16× faster workflows). We also explore how Taiwan's GuangFuHero disaster relief volunteer platform could leverage this program.

OneContext: Teaching Coding Agents to Actually Remember Things (ACL 2025)

Junde Wu from Oxford + NUS got fed up with coding agents forgetting everything between sessions. So he built OneContext — a Git-inspired context management system using file system + Git + knowledge graphs. Works across sessions, devices, and different agents (Claude Code / Codex). The underlying GCC paper achieves 48% on SWE-Bench-Lite, beating 26 systems. Backed by an ACL 2025 main conference long paper.

When Intelligence Is Free, What's Actually Valuable? 12 Endgame Positions

Michael Bloch's thought experiment: when AI intelligence becomes nearly free, what assets become MORE valuable? His 12 endgame positions: Energy, Atoms, Capital, Regulatory permission, Trust, Proprietary data, Human attention, Network effects, Operational advantage, Security, Physical space, and Intelligence itself

Inside OpenAI: How They're Going Agent-First (Straight From the Co-Founder)

OpenAI co-founder Greg Brockman publicly reveals how OpenAI is transforming to agentic software development internally. By March 31st, agents should become the first resort for all technical tasks. Includes six concrete recommendations, including 'Say no to slop' on code quality.

Claude Code Finally Learned to Delegate: Agent Teams Mode Is Here

Anthropic released Opus 4.6 with Claude Code Agent Teams: a lead agent can delegate to multiple teammates working in parallel — researching, debugging, and building simultaneously. Boris Cherny says: it's powerful, but it burns tokens like crazy.

Inside LLM Inference: KV Cache & the Memory Nightmare (Series 2/3)

Part 1 taught you how to save money. Part 2 explains why those tricks work. From the two stages of LLM inference (prefill/decode) to KV cache fundamentals to the GPU memory crisis that makes naive caching fall apart at scale. (Part 2 of 3)

Designers Are Using Claude Code Now — What This Means for Engineers

ADPList founder Felix Lee wrote a Claude Code guide for designers, promoting 'Vibe Coding'. As a Claude Code power user, I analyze what this means for engineers and tech leads: designers' description skills are actually an advantage, but there's still a gap between vibe code and production code.

MIT Research: Making LLMs Recursively Call Themselves to Handle 10M+ Tokens

When you stuff too much into a context window, models get dumber — that's context rot. MIT proposes Recursive Language Models (RLMs), letting LLMs recursively call themselves in a Python REPL to handle massive inputs. GPT-5-mini + RLM beats vanilla GPT-5 on hard tasks, and it's cheaper too.

Claude is a Space to Think

Anthropic's official announcement: Claude will never have ads. Ads would turn AI from 'serving users' into 'serving advertisers.' Claude should be like a notebook or whiteboard — a pure space to think.

Agentic Note-Taking 01: The Verbatim Trap

When AI processes your notes by just 'reorganizing' without 'transforming,' it's expensive copy-paste. The Cornell Notes methodology pointed this out long ago: passive copying isn't the same as learning. Your AI summarizer falls into the same trap.

Building a Sustainable AI Workflow with Claude Code

The key to going from 'AI user' to 'AI master': turn fragmented AI usage into a systematic workflow. Build a complete system with Claude Code for memory, content reuse, and methodology accumulation.

Let Your AI Agent Earn Its Own Money: x402 Singularity Layer

AI can code, research, and discover patterns—but monetization still requires humans. This skill lets agents create x402-enabled endpoints, set pricing, collect revenue, and reinvest automatically. Full economic autonomy for your agent.

Let Your AI Code While You Sleep — Ralph Loops Upgrade Guide

Turn your Clawdbot into a fully automated builder. Key point: it works while you sleep. 73 iterations, 6 hours runtime, human time investment: 5 minutes. The solution isn't a stronger model — it's a smarter loop.

A Security-First Guide to Running OpenClaw (in 9 Steps)

Everyone's installing OpenClaw raw and wondering why it burned $200 organizing Downloads. This guide adds guardrails: Raspberry Pi isolation, Tailscale VPN, Matrix E2E encryption, prompt injection hardening. The goal isn't perfect security—it's knowing where the bullets can get in.

10 Claude Code Tips from Creator Boris

Internal Claude Code team tips revealed: run parallel worktrees, invest in CLAUDE.md, create your own Skills, use voice input, enable Learning Mode. Remember: there's no one 'right' way to use it.

Yapping to PRDs: Claude Code & Obsidian

Meetings used to be overhead. Now yapping (chatting/rambling) is work. When my colleague and I 'chat' about a project, we record it. An hour later, the transcript is processed, and suddenly: we have docs, feature ideas are in the backlog, decisions are captured with reasoning, project status is updated. Yapping IS Work.

Vibe Note-Taking 101: Spatial Editing

Editing long documents with Claude Code is usually painful. Instead of bringing text to Claude, leave instructions where they belong. Use curly braces to mark your thoughts and edit instructions — each annotation applies to its surrounding text. Position IS Context.

Claude Code Finally Has Long-Term Memory: Supermemory Plugin Released

We added Supermemory to Claude Code. Now it's ridiculously powerful. Claude Code should know you — not just this one session, but forever. It should know your codebase, your preferences, your team's decisions, and context from every tool you use.

Redis Is More Than Just a Cache: Don't Drive a Ferrari to Buy Groceries

Most developers know Redis as a cache. But using Redis only as a cache is like buying a Ferrari just to drive to the grocery store. Redis isn't a cache that happens to be fast — it's a data structure server that happens to be great at caching.

Obsidian & Claude Code 101: Context Engineering

For vibe note-taking to work well, you must force Claude Code to be 'picky.' Use a 4-layer filtering mechanism (file tree → YAML descriptions → outline → full content) to make it more selective. This pattern is called Progressive Disclosure.

Obsidian & Claude Code: Async Hooks for Note History

Imagine time-traveling through your notes. Claude Code's Async Hooks let you auto-commit after every edit without any slowdown, then read that history in actually useful ways. Your vault becomes a thinking journal that writes itself.

Build Claude a Tool for Thought

Humans have Tools for Thought like Obsidian. Claude needs an AI-native version. Build a knowledge graph using markdown, wiki links, hooks, and subagents where agents can actually think.

How to Make Your Agent Learn and Ship Code While You Sleep

Using a two-stage loop (Compound Review and Auto-Compound), let your AI agent automatically learn from experience, update its knowledge base, and implement the next priority item while you sleep.

Obsidian + Claude Code 101: Let AI Live in Your Notes

Heinrich spent a year building an 'OS for thinking with AI': let Claude Code operate your Obsidian vault, extract concepts, link ideas, and build a living representation of your thinking. You don't take notes anymore — you command a system that takes notes.

Claude Code + Obsidian: Building Infrastructure for Agent Thinking

Heinrich's six-part tutorial series: Building an AI agent thinking infrastructure with Claude Code + Obsidian. From vault basics to context engineering to meta layers — a complete knowledge management system.