📚 ShroomDog Picks

Long-form articles, translated and explained

204 posts

← Back to home

Do Not Let Codex Teach You: Turn AI Into a Learning Coach in 5 Steps

When learning a new tool with Codex, the worst move is asking it to give you a lecture. A better pattern is to ask it for an entry point, a rough map, a tiny exercise, a teach-back check, and breadcrumbs for next time.

Google's Code Review Guide: Don't Chase Perfect, Protect Code Health

Google Engineering Practices frames code review as code-health work, not a perfection ritual: approve CLs that improve the system, while aligning design, tests, speed, comments, and author habits around maintainability.

The AI refusal switch may live in 0.1% of neurons

Nous Research proposes CNA, a method that uses contrastive prompts to find a tiny set of MLP neurons tied to refusal behavior. The interesting point is not just jailbreaks, but what this says about alignment fine-tuning.

OpenAI's Codex Goals Guide: Agents Should Not Finish by Vibes

OpenAI's Cookbook frames Codex Goals as a thread-scoped completion contract: the objective persists, but completion must be checked against evidence. This post fills in the official spec angle around SP-192, SP-197, and SP-207.

An AI Agent Needs More Than a Goal

OpenAI and Anthropic both pushed /goal-like ideas into coding agents. A goal helps, but production agents also need strategy, constraints, health metrics, autonomy boundaries, and stop rules.

AI Coding in Large Codebases Is Not Won by the Model Alone

Whether Claude Code works inside a large codebase is not just about model scores. The real question is whether the team has built rails for the agent: maps, automation, on-demand tools, symbol navigation, internal-system access, and someone to maintain the whole operating setup.

Do Not Outsource the Learning to AI

Addy Osmani warns that default AI coding workflows help people close tasks, but do not automatically make them sharper. The difference is not whether engineers use AI; it is whether they use it to test and grow their own mental models.

When Tokens Stop Being the Limit: OpenClaw's Always-On Agent Experiment

Peter Steinberger says OpenClaw often runs about a hundred Codex instances in the cloud. The point is not showing off AI spend. It is testing what software work looks like when review, triage, security, reproduction, benchmarks, and meeting follow-up become always-on agent work.

Bun Moving to Rust Should Not Have Become a Language War

Mitchell Hashimoto's point about Bun moving from Zig to Rust is not that Rust won and Zig lost. The more useful lesson is that programming languages are becoming more replaceable, and developer-tool companies need to manage technical narratives before the internet turns them into faction wars.

Anthropic’s 2028 AI Leadership: Two Scenarios and a Compute Race

Anthropic lays out two 2028 scenarios for AI leadership: the US and its allies preserve their compute and model lead, or a CCP-controlled AI ecosystem catches up near the frontier. The essay centers on compute, export controls, model distillation, and whether democracies can set the rules first.

Codex CLI Memory Is Not Magic. It Is a Stack of Greppable Markdown

Mem0 breaks down Codex CLI memory: not a vector database, but local Markdown, background summaries, credential scrubbing, and grep search. This post looks at when local notes are enough, and when a semantic memory layer makes sense.

Memory in Voice Agents Is Harder Than You Think

Voice agents cannot reuse text-agent memory architectures as-is. Manthan Gupta breaks down why latency budgets, noisy transcripts, and cold-start identity make voice memory a different problem.

AI Writing Code Isn't the Scary Part. Shipping Without a Ratchet Is

Garry Tan argues the real breakthrough in AI coding is not speed. It's turning tests, docs, and evals into a forward-only quality ratchet, so every change locks in what the team learned and makes the codebase harder to silently degrade.