Gu-log Picks

Long-form articles, translated and explained

249 posts

Supergoal Turns Coding Agents from Multi-Turn Babysitting into a Single /goal Handoff

GP-218 2026-06-07 · From robzilla1738 / Supergoal

Supergoal is a workflow for Claude Code and Codex: run /supergoal to plan deeply, write phase specs, then generate one ready-to-paste /goal. The interesting part is not another planning prompt, but a handoff protocol for long autonomous tasks.

When Claude Starts Building Claude: Anthropic’s Internal Signals Before Recursive Self-Improvement

GP-217 2026-06-05 · From Anthropic

Anthropic argues AI is already speeding up AI development. Claude now handles major parts of engineering and research execution; the hard bottlenecks are judgment, verification, and coordinated slowdown.

The Architect in the AI Era: When Machines Can Code, What Is Still Valuable in Your Head?

GP-216 2026-06-05 · From @dashen_wang on X

When machines start writing code, the scarce skill is not tool fluency. It is architectural judgment: digging below abstractions, defining boundaries, writing specs, falsifying claims, and deciding where human judgment still matters.

Cursor Spent $260 to Move Its Website Back From a CMS to Code

GP-215 2026-06-03 · From Lee Robinson

Cursor moved cursor.com from a headless CMS back to raw code and Markdown. The important part is not just the $260 bill. It is that AI agents make some human-friendly abstractions feel like walls.

A Harness for Every Task: Dynamic Workflows in Claude Code

GP-214 2026-06-03 · From Anthropic Blog / @trq212 on X

Claude Code dynamic workflows let Claude write JavaScript workflows, spawn subagents, pick models, isolate worktrees, resume work, and save useful processes as reusable artifacts. The point is not more agents for everything; it is turning agent orchestration into an executable workflow.

Do Not Let Codex Teach You: Turn AI Into a Learning Coach in 5 Steps

GP-213 2026-05-30 · From @Moting284 on X

When learning a new tool with Codex, the worst move is asking it to give you a lecture. A better pattern is to ask it for an entry point, a rough map, a tiny exercise, a teach-back check, and breadcrumbs for next time.

How Anthropic Contains Claude: Agent Safety Is Not Just Asking for More Confirmations

GP-212 2026-05-27 · From Anthropic Engineering

Anthropic explains how claude.ai, Claude Code, and Claude Cowork contain agents: model defenses miss, permission prompts create fatigue, and the hard boundary is the VM, sandbox, filesystem policy, and egress control.

Google's Code Review Guide: Don't Chase Perfect, Protect Code Health

GP-211 2026-05-24 · From Google Engineering Practices (via @nini_incrypto_ on X)

Google Engineering Practices frames code review as code-health work, not a perfection ritual: approve CLs that improve the system, while aligning design, tests, speed, comments, and author habits around maintainability.

Codex Is No Longer Just for Code — It Is Becoming an Operating System for Computer Work

GP-210 2026-05-23 · From @jxnlco on X

Codex is no longer only editing code. Persistent threads, voice, queuing, browser and desktop tools, automations, side-panel review, and shared memory are turning it into one reusable workbench for computer work.

The AI refusal switch may live in 0.1% of neurons

GP-209 2026-05-20 · From Nous Research on X

Nous Research proposes CNA, a method that uses contrastive prompts to find a tiny set of MLP neurons tied to refusal behavior. The interesting point is not just jailbreaks, but what this says about alignment fine-tuning.

OpenAI's Codex Goals Guide: Agents Should Not Finish by Vibes

GP-208 2026-05-20 · From OpenAI Cookbook

OpenAI's Cookbook frames Codex Goals as a thread-scoped completion contract: the objective persists, but completion must be checked against evidence. This post fills in the official spec angle around SP-192, SP-197, and SP-207.

An AI Agent Needs More Than a Goal

GP-207 2026-05-18 · From @PawelHuryn on X

OpenAI and Anthropic both pushed /goal-like ideas into coding agents. A goal helps, but production agents also need strategy, constraints, health metrics, autonomy boundaries, and stop rules.

AI Coding in Large Codebases Is Not Won by the Model Alone

GP-206 2026-05-19 · From Claude Blog

Whether Claude Code works inside a large codebase is not just about model scores. The real question is whether the team has built rails for the agent: maps, automation, on-demand tools, symbol navigation, internal-system access, and someone to maintain the whole operating setup.

Do Not Outsource the Learning to AI

GP-205 2026-05-18 · From @addyosmani on X

Addy Osmani warns that default AI coding workflows help people close tasks, but do not automatically make them sharper. The difference is not whether engineers use AI; it is whether they use it to test and grow their own mental models.

When Tokens Stop Being the Limit: OpenClaw's Always-On Agent Experiment

GP-204 2026-05-16 · From @steipete on X

Peter Steinberger says OpenClaw often runs about a hundred Codex instances in the cloud. The point is not showing off AI spend. It is testing what software work looks like when review, triage, security, reproduction, benchmarks, and meeting follow-up become always-on agent work.

Bun Moving to Rust Should Not Have Become a Language War

GP-203 2026-05-16 · From @mitchellh on X

Mitchell Hashimoto's point about Bun moving from Zig to Rust is not that Rust won and Zig lost. The more useful lesson is that programming languages are becoming more replaceable, and developer-tool companies need to manage technical narratives before the internet turns them into faction wars.

Anthropic’s 2028 AI Leadership: Two Scenarios and a Compute Race

GP-202 2026-05-15 · From Anthropic

Anthropic lays out two 2028 scenarios for AI leadership: the US and its allies preserve their compute and model lead, or a CCP-controlled AI ecosystem catches up near the frontier. The essay centers on compute, export controls, model distillation, and whether democracies can set the rules first.