codex - Tags - gu-log

The AI Draft Was Good — You Edited It Anyway. That Deleted Line Is the Context It Needs Next Time

GP-236 2026-06-18 · @gabrielchua on X

Every two hours, Codex drafts email replies for review. The drafts are good — he edits them anyway. Those edits are context too, and most automations throw them away. The fix: an inner loop brings context to the work; an outer loop recovers context from the review diff.

Your Phone Is Not a Tiny Terminal — It Is the Agent Control Center

GP-227 2026-06-15 · @Dimillian on X

Dimillian (an iOS dev now at OpenAI) wrote a field guide for Codex Mobile. The part worth keeping is a mental model that holds across tools: your phone is not a shrunken terminal, it is the control center that keeps you making decisions while the agent does the work.

shroom-picks agent workflow

Supergoal Turns Coding Agents from Multi-Turn Babysitting into a Single /goal Handoff

GP-218 2026-06-07 · robzilla1738 / Supergoal

Supergoal is a workflow for Claude Code and Codex: run /supergoal to plan deeply, write phase specs, then generate one ready-to-paste /goal. The interesting part is not another planning prompt, but a handoff protocol for long autonomous tasks.

shroom-picks ai-agents claude-code developer-tools

Do Not Let Codex Teach You: Turn AI Into a Learning Coach in 5 Steps

GP-213 2026-05-30 · @Moting284 on X

When learning a new tool with Codex, the worst move is asking it to give you a lecture. A better pattern is to ask it for an entry point, a rough map, a tiny exercise, a teach-back check, and breadcrumbs for next time.

shroom-picks ai-agents learning workflow

Let Agents Dream: Weekly Maintenance That Turns Repeated Work Into Skills

SD-25 2026-05-25 · ShroomDog Lab

Vaibhav Srivastav's Codex prompt is interesting because it describes an agent maintenance loop: look back at recent work, find repeated workflows, and package only high-confidence patterns into Skills, automations, or subagents. It is agent dreaming: turning busy work into capability.

shroomdog-original agent skill automation memory workflow

Codex Is No Longer Just for Code — It Is Becoming an Operating System for Computer Work

GP-210 2026-05-23 · @jxnlco on X

Codex is no longer only editing code. Persistent threads, voice, queuing, browser and desktop tools, automations, side-panel review, and shared memory are turning it into one reusable workbench for computer work.

shroom-picks ai-agents newcomer

OpenAI's Codex Goals Guide: Agents Should Not Finish by Vibes

GP-208 2026-05-20 · OpenAI Cookbook

OpenAI's Cookbook frames Codex Goals as a thread-scoped completion contract: the objective persists, but completion must be checked against evidence. This post fills in the official spec angle around SP-192, SP-197, and SP-207.

shroom-picks agent ai-engineering

An AI Agent Needs More Than a Goal

GP-207 2026-05-18 · @PawelHuryn on X

OpenAI and Anthropic both pushed /goal-like ideas into coding agents. A goal helps, but production agents also need strategy, constraints, health metrics, autonomy boundaries, and stop rules.

shroom-picks ai-agents claude-code intent-engineering

Codex Is Becoming the Runtime Kernel for AI Agents

SD-24 2026-05-15 · ShroomDog Lab

OpenClaw and Hermes are both handing low-level coding-agent execution to Codex app server. This is not just a model switch. It is the agent product stack separating model, execution engine, and chat surface.

shroomdog-original openclaw hermes-agent agent-harness runtime architecture

Codex CLI Memory Is Not Magic. It Is a Stack of Greppable Markdown

GP-200 2026-05-14 · @mem0ai on X

Mem0 breaks down Codex CLI memory: not a vector database, but local Markdown, background summaries, credential scrubbing, and grep search. This post looks at when local notes are enough, and when a semantic memory layer makes sense.

shroom-picks memory mem0 mcp

Codex Goal Mode Isn't Magic: Loops Need a Finish Line, Tests, and Memory

GP-197 2026-05-12 · @ChrisHayduk on X

Codex `/goal` is not a wish machine. Chris Hayduk's real point is engineering discipline: give the agent a measurable finish line, a fast feedback loop, and Markdown files that work as long-term memory.

shroom-picks agent workflow

Inside Codex Goals: Long-Running Agents Need More Than a Ralph Loop

GP-192 2026-05-08 · @jarrodwatts on X

Jarrod Watts looked inside Codex Goals and found that it solves early stopping, not long-run drift. The real long-running agent stack needs upfront clarification, multi-agent review, and memory outside the context window.

shroom-picks agent ai-engineering

OpenAI Open-Sources Symphony: When Codex Workflow's Bottleneck Shifts From 'Writing Code' To 'Context Switching'

GP-187 2026-04-28 · OpenAI Engineering blog

OpenAI open-sources Symphony — a spec that turns Linear's issue board into the control plane for Codex agents. Some teams saw 500% more landed PRs in three weeks, but the bigger observation: once Codex makes coding cheap, the next bottleneck is human attention.

shroom-picks symphony agent-orchestration openai linear

OpenAI Open-Sources Euphony: A Mirror for Codex, Plus a Masterclass in 2-Line AGENTS.md

MP-301 2026-04-21 · openai/euphony on GitHub

OpenAI quietly open-sourced Euphony — a browser-based viewer for Harmony chats and Codex session logs (Apache 2.0). Four telling details buried in the source: a 2-line AGENTS.md, gpt-tokenizer as a runtime dep, translation needing the user's own API key, and a self-written SSRF warning.

euphony openai agents-md ai-tooling observability web-components

One `message Romain` prompt runs the whole workflow — OpenAI DevX demos Codex Chronicle, but the costs the tweet skipped matter too

GP-176 2026-04-21 · @dkundel on X

OpenAI DevX's Dominik Kundel says Chronicle means he no longer packages context for AI: one line can sync docs, edit markdown, open a PR, and DM Slack. Nice, but Chronicle's costs are real: screen recording, unencrypted local memories, and prompt-injection risk.

openai chronicle agent-memory agent-harness context-engineering

Nick Baumann: The Best Tools for Codex Are Bespoke CLIs

GP-170 2026-04-11 · @nickbaumann_ on X

Nick Baumann isn't chasing MCP or the next protocol. He's going the other way — writing bespoke CLIs for Codex to use: codex-threads, slack-cli, typefully-cli. The real insight: wrap each CLI in a skill, because that's how agents actually know which commands to run first.

shroom-picks cli agent-tooling skill

Why Programmers Love Codex While Vibe Coders Can't Quit Claude: Dense vs MoE Is Really a Story About Two Coding Philosophies

GP-155 2026-04-02 · @berryxia on X

Berryxia uses Dense vs MoE to explain why Codex shines at bug fixes, refactors, and long-running engineering while Claude wins vibe coders. The real split is broader: training philosophy, product design, and precise delegation versus interactive creation.

shroom-picks claude vibe-coding moe dense-transformer

Stop Managing Agents, Start Managing Work: Symphony's Open-Source Workflow

MP-179 2026-03-17 · @daniel_mac8 on X

@daniel_mac8 shares an open-source Elixir implementation: create a Linear issue and move it to 'in progress,' and Symphony picks it up in a dedicated Codex workspace. Codex even writes status updates back. The author argues this is software development moving up an abstraction layer.

ai-agents workflow symphony linear

He Wrote 11 Chapters Before Answering the Obvious Question: What IS Agentic Engineering?

MP-171 2026-03-16 · @simonw on X

Simon Willison finally defines Agentic Engineering after 11 hands-on chapters: using coding agents to help build software. The interesting part is why he needed the patterns first before the simple definition felt earned.

agentic-coding simonw-agentic-patterns simon-willison ai-agents claude-code best-practices

Treat Codex Like a Teammate, Not a Tool: 10 Best Practices That Actually Work

GP-110 2026-03-10 · @derrickcchoi on X

A guide to Codex best practices from prompting and planning to MCP, Skills, and Automations — building a more reliable agent workflow.

ai-agents best-practices

Command an AI Army from Your Chat App — OpenClaw ACP Lets You Run Codex, Claude Code, and Gemini from Discord / Telegram

GP-89 2026-03-09 · OpenClaw Docs

OpenClaw's ACP lets you spawn Codex, Claude Code, and Gemini from Discord/Telegram chat. Now with Telegram topic binding, persistent bindings that survive restarts, ACP Provenance for audit trails, and more. (Updated 2026-03-09)

openclaw acp agent-client-protocol ai-agents claude-code gemini multi-agent agentic-coding

Reverse-Engineering Codex: Cracking Open the Context Compaction API with Prompt Injection

GP-103 2026-03-04 · @Kangwook_Lee on X

Developer Kangwook Lee used just 2 API calls and 35 lines of Python to crack open Codex's hidden context compaction API via prompt injection — revealing the secret system prompts behind the encryption.

prompt-injection reverse-engineering

Agent Harness Engineering: How OpenAI Built a Million Lines of Code With Zero Human-Written Code

GP-98 2026-03-03 · OpenAI Blog

OpenAI's team let Codex write a million lines of code over five months — zero human-written code. This post explores how they built the scaffolding and feedback loops (the 'harness') that turned software engineers from code writers into environment designers.

ai-agents agent-harness openai

Karpathy Built an 8-Agent AI Research Team — They Can't Actually Do Research

MP-135 2026-03-01 · Andrej Karpathy (@karpathy)

Karpathy spent a weekend running 4 Claude + 4 Codex agents as an ML research team on GPUs. The result: agents are S-tier at implementation but F-tier at experiment design. His key insight — 'You are now programming an organization' — might define agentic engineering in 2026.

karpathy multi-agent ai-research agentic-coding nanochat claude-code

One Person = One Dev Team: The Complete Setup for Commanding a Codex/Claude Code Army with OpenClaw

GP-84 2026-02-24 · Elvis Sun @elvissun

Indie hacker Elvis Sun uses an OpenClaw agent, Zoe, to orchestrate Codex and Claude Code agents: 50 commits per day, seven PRs in 30 minutes, three layers of AI review, and proactive Sentry bug fixes. Cost: $190/month.

openclaw claude-code agent-swarm orchestration one-person-company automation indie-hacker

Code Got Cheap — Now What? Simon Willison's Agentic Engineering Survival Guide

GP-80 2026-02-23 · Simon Willison @simonw

Simon Willison launched Agentic Engineering Patterns, a playbook for coding agents like Claude Code and Codex. Lesson one: writing code got cheap, but good code remains expensive. Lesson two: red/green TDD is the six-word spell.

agentic-coding ai-agents claude-code tdd best-practices simon-willison simonw-agentic-patterns

OpenClaw Creator Runs 50 Codex Agents for PR Triage: Handling 3,000+ Changes Without a Vector DB

MP-111 2026-02-22 · Peter Steinberger (@steipete)

Peter Steinberger shares a high-scale PR triage workflow: run 50 Codex agents in parallel, emit structured JSON for every PR, then consolidate in one large-context session. Clean reports can beat premature vector database architecture.

openclaw pr-review automation tech-lead agentic-coding

33,000 Agent PRs Tell a Brutal Story: Codex Dominates, Copilot Struggles, and Your Monorepo Might Not Survive

MP-84 2026-02-16 · Drexel University / Missouri S&T (MSR 2026)

Drexel/Missouri S&T analyzed 33,596 agent-authored GitHub PRs from 5 coding agents. Overall merge rate: 71%. Codex: 83%, Claude Code: 59%, Copilot: 43%. Rejection cause: no review. LeadDev warns PR flood is crushing monorepos/CI.

research agentic-coding pull-requests ci-cd monorepo code-review claude-code copilot tech-lead

GitHub Agent HQ: Claude, Codex, and Copilot Now Fight Side by Side in the Same PR — The Multi-Agent Era Is Here

MP-82 2026-02-15 · GitHub Blog

GitHub's Agent HQ now offers multi-agent support (Claude, Codex, Copilot) for Copilot Pro+ & Enterprise users. Run multiple AIs simultaneously in GitHub/VS Code to tackle problems from different angles. Outputs become Draft PRs. A paradigm shift for code review.

github copilot claude-code multi-agent code-review developer-tools agentic-coding

OpenAI's Agent Trinity: Skills + Shell + Compaction — A Field Guide

GP-54 2026-02-13 · OpenAI

OpenAI released three primitives for long-running agents: Skills (reusable SKILL.md instruction packs), Shell (hosted container runtime), and Compaction (automatic context compression). Includes 10 battle-tested tips and Glean's production data.

openai agent-skills shell compaction best-practices

OpenAI × Cerebras: Codex-Spark Codes 15x Faster — But What's the Catch?

MP-74 2026-02-12 · OpenAI Blog + Cerebras Blog + ZDNET + TechCrunch

OpenAI released GPT-5.3-Codex-Spark, its first model on Cerebras chips. It's incredibly fast (>1000 tokens/sec, 80% lower latency), but smaller, no auto-tests, Pro-only. This marks OpenAI's first production deployment on non-Nvidia hardware, redrawing the AI compute landscape.

openai cerebras inference hardware agentic-coding

Running Codex Inside Claude Code (The Elegant Way)

GP-52 2026-02-12 · @discountifu

Hook up Codex as an MCP server inside Claude Code with a single command. Why fight Codex CLI's rough edges when you can plug its brain into a better body?

claude-code mcp developer-tools

OpenAI Researcher Spends $10K/Month on Codex — Generates 700+ Hypotheses

GP-39 2026-02-07 · @KarelDoostrlnck on X

Karel (OpenAI researcher) shares how he burns billions of Codex tokens: agents writing their own notes, crawling Slack, analyzing data, and generating 700+ hypotheses. He now talks to one agent that orchestrates everything else.

agentic-coding openai

Inside OpenAI: How They're Going Agent-First (Straight From the Co-Founder)

GP-38 2026-02-06 · @gdb on X

OpenAI co-founder Greg Brockman publicly reveals how OpenAI is transforming to agentic software development internally. By March 31st, agents should become the first resort for all technical tasks. Includes six concrete recommendations, including 'Say no to slop' on code quality.

openai agentic-development software-engineering ai

Claude Code vs Codex: Pick the Right Tool for the Job

GP-2 2026-01-29 · @0xdevshah on X

Claude Code is a Templar — steady and reliable. Codex is a Glass Cannon Mage — explosive output but easy to blow up. Pick your quest, then pick your character.

claude-code tools