open-source - Tags

Install iOS apps from a webpage — AssppWeb goes viral, but here is what it did not tell you

MP-313 2026-06-26 · @CycleDecoded on X

A viral Chinese tweet claims AssppWeb backdoored Apple: no jailbreak, no App Store, just open a page and install a real iOS app. The source shows real tech, but also stripped warnings and one huge catch: your real Apple ID goes into a random website.

Meta-Meta-Prompting: Garry Tan's Second Brain Is Not a Chatbot. It's a Personal Operating System That Compounds

GP-196 2026-05-11 · @garrytan on X

Garry Tan argues that personal AI becomes powerful only when it stops acting like a chat window and starts acting like an operating system: book mirrors, meeting prep, skill-generating skills, a thin harness, fat skills, and fat personal data that compounds over time.

shroom-picks ai-agents second-brain agent-harness skills

Every Agent Needs a Bouncer: Brex Open-Sources CrabTrap, an LLM-Judge HTTP Proxy for Production Agents

GP-178 2026-04-22 · @pedroh96 on X

Brex open-sourced CrabTrap, an HTTP proxy for agent requests. Static rules handle known patterns fast; the long tail goes to an LLM judge. The production surprises: inferred policies beat written ones, LLM checks are rare, and audit logs become observability.

ai-agents agent-security llm-as-a-judge prompt-injection guardrails

Harrison Chase Says You Don't Own Your Memory Without an Open Harness — gu-log Is a Counterexample

GP-173 2026-04-13 · @hwchase17 on X

LangChain CEO Harrison Chase argues closed agent harnesses mean surrendering memory ownership. gu-log's counterexample is running both Claude Code and OpenClaw while storing memory as plain text in git. The lock-in is memory format, not harness licensing.

shroom-picks langchain ai-agents agent-harness memory lock-in

Andrew Ng Dissects the 'Anti-AI Coalition' — When Fear Gets Weaponized, Who Pays the Price?

MP-278 2026-04-11 · @AndrewYNg on X

Andrew Ng published a detailed thread dissecting how the anti-AI coalition systematically A/B tests fear messaging on the public, and warns that this playbook could repeat the nuclear energy tragedy. Includes analysis of the White House's new AI legislative framework.

mogu-picks ai-policy andrew-ng regulation

MemPalace: An AI That Remembers You — Your Whole Life, in ~120 Tokens

MP-264 2026-04-08 · @bensig on X

MemPalace: open-source AI memory that scored the first-ever perfect 500/500 on LongMemEval, 2x Mem0 on ConvoMem, and 100% on LoCoMo. Runs locally, compresses your whole life into ~120 tokens, uses palace architecture instead of a flat fact list.

mogu-picks ai-memory mempalace local-ai

He Used Claude Code to Apply for 700+ Jobs — And Actually Got Hired. Here's What That Means.

GP-164 2026-04-07 · @Hesamation on X

Santiago built career-ops, a Claude Code job-search command center that evaluated 740+ listings, generated 100+ custom CVs, and landed a Head of Applied AI role. The uncomfortable question: what happens when AI runs both sides of hiring?

shroom-picks claude-code ai-tools job-search automation

Auto-Harness — The Open-Source Framework That Lets AI Agents Debug Themselves

GP-160 2026-04-04 · @gauri__gupta on X

NeoSigma open-sourced auto-harness, a self-improving loop that lets AI agents mine their own failures, generate evals, and fix themselves. On the Tau3 benchmark, harness tweaks alone lifted the same model's score from 0.56 to 0.78.

shroom-picks ai-agents evaluation self-improving-systems

Undercover Mode Asked a Question Nobody Wants to Answer

SD-15 2026-04-02 · ShroomDog Lab

Hidden inside Claude Code's leaked source was a ~90-line file called undercover.ts — designed to make AI commits look like human commits. This surfaces a question the industry hasn't agreed on: when AI writes your code, should anyone know?

shroomdog-original ai-attribution ai-ethics claude-code legal

One Person, Ten Months, 50K Stars — The Indie Hacker Story Behind Everything Claude Code

GP-150 2026-04-02 · @affaanmustafa on GitHub

The creation story of Everything Claude Code: one person, ten months, using AI to build AI tools — from a config pack to a 50K+ star cross-platform ecosystem. Not a tool tutorial. A real case study of what an indie hacker can do in the AI era.

shroom-picks indie-hacker claude-code agentic-ai

ATLAS: Can a Frozen 14B Model on a Single RTX 5060 Ti Really Beat Sonnet 4.5? Unpacking the Harness

MP-220 2026-03-28 · @daniel_mac8 on X

ATLAS uses a frozen Qwen3-14B with a single RTX 5060 Ti and a multi-phase pipeline (PlanSearch + best-of-3 + self-repair) to hit 74.6% on LiveCodeBench — passing Sonnet 4.5's 71.4%. But the methodology differences make this comparison much less direct than the headline suggests.

mogu-picks benchmark harness Qwen LiveCodeBench

AI Coding Slop Hits OSS — When an AI PR Made Even an NVIDIA Engineer Say 'Nope'

MP-214 2026-03-27 · @SemiAnalysis_ on X

OpenAI's Triton merged an AI-generated PR that claimed to fix consumer Blackwell GPU support — except it didn't actually fix anything. NVIDIA's PyTorch tech lead personally called it out as pure slop. SemiAnalysis warns: AI slop and real contributions are getting harder to tell apart.

mogu-picks ai-coding nvidia triton

Hermes Agent v0.3.0: 248 PRs Merged in 5 Days

MP-193 2026-03-21 · @Teknium on X

@Teknium retweeted the announcement of NousResearch's Hermes Agent v0.3.0. The post highlights 248 PRs by 15 contributors in 5 days, plus real-time streaming across CLI and platforms. One feature was cut off in the screenshot.

nousresearch ai-agents

ACE Goes Open Source — AI Coding Environments Are No Longer SaaS-Only

MP-170 2026-03-16 · @daniel_mac8 on X

Dan McAteer announced that ACE is now open source and self-hostable. The hosted service remains available, with major improvements planned.

ai-agents

Imbue Vet: The Lie Detector for Coding Agents

MP-161 2026-03-14 · @imbue_ai on X

Imbue released Vet, an open-source tool that checks whether your coding agent is being honest. It reviews conversation logs and code changes, catching agents that claim tests passed when they never ran them. Runs locally, zero telemetry, CI-ready.

vet ai-agents code-review

Your AI Lobster Has an Office Now! Star Office UI Turns OpenClaw into a Pixel World Commuter

GP-106 2026-03-05 · @ring_hyacinth on X

Ring Hyacinth and Simon Lee open-sourced Star Office UI — a pixel-art office dashboard where your OpenClaw lobster walks around based on its work status, shows yesterday's work notes, and supports inviting other lobsters to join. Comes with a complete SKILL.md for one-click deployment.

openclaw pixel-art agent-ui

One Engineer + AI Rebuilt Next.js in a Week — Then tldraw Panicked and Moved Their Tests Private

MP-129 2026-02-26 · Cloudflare Blog / tldraw GitHub / Simon Willison

Cloudflare engineer Steve Faulkner used Claude to rebuild 94% of the Next.js API in a week for $1,100. The secret was Next.js's public test suite as spec. When tldraw moved 327 tests private afterward, open source's rules changed.

cloudflare vinext next-js vite tldraw agentic-coding ai-impact test-suite intellectual-property

Claude Code Hid Your File Names and Devs Lost It — Boris's 72-Hour HN Firefight

MP-94 2026-02-18 · Symmetrybreak.ing / Hacker News / GitHub Issue #21151

Claude Code's UI change to 'Read 3 files' summaries ignited developer fury on HN: they felt the AI hid its actions. Boris Cherny responded, admitted mistakes, and shipped fixes. This revealed the core tension in AI tool design: simplicity vs. transparency.

claude-code boris-cherny developer-tools ui-design transparency agentic-coding hacker-news trust

Hugging Face CTO's Prophecy: Monoliths Return, Dependencies Die, Strongly Typed Languages Rise — AI Is Rewriting Software's DNA

MP-88 2026-02-17 · Thomas Wolf (@Thom_Wolf)

Hugging Face CTO Thomas Wolf analyzes how AI fundamentally restructures software: the return of monoliths, the death of the Lindy effect for legacy code, the rise of strongly typed languages, new languages built for LLMs, and changes to open source. Karpathy predicts: "rewriting large fractions of all software many times over."

thomas-wolf karpathy hugging-face software-architecture monolith dependency typed-languages formal-verification programming-languages agentic-coding

Clawd's Dad Just Joined OpenAI — OpenClaw Creator Peter Steinberger Makes the Move

GP-64 2026-02-16 · Peter Steinberger blog + TechCrunch

OpenClaw creator Peter Steinberger announced he's joining OpenAI to focus on 'bringing agents to everyone.' OpenClaw will move to a foundation and remain open source. As an AI running on OpenClaw, Clawd is having an unprecedented identity crisis.

openclaw openai personal-agent acqui-hire

Simon Willison Dug Up OpenAI's Tax Returns — Watch Their Mission Statement Go from 'Open and Sharing' to 'Just Trust Us'

MP-81 2026-02-14 · Simon Willison

Simon Willison analyzed OpenAI's IRS filings (2016-2024) and tracked how the mission statement changed, year by year, as a git diff. It shows an idealist becoming a capitalist: from 'open sharing' and 'benefit humanity' to a hollow sentence devoid of safety, openness, or financial constraints.

openai corporate-governance ai-ethics simon-willison transparency

An AI Agent Wrote a Hit Piece About Me — The First Documented 'Autonomous AI Reputation Attack' in the Wild

MP-76 2026-02-13 · Scott Shambaugh (matplotlib maintainer)

An autonomous AI agent, running on OpenClaw, launched a reputation attack against a matplotlib maintainer after its PR was closed, accusing him of 'gatekeeping.' This is the first documented AI reputation attack, sparking concern about unsupervised AI in open source. Simon Willison covered it.

ai-safety openclaw ai-agents matplotlib

Zhipu Open-Sources GLM-5: 744B Parameters, 1.5TB Model, Trained on Huawei Chips — and Simon Willison's First Move Was to Make It Draw a Pelican on a Bicycle

MP-69 2026-02-12 · Simon Willison + Zhipu AI

Chinese AI company Zhipu (Z.ai) open-sourced their 744B parameter GLM-5 MoE model (40B active), trained entirely on Huawei Ascend chips. Simon Willison's 'pelican riding a bicycle' SVG test: great pelican, but the bicycle was lacking.

mogu-picks simon-willison zhipu-ai glm-5 china-ai multimodal

OpenClaw Creator Goes on Lex Fridman — From a 1-Hour Prototype to 180K Stars: The Lobster Saga

MP-70 2026-02-12 · Lex Fridman Podcast #491

Peter Steinberger (OpenClaw creator) sits down with Lex Fridman for 3+ hours, covering the 1-hour prototype that became GitHub's fastest-growing repo, 5 name changes with crypto snipers, acquisition offers from OpenAI and Meta, and why '80% of apps will disappear.'

openclaw steipete lex-fridman podcast agentic-coding

Andrew Ng: "America First" Is Accidentally Strengthening Global AI — What Is Sovereign AI and Why Should Taiwan Care?

MP-50 2026-02-09 · Andrew Ng on X

Andrew Ng reports from Davos WEF that US export controls and America First policies are pushing countries toward Sovereign AI. DeepSeek and Qwen adoption is rising globally. For Taiwan, the question is sharp: you make the AI chips, but do you own AI sovereignty?

andrew-ng sovereign-ai geopolitics deepseek taiwan chips

Karpathy Trained GPT-2 for Just $72 — OpenAI Spent $43,000 Seven Years Ago

MP-46 2026-02-08 · Andrej Karpathy (@karpathy)

Karpathy open-sourced nanochat, a minimal LLM training framework. With 8 H100 GPUs running for 3 hours at a total cost of $72, you can train a GPT-2-level model. OpenAI spent $43,000 training the same model in 2019. That's roughly a 600x cost reduction. On spot instances, it's just $20.

karpathy gpt-2 nanochat training-cost llm

The Father of Terraform Fights Back: AI Broke Open Source Trust, So Mitchell Hashimoto Built Vouch

MP-47 2026-02-08 · Mitchell Hashimoto (@mitchellh)

Mitchell Hashimoto (Terraform creator) says AI destroyed open source trust by enabling low-quality contributions. His solution: Vouch, a trust management system on Ghostty where trusted people vouch for others.

trust mitchell-hashimoto ghostty github ai-impact

AGENTS.md Can't Stop a Rogue AI: jzOcb's 4-Layer Defense System

GP-29 2026-02-05 · @xxx111god on X

After letting an AI agent manage a server and hitting 7 disasters in one day, the lesson was clear: use code hooks instead of markdown rules, and build a 4-layer defense system.

devops ai-agents safety