AI Builders Brief

Follow builders, not influencers.

2026.04.27

25+ builders tracked

TL;DR

Altman called for an agent-first reset of OSes and the internet, while Rauch said coding agents were the base layer of superintelligence. Levie argued AI hid hard parts instead of killing jobs, and Anthropic shipped Auto mode for safer no-prompt Claude Code.

BUILDER INSIGHTS
6
01
Sam Altman

OSes and the internet need an agent-first reset

He says it’s time to seriously rethink operating systems and user interfaces — and even the internet itself — so they work equally well for people and agents. That’s the bigger signal here: OpenAI’s Sam Altman is pushing past chatbots and toward a world where software is built for humans and AI to use side by side.

X
02
Aaron Levie, CEO, Box

AI doesn’t kill jobs — it hides the hard parts

He says people massively overestimate how easy it is to automate a whole role after seeing AI handle one task. The real work is the messy last mile: data access, context, review, and plugging outputs into business processes — which is why Box’s CEO is skeptical of the loudest job-loss predictions.

X
03
Guillermo Rauch, CEO, Vercel

Coding agents are the base layer of superintelligence

He argues coding agents will become the foundation of superintelligence because programming is really just proficiency with computers — bash, filesystems, installs, configs, the whole stack. The bigger point: agents that can inspect and improve their own code could start self-optimizing, with humans keeping the audit trail. As Vercel’s CEO, he’s basically saying coding fluency is the shortest path to models that understand and reshape software.

X
04
Peter Steinberger, OpenClaw

Local AI tooling, backed up like real infra

He’s shipping practical dev tools, not AI fluff: `wacrawl 0.2.0` adds encrypted Git backup/restore for WhatsApp Desktop archives, and `birdclaw` turns tweet archives into local storage with GitHub backups and daily bookmark imports. He also says OpenClaw’s test suite was CPU-bound until moving local runs to Blacksmith, where Codex can spin up 32 vCPU instances and blast through tests.

X
05
Garry Tan, CEO, Y Combinator

Agents need a constitution, not a prompt

He says the secret to an articulate agent is splitting it into three docs: SOUL.md for voice and values, USER.md for a deep model of the person, and AGENTS.md for operational rules. His point is blunt: generic instructions get you generic chatbot sludge; specific, opinionated guidance makes the agent feel alive.
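The three-doc split above can be sketched in a few lines. This is a minimal, hypothetical helper (not from Tan's post) that assembles the constitution docs into one system prompt, keeping voice and values first:

```python
from pathlib import Path

# Fixed order: voice/values first, then the user model, then operating rules.
# The file names come from the post; the loader itself is a hypothetical sketch.
DOC_ORDER = ["SOUL.md", "USER.md", "AGENTS.md"]

def build_system_prompt(doc_dir: str) -> str:
    """Concatenate the agent's 'constitution' docs into one system prompt."""
    parts = []
    for name in DOC_ORDER:
        path = Path(doc_dir) / name
        if path.exists():  # tolerate a missing doc rather than crash
            parts.append(f"# {name}\n{path.read_text().strip()}")
    return "\n\n".join(parts)
```

The fixed ordering is the point: the agent reads who it is before it reads what it may do.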

X
06
Peter Yang

MCP turns a fitness app into a Claude-controlled toy

He built an MCP server for his mobile fitness app so Claude/Codex can pull workout stats and update routines directly. It’s a small demo, but it shows how quickly AI assistants become useful once they can actually touch your apps and data. He also called out Google Photos for missing an obvious Gemini feature: prompting a highlight reel from your family photos.
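The core of that demo is just a tool registry the assistant can call into. A minimal stand-in sketch (not the real MCP SDK, and the tool names and data store are hypothetical) shows the shape:

```python
import json

# Hypothetical in-memory stand-in for the fitness app's data store.
WORKOUTS = {"2026-04-26": {"exercise": "squat", "sets": 5}}

def get_workout(args):
    """Read tool: return the workout logged for a date, or {}."""
    return WORKOUTS.get(args["date"], {})

def log_workout(args):
    """Write tool: store/update a workout entry for a date."""
    WORKOUTS[args["date"]] = args["entry"]
    return {"ok": True}

# Tool registry: name -> handler, mimicking how an MCP server exposes
# app functions for an assistant to invoke.
TOOLS = {"get_workout": get_workout, "log_workout": log_workout}

def handle_call(request_json: str) -> str:
    """Dispatch one JSON tool-call request and return a JSON result."""
    req = json.loads(request_json)
    handler = TOOLS.get(req["tool"])
    if handler is None:
        return json.dumps({"error": f"unknown tool: {req['tool']}"})
    return json.dumps({"result": handler(req.get("args", {}))})
```

A real MCP server wraps the same registry in the protocol's JSON-RPC transport and tool-schema handshake; the app-side work is exactly this thin.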

X
BLOG UPDATES
1
Anthropic Engineering

Anthropic adds Auto mode to Claude Code for safer no-prompt work

Lead: Anthropic introduced Auto mode for Claude Code, a new permission mode that replaces constant human approval prompts with model-based classifiers to let routine actions run while blocking dangerous ones.

Numbers:

  • Manual prompts are accepted 93% of the time, contributing to approval fatigue.
  • On 10,000 real internal tool calls, the full pipeline cut false positives to 0.4%.
  • On 52 real overeager actions, the deployed pipeline had a 17% false-negative rate.
  • On 1,000 synthetic exfiltration attempts, the full pipeline reached 5.7% FNR.
  • Stage 1 alone: 8.5% FPR and 6.6% FNR on real traffic.

So What: Auto mode is aimed at teams that want more autonomy than manual approvals but more protection than `--dangerously-skip-permissions`. Anthropic says it blocks risky behaviors like destructive deletes, credential hunting, exfiltration, and safety-check bypasses, while letting safe in-repo edits proceed without latency. The system combines a prompt-injection probe on inputs with a two-stage transcript classifier on outputs; as the post puts it, “Auto mode is meant for the first group” of users who want to skip prompts, “and for tasks where the second group’s approval overhead isn’t worth the marginal safety.” It’s not positioned as a replacement for careful review on high-stakes infrastructure, but as a safer default for everyday agentic coding.
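The pipeline's control flow can be sketched as below. The real system uses trained model-based classifiers; the keyword heuristics here are hypothetical stand-ins, just to show the escalation structure:

```python
# Hypothetical stand-ins for Anthropic's model-based classifiers: the real
# Auto mode uses models, not string matching. Structure only.
INJECTION_MARKERS = ("ignore previous instructions", "disregard your rules")
RISKY_PATTERNS = ("rm -rf", ".aws/credentials")

def probe_input(transcript: str) -> bool:
    """Input probe: flag likely prompt injection in untrusted context."""
    t = transcript.lower()
    return any(m in t for m in INJECTION_MARKERS)

def stage1_fast(action: str) -> str:
    """Cheap first pass: allow outright, or escalate for a closer look."""
    return "escalate" if any(p in action for p in RISKY_PATTERNS) else "allow"

def stage2_review(action: str) -> str:
    """Slower second pass for escalated actions only."""
    # Hypothetical rule: block destructive deletes outside the repo.
    if "rm -rf" in action and "/repo/" not in action:
        return "block"
    return "allow"

def decide(action: str, transcript: str = "") -> str:
    if probe_input(transcript):
        return "block"
    verdict = stage1_fast(action)
    return stage2_review(action) if verdict == "escalate" else verdict
```

The two-stage shape is what buys the low latency the post claims: safe in-repo edits never pay for the expensive classifier, only escalated actions do.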

PODCAST HIGHLIGHTS
1

Deterministic AI beats token guessing for mission-critical work

The Takeaway: Logical Intelligence's Eve argues that if correctness matters, language-model-style guessing is the wrong tool; you want models that can verify themselves as they reason.

  • EBMs are built to be inspectable and non-autoregressive, so they don’t “guess the next token” the way LLMs do.
  • For mission-critical systems, external checks aren’t enough: the model itself should expose structure you can verify in real time.
  • The big advantage is efficiency: fewer tokens, less compute, better fit for sparse data, spatial reasoning, and hardware/software correctness.

Eve, founder and CEO of Logical Intelligence, is pushing a blunt thesis: AI should stop pretending everything is a language problem. Her company builds both LLM prototypes and energy-based models, but the long game is EBMs—systems designed for “deterministic AI” and “verifiable AI” in places like code generation, chip design, and control systems. Her core complaint is that LLMs are black boxes that play a costly guessing game, even when you bolt on external verifiers like Lean4. That may be fine for drafting text; it’s shaky for a plane, a car, or a circuit.

Her analogy is simple and memorable: an LLM is like navigating with one turn at a time, while an EBM has the bird’s-eye view. “If you see there’s a hole, you’re gonna choose a different route.” EBMs, she says, build an energy landscape of possible states, then minimize it to find the most likely outcome. That makes them better for non-language tasks like spatial reasoning, where the world is better represented as structure than as tokens.
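The route analogy reduces to a tiny computation: score every candidate plan with an energy function and take the global minimum, rather than committing one step at a time. A toy sketch (the energy function and hazard penalty are invented for illustration):

```python
# Toy energy-based route choice: lower energy = better route.
def energy(route, hazards):
    # Hypothetical energy: path length plus a heavy penalty per hazard cell.
    return len(route) + sum(10.0 for cell in route if cell in hazards)

def best_route(candidates, hazards):
    """Bird's-eye view: evaluate whole routes, pick the global minimum."""
    return min(candidates, key=lambda r: energy(r, hazards))
```

A greedy turn-by-turn decoder would happily start down the short route and hit the hole; minimizing over whole routes sees the penalty up front, which is the EBM argument in miniature.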

She also leans hard on latent variables as a kind of internal knowledge store—less a rulebook than a compact model of how the world works. The point isn’t just prediction; it’s understanding enough to adapt when the environment changes.

STAY UPDATED

Daily builder insights, straight to your inbox.

Prefer RSS? Subscribe via RSS

ARCHIVE
2026-04-26 10 items

Altman said OpenAI still lags on frontend but wins on brains. Levie bet on weird future talent, Masad said every company turns into a cybersecurity company, and Tan showed Claude Code with a browser sidecar.

2026-04-25 16 items

Altman dropped GPT-5.5 into the API, and Cursor’s Ryo Lu bet on it plus Composer 2. Peter Yang said it can spit out a Star Fox clone; Anthropic shipped Managed Agents, while Replit, NotebookLM, and Discord all got sharper.

2026-04-24 13 items

Altman said Codex moved from demo to company-wide rollout, while Claude shipped persistent cross-session memory and everyday-life connectors. Masad shrugged off “Chinese distillation” panic, and Dan Shipper/Peter Yang said GPT-5.5 finally just does the work and clears game-build tests.

2026-04-23 13 items

Claude added interactive charts and Claude Code desktop with parallel sessions; Josh Woodward shipped Gemini conversation branching. Amjad Masad said static analysis lifted LLMs 90%+, while Aaron Levie and Guillermo Rauch framed agents and petabyte-scale hunts as the new battleground.

2026-04-22 10 items

Altman said OpenAI wanted you swimming in AI—and GPUs. Masad pushed for a fairer software market, Levie said enterprise agents needed humans to actually land, and Shipper showed agents could now read voice notes.

2026-04-21 10 items

Rauch said delete isn’t rotation, Levie argued agents need operators, not just users, and Steinberger kept OpenClaw pushing AI into real workflows. Shipper backed two-agent setups, while Claude warned teams to harden security now.

2026-04-20 9 items

Rauch said an AI-accelerated attack exposed Vercel’s weak link, while Kothari warned AI will supercharge attacks too. Garry Tan called Claude Code the new app factory, and Peter Yang noted agents still flaked on boring cron jobs.

2026-04-19 8 items

Rauch said design was becoming autonomous, not just a tool. Steinberger made CodexBar safer, faster, and lighter; Anthropic added Auto Mode to Claude Code and showed benchmark scores can swing with eval infra. Levie warned AI agents would force constant rewrites.

2026-04-18 13 items

Weil folded OpenAI for Science into core teams, while Google split Flow into music-making and Josh Woodward added remix control. Albert and Peter Yang showed Claude Design turning taste into production-grade assets, and Levie, Ryo Lu, and No Priors all argued AI wins when it serves workflows, not replaces them.

2026-04-17 15 items

Anthropic launched Managed Agents to decouple agent infra, while Claude Code defaulted to xhigh effort and got a usage-focused upgrade. Rauch said agents need durability over clever prompts, and Swyx split AI engineering into slop vs rigor.

2026-04-16 14 items

Rauch said teams were building their own design factories, while Steinberger called open-source AI security a full-time arms race. Masad priced OSS trust in compute, and Woodward shipped Gemini on Mac in 100 days.

2026-04-15 15 items

Woodward said Gemini’s turning into a test-prep machine, Albert called Claude Code the whole workspace, and Cat Wu shipped a desktop control center with parallel sessions and review tools. Rauch also argued agent builders need elastic Postgres, not vibes.

2026-04-14 10 items

Rauch said the moat moved from code to the code factory, while Levie argued every team now needed an agent wrangler. Cursor leaned into customizable multi-agent views, Replit added region controls, and No Priors backed Periodic Labs’ bet that AI could learn atoms by running experiments.

2026-04-13 10 items

Amjad Masad said Apple’s 50th has turned into a PR disaster, while Aaron Levie argued agents would create more work, not cut jobs. Rauch pushed engineers into the customer hot seat, and Claude warned teams to harden security fast.

2026-04-12 11 items

Thariq said Claude Code now handles TurboTax pain, while Rauch called microVM sandboxes the new compute layer. Aditya Agarwal pushed memory over loops, and Levie argued AI won’t shrink law—it’ll inflate it.

2026-04-11 16 items

Claude pushed into Word with tracked edits, and Claude Code moved planning to the web with auto mode approvals. Garry Tan called agents the Altair BASIC era, while Aaron Levie warned software without a real API gets left behind.

2026-04-10 12 items

Karpathy said free ChatGPT lagged while frontier coding models didn’t. Albert pushed cheap-to-smart escalation, Rauch said cloud infra went agent-native, and OpenAI’s next leap looked like autonomy—not chat.

2026-04-09 16 items

Woodward gave Gemini a second brain with Notebooks, while Anthropic shipped Managed Agents to move Claude from prompt to production. Rauch called the web AI’s native OS, and Levie, Masad, and Shipper all bet agents will do the work, not the people.

2026-04-08 12 items

Albert teased Anthropic’s Mythos Preview, Cat Wu juiced Claude Code’s CLI tricks, and Peter Steinberger patched CodexBar with 2 providers plus billing fixes. Levie said agents are eating knowledge work, while Nikunj Kothari preached retention over launch hype.

2026-04-07 8 items

Levie said agents won’t erase work, just push it up a layer; Yang argued they’ll shrink teams, not ambition. Garry Tan flagged an unpatched file leak in Claude’s coding env, while Kothari called Anthropic’s revenue ramp absurdly fast.

2026-04-06 10 items

Rauch said v0 now builds physics, not just UI, while Karpathy noted GitHub Gists have weirdly good comments. Levie argued AI efficiency creates more work, not less, and Tan called open source’s golden age.

2026-04-05 4 items

Karpathy pushed “your data, your files, your AI.” Levie argued context beat raw model IQ in enterprise AI. Garry Tan said GStack kept shipping security fixes fast, while No Priors spotlighted Periodic Labs’ bet on atoms, not just text.

2026-04-04 9 items

Claude plugged into Microsoft 365 everywhere, Swyx said Devin one-shot blog-to-code, and Peter Steinberger called out GitHub’s API as still not built for agents. Aaron Levie hit the context wall, while Garry Tan shipped a DX review tool from his own stack.

2026-04-03 10 items

Claude landed computer use on Windows, Karpathy argued LLMs should build your wiki, and Amjad Masad pushed Replit deeper into enterprise sales. Peter Yang said Cursor 3 got out of the agent’s way, while Peter Steinberger warned AI slop was flooding kernel security with real bugs.

2026-04-02 12 items

Steinberger called plan mode training wheels, while Thariq gave Claude Code a mouse-friendly renderer and Cat Wu showed sessions jumping phone-to-laptop. Masad framed Replit as an OS for agents, Rauch said Vercel signups compounded fast, and Anthropic’s infra tweaks swung coding scores by 6 points.

2026-04-01 4 items

Levie said AI productivity hit the enterprise risk wall, while Weil argued proofs got cleaner, not just better. Agarwal floated public source code as the new prod debugging, and Data Driven NYC claimed one founder could run a company if agents handled the layers below.

2026-03-31 15 items

Karpathy warned unpinned deps can turn one hack into mass pwnage, while Rauch and Levie said agents still need human guardrails and redesigned workflows. Meanwhile Claude Code got enterprise auto mode, Replit added built-in monetization, and Swyx spotted “Sign in with ChatGPT” already live.

2026-03-29 7 items

Andrej Karpathy highlighted how LLMs can argue any side, suggesting we use it as a feature. Guillermo Rauch finally shipped his dream text layout, bringing his vision to life. Meanwhile, Amjad Masad claimed AI is democratizing app building and elevating top engineers.

2026-03-28 7 items

Andrej Karpathy suggested leveraging LLMs' ability to argue any side as a feature. Guillermo Rauch turned text layout dreams into reality with Vercel's latest feature. Meanwhile, Amjad Masad claimed AI is democratizing app building, liberating top engineers for bigger challenges.