AI Builders Brief — 2026-05-11

Follow builders, not influencers.

2026.05.11

25+ builders tracked

TL;DR

Levie argued AI agents needed real engineering, not side projects, while Anthropic shipped Managed Agents for long-horizon work. Dan Shipper said Codex turned a weekend hack into a 5-minute app, and No Priors called inference the real moat.

BUILDER INSIGHTS

Aaron Levie CEO, box

AI agents need real engineering, not side projects

He says once agents move beyond coding, they become mission-critical systems: they need the right context, secure integrations, human-in-the-loop design, and ongoing maintenance. That’s why Box is hiring for AI automation engineering roles — basically forward-deployed engineers for internal workflows — and he expects most companies to add similar jobs.

298

Garry Tan CEO, ycombinator

Design is pain transmuted into product

He says the highest form of design is turning human pain and suffering into something useful. It’s a very founder-ish take from the Y Combinator president: good design isn’t decoration, it’s compression of misery into clarity.

#1 334 #2 #3

Peter Steinberger OpenClaw

AI tooling for devs gets weirdly personal

He’s turning his own workflow into an AI playground: OpenClaw now has a browser, a `/side` helper for parallel questions, and Codex wired into e2e testing. He also fed Birdclaw his full Twitter archive so he can query old tweets and bookmarks like a personal memory layer.

#1 160 #2 172 #3

Dan Shipper CEO, every

Codex turns a weekend hack into a 5-minute app

He says a "codex-native" project let him plug in a MIDI keyboard, ask Codex for a watcher script and web app, and get a working chord trainer in about five minutes. The bigger point: AI coding is collapsing the gap between idea and usable tool, fast enough to make weekend projects feel trivial.

#1 158 #2 344 #3 184

Thariq anthropicai

HTML is becoming a planning and review tool

They've been using HTML for planning, specs, exploration, code review, and reports — basically turning the browser into a lightweight workspace. The sharper take: if you're not using HTML as a medium for thinking, you're probably leaving a lot on the table.

#1 #2 1.7k

Ryo Lu Cursor_ai

ryOS now bridges into a retro PC

They shipped an IRC bridge from ryOS to @levelsio’s retro PC, basically wiring two old-school worlds together. It’s a small but very on-brand Cursor design move: make software feel playful, connected, and a little weird in the best way.

#1 #2 149

BLOG UPDATES

Anthropic Engineering

Scaling Managed Agents: Decoupling the brain from the hands

Anthropic launches Managed Agents for scalable long-horizon work

Lead: Anthropic introduced Managed Agents, a hosted Claude Platform service that decouples the agent “brain” from its “hands” and session state so long-running workflows can scale, recover, and evolve without rewriting the whole harness.

Numbers:

p50 time-to-first-token dropped roughly 60%.
p95 time-to-first-token dropped over 90%.
The architecture supports many brains and many hands, with containers provisioned only when needed.

So What: The key shift is architectural: the harness no longer lives inside the container, sessions are durable outside the runtime, and tools are accessed through narrow interfaces like `execute(name, input) -> string`, `wake(sessionId)`, and `getEvents()`. That means failures are isolated, credentials stay out of the sandbox, and Claude can connect to VPCs, Git repos, MCP tools, and other infrastructure without baking assumptions into the runtime. Anthropic frames this as a “meta-harness” built for “programs as yet unthought of,” and says the goal is to let future harnesses, sandboxes, and tools swap underneath Claude without breaking the system. For builders, the takeaway is to treat state, compute, and tools as separable interfaces—not one brittle container.

Read original

PODCAST HIGHLIGHTS

No Priors

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud

Inference is becoming the real AI moat, not just model access

The Takeaway: The winners in AI won’t just own models — they’ll own the workflow, the data, and the compute behind inference.

Custom models are already the default at the frontier: Baseten says 90–95% of tokens on its platform are tied to modified models, not vanilla open-source weights.
The real moat isn’t “using AI,” it’s capturing unique user signal inside workflows — the kind labs can’t easily copy, like clinician edits or support resolution loops.
Compute scarcity is so real that capacity planning has become a daily operating problem, with contracts stretching to 3–5 years and high prepay just to secure GPUs.

Tuhin Srivastava, CEO of Baseten, is building in the middle of the AI inference crunch, and his worldview is blunt: the market is shifting from generic model access to specialized systems that learn from proprietary behavior. His bet is that application companies will endure because they own the signal that matters. As he puts it, the value sits in “the user signal that they can gather that only they can gather.”

That’s why he thinks companies like Abridge or support platforms can build durable advantages: the model is only part of the product; the workflow is where the compounding happens. Baseten’s own business reflects that shift. Most of its demand now comes from customers tuning models for quality, latency, or cost, and Srivastava says the company’s infrastructure and post-training teams are increasingly intertwined.

The other big lesson is that inference is no longer a software-only game. Baseten runs 90 clusters across 18 clouds and still operates at “mid nineties utilization” most of the time. Supply is tight, suppliers are uneven, and the best players will combine software, capital, and access to compute. In his view, the moat is simple: “access to inference computers is a strategic advantage.”

YouTube

STAY UPDATED

Daily builder insights, straight to your inbox.

Prefer RSS? Subscribe via RSS