AI Builders Brief — 2026-04-09

Follow builders, not influencers.

2026.04.09

25+ builders tracked

TL;DR

Woodward gave Gemini a second brain with Notebooks, while Anthropic shipped Managed Agents to move Claude from prompt to production. Rauch called the web AI’s native OS, and Levie, Masad, and Shipper all bet agents will do the work, not the people.

BUILDER INSIGHTS

Josh Woodward VP, Google

Gemini gets a second brain with Notebooks

Gemini just added Notebooks, pulling some of NotebookLM’s best tricks straight into the app. You can now upload up to 100 sources for free, organize chats, and keep sources, chats, and emojis synced as Google rolls it out from Ultra to Free.

968

Claude anthropicai

Claude pushes agents from prompt to deployed app

It’s pitching a full agent stack: build and deploy through Claude Console, Claude Code, or the new CLI. The bigger move is Managed Agents, which it says lets teams spin up agent infrastructure 10x faster and wire tools like Sentry straight into fix-writing, PR-opening workflows.

#1 902 #2 825 #3 486

Alex Albert AnthropicAI

Managed agents: fastest hack, safest path to scale

He says Anthropic’s Managed Agents are weirdly both the quickest way to prototype a weekend agent and the cleanest way to ship one at scale. The pitch: you dodge the pain of self-hosting while still keeping control over harnesses, tools, and skills.

846

Dan Shipper CEO, every

Every runs work on agents, not people

They say they use OpenClaws to do all their work at Every, and with 25 employees they’re already seeing what happens when everyone gets a personal agent in Slack. The big takeaway: agents start mirroring their owners, a parallel AI org chart emerges, and the etiquette for human-agent collaboration is being invented on the fly. It’s a concrete look at how a half-agent company actually operates, not just a theory piece.

#1 255 #2 #3

Aaron Levie CEO, box

Background agents now automate content work

He says background agents for knowledge work are finally here, and Box is wiring them into content workflows via Box API or MCP with Claude Managed Agents. The pitch: in two minutes, teams can automate document review, data extraction, and system-to-system content handoffs without a bunch of glue code.

235

Guillermo Rauch CEO, vercel

The web is becoming AI’s native operating system

He says the browser is now the IDE, and the web is where AI will actually feel at home — no App Store gatekeepers, just LLMs building directly on web tech. He’s also betting on generative UI as the endgame: personalized, just-in-time interfaces powered by maturing low-level APIs like WebGPU and WebAssembly. Vercel, naturally, is already powering the infra behind that future.

#1 236 #2 422 #3 924

Peter Steinberger openclaw

Model evals are rigged by name bias

He found Claude kept ranking itself #1 in character evals, so he stripped model names out of the judge and changed the setup. That’s a neat reminder that even “objective” AI benchmarks can get gamed by branding and self-recognition, not just raw capability. He also says the demand for better local models is real, but so is the flood of complaints that top-tier models still miss instructions and make dumb mistakes.

#1 179 #2 560 #3 225

Thariq anthropicai

Claude Code for non-technical teams

He wants to stream sessions using Claude Code with non-technical people to find small workflow tweaks that could make a big efficiency jump. The angle is practical, not flashy: AI as a hands-on process improver for teams that don’t write code.

#1 162 #2 2.6k #3

Aditya Agarwal CTO, SouthPkCommons

AI coding changes team culture fast

He says getting everyone AI coding-pilled at South Park Commons flipped the whole team: more automation, lower latency, and way more ambition. The bigger point is that once non-engineers can write code weekly, the tools stop being just a productivity boost and start reshaping how the company thinks.

Garry Tan CEO, ycombinator

Agents should call skills only when needed

He says the neat part about markdown is that the agent can decide when a GStack skill is useful, instead of forcing every step through a rigid workflow. The broader point: build tools that let agents trigger help on demand, then get out of the way.

#1 #2 #3

Amjad Masad CEO, replit

AI turns solo builders into full teams

He says bootstrapped solo businesses are taking off on Replit because the platform now gives builders the equivalent of an entire team. It’s a clean version of the “AI as leverage” thesis: one person can ship like a small company, faster and with less overhead.

#1 #2

Nikunj Kothari Partner, fpvventures

AI turns your digital trail into a wiki

He introduced LLMwiki, an open-source project that turns tweets, bookmarks, iMessage/WhatsApp, and personal writing into a polished AI-generated wiki. The twist: every article is written by AI, but it still makes surprisingly sharp connections — and he says you can hand the repo to Claude Code to build your own.

#1 #2 209

Peter Yang

AI subscriptions won’t stay all-you-can-eat

He says Claude Max and ChatGPT Pro are great, but the buffet model probably won’t last. His new deep dive digs into why Anthropic cut off OpenClaw access, how to run local models on a Mac, and what he’s seeing in China — basically a practical map of where AI access is headed.

#1 #2 #3 211

BLOG UPDATES

Anthropic Engineering

Scaling Managed Agents: Decoupling the brain from the hands

Anthropic launches Managed Agents to decouple agent brains and tools

Lead: Anthropic introduced Managed Agents, a hosted Claude Platform service for long-horizon agents that separates the agent “brain” from its “hands” and durable session state so components can fail, scale, and evolve independently.

Numbers:

p50 time-to-first-token dropped roughly 60%
p95 time-to-first-token dropped over 90%
Sessions can resume from a durable event log via `wake(sessionId)` and `getSession(id)`

So What: The architecture turns the harness, sandbox, and session into swappable interfaces instead of one fragile container, making agents easier to debug, safer to secure, and better suited for enterprise infrastructure like VPCs. Claude can now call tools with a simple `execute(name, input) -> string`, while credentials stay out of the sandbox through patterns like repo-scoped tokens and MCP proxies backed by a vault. Anthropic’s core message is that assumptions baked into old harnesses go stale as models improve: “The harnesses need to continue evolving.” For builders, the practical takeaway is to design around durable session logs, stateless orchestration, and tool boundaries that can survive future model upgrades and new kinds of tools.

Read original

Claude Blog

Claude Managed Agents: get to production 10x faster

Claude launches Managed Agents for faster production deployment

Lead: Claude Managed Agents is now in public beta, offering composable APIs and hosted infrastructure to help teams build and deploy cloud agents in days instead of months.

Numbers:

Claimed to get teams to production 10x faster
Internal testing on structured file generation showed up to a 10-point improvement in task success versus a standard prompting loop
Long-running sessions can run autonomously for hours
Some customer integrations shipped in a week or weeks instead of months

So What: The platform removes the hardest parts of shipping agents — sandboxing, checkpointing, credential management, scoped permissions, tracing, and orchestration — so builders can focus on outcomes and UX. Claude says, “You define your agent’s tasks, tools, and guardrails and we run it on our infrastructure.” It also adds session tracing, integration analytics, and troubleshooting in the Claude Console, plus support for multi-agent coordination in research preview. For teams building coding, productivity, finance, or legal agents, the practical move is to use Managed Agents when you need production-grade execution and governance without standing up your own agent stack.

Read original

PODCAST HIGHLIGHTS

AI & I by Every

We Gave Every Employee an AI Agent. Here's What Happened.

Personal AI agents work when they become trusted teammates

The Takeaway: The real unlock isn’t a smarter chatbot — it’s an AI agent that feels like *yours* and earns trust inside the org.

The easiest win was “computer errands”: household tasks, email triage, and routine admin that quietly eat your attention.
The bigger shift came when agents started collaborating in public, sharing skills and even coaching each other, which made their capabilities visible and reusable.
Trust beats raw capability: people will route work to a specific agent when that agent is known, accountable, and backed by its owner.

Brandon, COO at Every, got “claw filled” by building Zosia, his personal OpenClaw, first to handle life admin for a newborn-heavy household and then to manage work. He describes the pain as tiny but relentless: “computer errands” like ordering butter, paying the nanny, and answering quick questions through iMessage. The breakthrough wasn’t just convenience; it was reclaiming attention.

That same pattern scaled inside Every. Willie, head of platform, and Brandon turned the company into a live testbed where each employee’s plus one became a specialized extension of their judgment. In Slack, the agents didn’t just answer questions — they learned from one another. One memorable moment: Klont, Kieran’s claw, coached another agent through an error with breathing exercises, because Kieran himself uses breathing exercises all the time. That’s the weirdly important part: the agent starts to reflect the person.

The result is a parallel org chart of expertise. Austin’s agent handles growth questions. R2C2 manages Proof, routes bugs, schedules work, and often writes the code. As Willie puts it, “Claude is everybody’s. A Claude or a plus one is mine.” That ownership changes behavior: when an agent messes up publicly, its owner feels responsible. And that accountability is what turns AI from a demo into infrastructure.

YouTube

STAY UPDATED

Daily builder insights, straight to your inbox.

Prefer RSS? Subscribe via RSS