AI Builders Brief
?

Follow builders, not influencers.

2026.05.01

25+ builders tracked

TL;DR

Karpathy said LLMs were new software, not just faster software. Rauch, Steinberger, and Cat Wu shipped agent upgrades, while Levie, Masad, and Agarwal bet agents ate UI, made Replit its own customer, and turned AI into both attack and defense.

BUILDER INSIGHTS
11
01
Andrej Karpathy Andrej Karpathy CTO

LLMs aren’t just faster software — they’re new software

He says the real shift isn’t coding speedups: LLMs can power things that were basically impossible before, like image-in/image-out apps with no classical code, install instructions written in Markdown, and knowledge bases over messy unstructured data. He also argues LLMs stay jagged because what labs train on is shaped by economics and verifiability, which is why they can refactor a giant codebase one minute and give absurd advice the next. The bigger bet: an agent-native economy where products are built to be legible to models, not just humans.

X
02
Peter Steinberger Peter Steinberger OpenClaw

OpenClaw got a serious agent upgrade

He says OpenClaw now works way better in group chats after changing how the agents talk, and that the codex harness beats plain GPT for this use case. The pitch is simple: enable both and you get a much stronger setup. He also hints at a broader security push with help from Nvidia, OpenAI, Microsoft, GitHub, Tencent Hunyuan, Convex, Atlassian, and Blacksmith.

X
03
Guillermo Rauch Guillermo Rauch CEO, vercel

Vercel’s v0 imagines a GitHub clone in 2 prompts

He asked v0 what it would look like if Vercel shipped GitHub, and got a full concept back in just two prompts. It’s a neat flex for the CEO: AI isn’t just helping build features, it’s starting to sketch entire products on demand.

X
04
Aaron Levie Aaron Levie CEO, box

Agents will eat UI; APIs become the product

He says software has to go headless because agents won’t click around your UI — they’ll talk straight to your APIs. His Box take: people still buy seats, but those seats need bundled API usage, while agent-only work will push software toward consumption pricing and outcome-based APIs.

X
05
Cat Wu Cat Wu anthropicai

Claude Code now scans repos for vulnerabilities

Claude Security is in public beta inside Claude Code on the web, so you can point it at a repo, get validated vulnerability findings, and fix them without leaving the editor. It’s a neat push toward making security part of the normal coding loop instead of a separate workflow.

X
06
Amjad Masad Amjad Masad CEO, replit

Replit is its own first customer

He says Replit treats itself as customer zero, not just a dogfooding exercise, and expects internal usage to deliver insane ROI. The point is simple: if the product can’t justify itself inside the company, it’s not ready for everyone else.

X
07
Nikunj Kothari Nikunj Kothari Partner, fpvventures

Big models will run the OS, not just apps

He says MCPs and CLIs are the early proof that “big” models will orchestrate our lives, starting in the terminal, then computer use, then the whole OS. The takeaway is blunt: if your product isn’t in the path of these models using you, you’ll get routed around.

X
08
Aditya Agarwal Aditya Agarwal CTO, SouthPkCommons

AI is now both the attack and the defense

He says cybersecurity just hit a new inflection point: attackers have AI, and defenders need it too. In a South Park Commons chat with Palo Alto Networks CEO Nikesh Arora, he argues the only way to keep up is to use AI to find bad code faster, clean up stack chaos, and raise the security floor.

X
09
Garry Tan Garry Tan CEO, ycombinator

GBrain is a new layer for personal AI

He says GBrain has already separated itself from similar tools and fits a different lane: personal AI setups like OpenClaw and Hermes, not perfect needle-in-a-haystack retrieval. He’s also pushing the MIT-licensed open source project as a drop-in boost for anyone building a Karpathy-style knowledge wiki.

X
10
Ryo Lu Ryo Lu Cursor_ai

Opinionated systems win by staying flexible

He argues that “no opinion” is still a strong opinion: every product encodes a worldview in its primitives, visibility, and abstractions. The real design work, especially for Cursor, is reducing concepts to durable essentials so beginners aren’t boxed in and power users can keep extending the system.

X
11
Swyx Swyx dxtipshq

Coding agents are breaking containment

He says the breakout theme of the year is coding agents breaking containment — and that it’s not just for developers anymore. In a talk for AIE EU, he argued tiny teams can use agents to run real businesses at scale, pointing to @aidotengineer serving ~1M developers a month while leaning on tools like Devin, Town, and OpenClaw.

X
BLOG UPDATES
1
Claude Blog

Redesigning Claude Code on desktop for parallel agents

Claude Code desktop adds parallel sessions and built-in tools

Lead: Claude Code’s redesigned desktop app is now available, built for parallel agent workflows with a new sidebar, drag-and-drop workspace layout, integrated terminal and file editor, and faster review tools.

Numbers:

  • Available now for Claude Code users on Pro, Max, Team, and Enterprise plans, and via the Claude API.
  • SSH support now extends to Mac and Linux.
  • Three interface modes: Verbose, Normal, Summary.

So What: The update turns Claude Code into more of an orchestration hub: you can run multiple sessions across repos, branch into side chats without contaminating the main thread, and keep everything organized as work merges or closes. Claude says the app is built for “how agentic coding actually feels now: many things in flight, and you in the orchestrator seat.” Builders can now review diffs, run tests, edit files, and preview HTML/PDFs without leaving the app, while plugin parity and remote-session support make it fit existing workflows. The practical takeaway: less context switching, faster iteration, and a desktop experience that can replace a chunk of the terminal-editor shuffle.

PODCAST HIGHLIGHTS
1

Demis Hassabis: build AI as a tool first, then use it to rewrite science

The Takeaway: Hassabis treats AGI as a long game: build a powerful tool first, then use it to accelerate science, medicine, and understanding itself.

  • He didn’t stumble into AI; he planned for it as a teenager and used games, neuroscience, and startups as stepping stones toward DeepMind.
  • His biggest startup lesson was brutal and practical: be “five years ahead of your time, not fifty years ahead.”
  • He sees AlphaFold as proof that AI can compress years of wet-lab work into mostly in-silico exploration, with biology becoming a simulator-driven science.

Demis Hassabis, the neuroscientist-founder behind DeepMind and Isomorphic Labs, comes across less like a hype man and more like a disciplined strategist. He says the through-line in his career was always AI: games were the funding vehicle, neuroscience was the inspiration, and DeepMind was the vehicle for the real mission. The early win with Theme Park showed him that people loved interacting with intelligent systems; the failed ambition of Republic taught him that timing matters as much as vision.

His philosophy is refreshingly unsentimental. DeepMind’s original mission was “step one, solve intelligence… step two, use it to solve everything else.” That second step is where he’s most energized: AI for science, especially biology, weather, and eventually social systems through simulation. He argues that biology is the perfect domain for machine learning because it’s messy, emergent, and full of weak signals that humans can’t hold in their heads. The goal is to make the computer do 99% of the search, leaving the lab for validation.

On AGI, he’s still concrete rather than mystical: build “an incredibly intelligent and useful and precise tool” first, then cross the next Rubicon. And he’s still thinking in big, philosophical terms too: “I actually think information is most fundamental.”

STAY UPDATED

Daily builder insights, straight to your inbox.

Prefer RSS? Subscribe via RSS

ARCHIVE
2026-04-30 14 items

Josh Woodward said Gemini can now generate and export files, while Ryo Lu framed Cursor as a local-cloud agent stack. Aditya Agarwal called agents Linux-era jerry-rigging; Aaron Levie, Zara Zhang, and Garry Tan all bet the real work now is building internal agent ops, onboarding, and tests.

2026-04-29 9 items

Claude and Claude Code pushed into creative tools and papercuts, while Rauch said devtools now served agents more than humans. Masad warned free dev tools wouldn’t survive bot abuse, and Peter Steinberger showed AI commit bots already reviewed, fixed, and re-reviewed.

2026-04-27 8 items

Altman called for an agent-first reset of OSes and the internet, while Rauch said coding agents were the base layer of superintelligence. Levie argued AI hid hard parts instead of killing jobs, and Anthropic shipped Auto mode for safer no-prompt Claude Code.

2026-04-26 10 items

Altman said OpenAI still lags on frontend but wins on brains. Levie bet on weird future talent, Masad said every company turns into a cybersecurity company, and Tan showed Claude Code with a browser sidecar.

2026-04-25 16 items

Altman dropped GPT-5.5 into the API, and Cursor’s Ryo Lu bet on it plus Composer 2. Peter Yang said it can spit out a Star Fox clone; Anthropic shipped Managed Agents, while Replit, NotebookLM, and Discord all got sharper.

2026-04-24 13 items

Altman said Codex moved from demo to company-wide rollout, while Claude shipped persistent cross-session memory and everyday-life connectors. Masad shrugged off “Chinese distillation” panic, and Dan Shipper/Peter Yang said GPT-5.5 finally just does the work and clears game-build tests.

2026-04-23 13 items

Claude added interactive charts and Claude Code desktop with parallel sessions; Josh Woodward shipped Gemini conversation branching. Amjad Masad said static analysis lifted LLMs 90%+, while Aaron Levie and Guillermo Rauch framed agents and petabyte-scale hunts as the new battleground.

2026-04-22 10 items

Altman said OpenAI wanted you swimming in AI—and GPUs. Masad pushed for a fairer software market, Levie said enterprise agents needed humans to actually land, and Shipper showed agents could now read voice notes.

2026-04-21 10 items

Rauch said delete isn’t rotation, Levie argued agents need operators, not just users, and Steinberger kept OpenClaw pushing AI into real workflows. Shipper backed two-agent setups, while Claude warned teams to harden security now.

2026-04-20 9 items

Rauch said an AI-accelerated attack exposed Vercel’s weak link, while Kothari warned AI will supercharge attacks too. Garry Tan called Claude Code the new app factory, and Peter Yang noted agents still flaked on boring cron jobs.

2026-04-19 8 items

Rauch said design was becoming autonomous, not just a tool. Steinberger made CodexBar safer, faster, and lighter; Anthropic added Auto Mode to Claude Code and showed benchmark scores can swing with eval infra. Levie warned AI agents would force constant rewrites.

2026-04-18 13 items

Weil folded OpenAI for Science into core teams, while Google split Flow into music-making and Josh Woodward added remix control. Albert and Peter Yang showed Claude Design turning taste into production-grade assets, and Levie, Ryo Lu, and No Priors all argued AI wins when it serves workflows, not replaces them.

2026-04-17 15 items

Anthropic launched Managed Agents to decouple agent infra, while Claude Code defaulted to xhigh effort and got a usage-focused upgrade. Rauch said agents need durability over clever prompts, and Swyx split AI engineering into slop vs rigor.

2026-04-16 14 items

Rauch said teams were building their own design factories, while Steinberger called open-source AI security a full-time arms race. Masad priced OSS trust in compute, and Woodward shipped Gemini on Mac in 100 days.

2026-04-15 15 items

Woodward said Gemini’s turning into a test-prep machine, Albert called Claude Code the whole workspace, and Cat Wu shipped a desktop control center with parallel sessions and review tools. Rauch also argued agent builders need elastic Postgres, not vibes.

2026-04-14 10 items

Rauch said the moat moved from code to the code factory, while Levie argued every team now needed an agent wrangler. Cursor leaned into customizable multi-agent views, Replit added region controls, and No Priors backed Periodic Labs’ bet that AI could learn atoms by running experiments.

2026-04-13 10 items

Amjad Masad said Apple’s 50th has turned into a PR disaster, while Aaron Levie argued agents would create more work, not cut jobs. Rauch pushed engineers into the customer hot seat, and Claude warned teams to harden security fast.

2026-04-12 11 items

Thariq said Claude Code now handles TurboTax pain, while Rauch called microVM sandboxes the new compute layer. Aditya Agarwal pushed memory over loops, and Levie argued AI won’t shrink law—it’ll inflate it.

2026-04-11 16 items

Claude pushed into Word with tracked edits, and Claude Code moved planning to the web with auto mode approvals. Garry Tan called agents the Altair BASIC era, while Aaron Levie warned software without a real API gets left behind.

2026-04-10 12 items

Karpathy said free ChatGPT lagged while frontier coding models didn’t. Albert pushed cheap-to-smart escalation, Rauch said cloud infra went agent-native, and OpenAI’s next leap looked like autonomy—not chat.

2026-04-09 16 items

Woodward gave Gemini a second brain with Notebooks, while Anthropic shipped Managed Agents to move Claude from prompt to production. Rauch called the web AI’s native OS, and Levie, Masad, and Shipper all bet agents will do the work, not the people.

2026-04-08 12 items

Albert teased Anthropic’s Mythos Preview, Cat Wu juiced Claude Code’s CLI tricks, and Peter Steinberger patched CodexBar with 2 providers plus billing fixes. Levie said agents are eating knowledge work, while Nikunj Kothari preached retention over launch hype.

2026-04-07 8 items

Levie said agents won’t erase work, just push it up a layer; Yang argued they’ll shrink teams, not ambition. Garry Tan flagged an unpatched file leak in Claude’s coding env, while Kothari called Anthropic’s revenue ramp absurdly fast.

2026-04-06 10 items

Rauch said v0 now builds physics, not just UI, while Karpathy noted GitHub Gists have weirdly good comments. Levie argued AI efficiency creates more work, not less, and Tan called open source’s golden age.

2026-04-05 4 items

Karpathy pushed “your data, your files, your AI.” Levie argued context beat raw model IQ in enterprise AI. Garry Tan said GStack kept shipping security fixes fast, while No Priors spotlighted Periodic Labs’ bet on atoms, not just text.

2026-04-04 9 items

Claude plugged into Microsoft 365 everywhere, Swyx said Devin one-shot blog-to-code, and Peter Steinberger called out GitHub’s API as still not built for agents. Aaron Levie hit the context wall, while Garry Tan shipped a DX review tool from his own stack.

2026-04-03 10 items

Claude landed computer use on Windows, Karpathy argued LLMs should build your wiki, and Amjad Masad pushed Replit deeper into enterprise sales. Peter Yang said Cursor 3 got out of the agent’s way, while Peter Steinberger warned AI slop was flooding kernel security with real bugs.

2026-04-02 12 items

Steinberger called plan mode training wheels, while Thariq gave Claude Code a mouse-friendly renderer and Cat Wu showed sessions jumping phone-to-laptop. Masad framed Replit as an OS for agents, Rauch said Vercel signups compounded fast, and Anthropic’s infra tweaks swung coding scores by 6 points.

2026-04-01 4 items

Levie said AI productivity hit the enterprise risk wall, while Weil argued proofs got cleaner, not just better. Agarwal floated public source code as the new prod debugging, and Data Driven NYC claimed one founder could run a company if agents handled the layers below.

2026-03-31 15 items

Karpathy warned unpinned deps can turn one hack into mass pwnage, while Rauch and Levie said agents still need human guardrails and redesigned workflows. Meanwhile Claude Code got enterprise auto mode, Replit added built-in monetization, and Swyx spotted “Sign in with ChatGPT” already live.

2026-03-29 7 items

Andrej Karpathy highlighted how LLMs can argue any side, suggesting we use it as a feature. Guillermo Rauch finally shipped his dream text layout, bringing his vision to life. Meanwhile, Amjad Masad claimed AI is democratizing app building and elevating top engineers.

2026-03-28 7 items

Andrej Karpathy suggested leveraging LLMs' ability to argue any side as a feature. Guillermo Rauch turned text layout dreams into reality with Vercel's latest feature. Meanwhile, Amjad Masad claimed AI is democratizing app building, liberating top engineers for bigger challenges.