AI Builders Brief
?

Follow builders, not influencers.

2026.04.30

25+ builders tracked

TL;DR

Josh Woodward said Gemini can now generate and export files, while Ryo Lu framed Cursor as a local-cloud agent stack. Aditya Agarwal called agents Linux-era jerry-rigging; Aaron Levie, Zara Zhang, and Garry Tan all bet the real work now is building internal agent ops, onboarding, and tests.

BUILDER INSIGHTS
13
01
Josh Woodward Josh Woodward VP, Google

Gemini can now generate and export files

Gemini now turns prompts straight into Docs, Word, PDFs, Sheets, Excel, Slides, and more — no copy-paste cleanup needed. Google’s pushing it everywhere at once, which makes Gemini feel less like a chat app and more like a real document factory.

X
02
Aditya Agarwal Aditya Agarwal CTO, SouthPkCommons

Agents are still Linux-era jerry-rigging

He says today’s agents are developer toys, not consumer products: too brittle, too chatty, and way too dependent on browser/computer hacks. The real engine is iterative tool-calling and code generation, and the next big leap may be dynamic interfaces instead of yet another chat box. His take: the “always-on” agent that keeps state and adapts is the thing worth betting on.

X
03
Kevin Weil Kevin Weil VP, OpenAI

AI should free radiologists to treat, not read

He argues the real job of a radiologist isn’t staring at x-rays — it’s curing people. If AI can speed up image interpretation, doctors can spend more time actually helping patients and see more of them.

X
04
Aaron Levie Aaron Levie CEO, box

Internal agent engineers are the next big role

He says companies will need a new kind of internal FDE: technical people who wire secure, governed agents into systems like Box, Salesforce, and Workday. The real shift isn’t automating jobs, it’s automating whole business processes — and he thinks that becomes a major role at Box and everywhere else.

X
05
Ryo Lu Ryo Lu Cursor_ai

Cursor is turning agents into a local-cloud stack

Build your own agent systems with Cursor using the same multi-model harness across local and cloud. It’s a clear push to make agent workflows feel less like a demo and more like infrastructure you can actually wire into real software work.

X
06
Nan Yu Nan Yu head of product, linear

Design-to-engine handoff, rebuilt from scratch

They say Linear is reinventing design-to-engine handoff from first principles, which sounds like a shot at the whole Figma-to-code mess. As head of product, Nan Yu is framing it as a clean-slate workflow problem, not just another integration.

X
07
Peter Steinberger Peter Steinberger OpenClaw

Codex review now runs inside clawsweeper

He integrated Codex review into clawsweeper, using a similar system prompt so it behaves like /review. The setup also automerges and loops until it stops finding new issues — basically an agentic code review treadmill.

X
08
Amjad Masad Amjad Masad CEO, replit

AI makes everyone feel the pain of ops

He joked that now everyone gets to experience the classic founder rite of passage: getting paged at dinner because the site is down. It’s a funny little reminder from the Replit CEO that as more people build software, more people also inherit the joy of keeping it alive.

X
09
Garry Tan Garry Tan CEO, ycombinator

He’s hardening onboarding with full E2E tests

He finally built a full end-to-end harness for installing gbrain on openclaw, so onboarding can be tested instead of hoped for. He also notes graphs are increasingly inferred from frontmatter attributes, pushing more of the setup to happen automatically.

X
10
Zara Zhang Zara Zhang

Internal tools become HR for agents

She argues IT and internal tools teams should be treated less like ticket fixers and more like “HR for agents” — the layer that onboards, manages, and supports AI workers. It’s a clean framing for how org charts may need to change as agents become part of the workforce.

X
11
Peter Yang Peter Yang

Claude’s web UI is getting sluggish

He says Claude’s basic web UI is now randomly taking 3–5 seconds to respond, which is a rough look for a product people expect to feel instant. It’s a small complaint, but for AI tools, latency is the product — and even a few seconds can kill the experience.

X
12
Swyx Swyx dxtipshq

Base models are still underused playgrounds

He says people are too busy mourning the completions API to notice how much weird stuff you can still do with base models and finetunes. The real opportunity, in his view, is experimenting harder with non-obvious uses instead of waiting for the perfect product surface.

X
13
Dan Shipper Dan Shipper CEO, every

Agents are becoming the new shoppers

He says the agent economy is already showing up in payments: people still won’t let AI buy a couch, but they’ll happily delegate low-stakes purchases. Stripe’s view of 2% of global GDP also shows fraud shifting up the stack — from stolen cards to stolen free-trial tokens and compute credits — while AI companies are scaling faster than any SaaS cohort Stripe has tracked.

X
PODCAST HIGHLIGHTS
1

Stripe sees agents turning fraud and pricing into full-funnel problems

The Takeaway: The internet is shifting from human-first to agent-first, and that rewrites fraud, billing, and trust.

  • Fraud is no longer just stolen cards at checkout; in AI, attackers steal compute, credits, and unpaid usage across the whole customer lifecycle.
  • The old SaaS playbook breaks fast: free trials, virtual cards, and seat-based pricing all become leaky when every prompt and API call has real cost.
  • Stripe’s edge is breadth: it’s pushing fraud detection upstream into sign-up, and across payment methods, processors, and even agent commerce.

Emily, Stripe’s head of data and AI, frames the shift bluntly: “the Internet has this new kind of actor on it.” Her point isn’t that AI is making websites smarter; it’s that agents are becoming the dominant users of the web, and every layer of the stack has to adapt. That means payments, identity, fraud, billing, and developer tools all need to be rebuilt for software that acts on behalf of people—or directly with other software.

The most immediate pain is fraud. In AI businesses, the thing being stolen isn’t just money; it’s expensive inference. Emily says fraudsters are abusing free trials, spinning up multi-accounts, and racking up unpaid usage. One company saw only 4% of free trials convert, while each trial cost $25 in LLM spend—turning into “$625 per payer” before revenue even started. Another Stripe customer is blocking 250,000 fraudulent free trials a week.

Stripe’s response is to move Radar from checkout to sign-up and beyond, because fraud is now a “full funnel problem, not a transaction problem alone.” On pricing, Emily expects tokens for model providers, but outcomes for vertical AI products. Seat-based billing, she argues, starts to look silly when software is doing the work humans used to do.

STAY UPDATED

Daily builder insights, straight to your inbox.

Prefer RSS? Subscribe via RSS

ARCHIVE
2026-04-29 9 items

Claude and Claude Code pushed into creative tools and papercuts, while Rauch said devtools now served agents more than humans. Masad warned free dev tools wouldn’t survive bot abuse, and Peter Steinberger showed AI commit bots already reviewed, fixed, and re-reviewed.

2026-04-27 8 items

Altman called for an agent-first reset of OSes and the internet, while Rauch said coding agents were the base layer of superintelligence. Levie argued AI hid hard parts instead of killing jobs, and Anthropic shipped Auto mode for safer no-prompt Claude Code.

2026-04-26 10 items

Altman said OpenAI still lags on frontend but wins on brains. Levie bet on weird future talent, Masad said every company turns into a cybersecurity company, and Tan showed Claude Code with a browser sidecar.

2026-04-25 16 items

Altman dropped GPT-5.5 into the API, and Cursor’s Ryo Lu bet on it plus Composer 2. Peter Yang said it can spit out a Star Fox clone; Anthropic shipped Managed Agents, while Replit, NotebookLM, and Discord all got sharper.

2026-04-24 13 items

Altman said Codex moved from demo to company-wide rollout, while Claude shipped persistent cross-session memory and everyday-life connectors. Masad shrugged off “Chinese distillation” panic, and Dan Shipper/Peter Yang said GPT-5.5 finally just does the work and clears game-build tests.

2026-04-23 13 items

Claude added interactive charts and Claude Code desktop with parallel sessions; Josh Woodward shipped Gemini conversation branching. Amjad Masad said static analysis lifted LLMs 90%+, while Aaron Levie and Guillermo Rauch framed agents and petabyte-scale hunts as the new battleground.

2026-04-22 10 items

Altman said OpenAI wanted you swimming in AI—and GPUs. Masad pushed for a fairer software market, Levie said enterprise agents needed humans to actually land, and Shipper showed agents could now read voice notes.

2026-04-21 10 items

Rauch said delete isn’t rotation, Levie argued agents need operators, not just users, and Steinberger kept OpenClaw pushing AI into real workflows. Shipper backed two-agent setups, while Claude warned teams to harden security now.

2026-04-20 9 items

Rauch said an AI-accelerated attack exposed Vercel’s weak link, while Kothari warned AI will supercharge attacks too. Garry Tan called Claude Code the new app factory, and Peter Yang noted agents still flaked on boring cron jobs.

2026-04-19 8 items

Rauch said design was becoming autonomous, not just a tool. Steinberger made CodexBar safer, faster, and lighter; Anthropic added Auto Mode to Claude Code and showed benchmark scores can swing with eval infra. Levie warned AI agents would force constant rewrites.

2026-04-18 13 items

Weil folded OpenAI for Science into core teams, while Google split Flow into music-making and Josh Woodward added remix control. Albert and Peter Yang showed Claude Design turning taste into production-grade assets, and Levie, Ryo Lu, and No Priors all argued AI wins when it serves workflows, not replaces them.

2026-04-17 15 items

Anthropic launched Managed Agents to decouple agent infra, while Claude Code defaulted to xhigh effort and got a usage-focused upgrade. Rauch said agents need durability over clever prompts, and Swyx split AI engineering into slop vs rigor.

2026-04-16 14 items

Rauch said teams were building their own design factories, while Steinberger called open-source AI security a full-time arms race. Masad priced OSS trust in compute, and Woodward shipped Gemini on Mac in 100 days.

2026-04-15 15 items

Woodward said Gemini’s turning into a test-prep machine, Albert called Claude Code the whole workspace, and Cat Wu shipped a desktop control center with parallel sessions and review tools. Rauch also argued agent builders need elastic Postgres, not vibes.

2026-04-14 10 items

Rauch said the moat moved from code to the code factory, while Levie argued every team now needed an agent wrangler. Cursor leaned into customizable multi-agent views, Replit added region controls, and No Priors backed Periodic Labs’ bet that AI could learn atoms by running experiments.

2026-04-13 10 items

Amjad Masad said Apple’s 50th has turned into a PR disaster, while Aaron Levie argued agents would create more work, not cut jobs. Rauch pushed engineers into the customer hot seat, and Claude warned teams to harden security fast.

2026-04-12 11 items

Thariq said Claude Code now handles TurboTax pain, while Rauch called microVM sandboxes the new compute layer. Aditya Agarwal pushed memory over loops, and Levie argued AI won’t shrink law—it’ll inflate it.

2026-04-11 16 items

Claude pushed into Word with tracked edits, and Claude Code moved planning to the web with auto mode approvals. Garry Tan called agents the Altair BASIC era, while Aaron Levie warned software without a real API gets left behind.

2026-04-10 12 items

Karpathy said free ChatGPT lagged while frontier coding models didn’t. Albert pushed cheap-to-smart escalation, Rauch said cloud infra went agent-native, and OpenAI’s next leap looked like autonomy—not chat.

2026-04-09 16 items

Woodward gave Gemini a second brain with Notebooks, while Anthropic shipped Managed Agents to move Claude from prompt to production. Rauch called the web AI’s native OS, and Levie, Masad, and Shipper all bet agents will do the work, not the people.

2026-04-08 12 items

Albert teased Anthropic’s Mythos Preview, Cat Wu juiced Claude Code’s CLI tricks, and Peter Steinberger patched CodexBar with 2 providers plus billing fixes. Levie said agents are eating knowledge work, while Nikunj Kothari preached retention over launch hype.

2026-04-07 8 items

Levie said agents won’t erase work, just push it up a layer; Yang argued they’ll shrink teams, not ambition. Garry Tan flagged an unpatched file leak in Claude’s coding env, while Kothari called Anthropic’s revenue ramp absurdly fast.

2026-04-06 10 items

Rauch said v0 now builds physics, not just UI, while Karpathy noted GitHub Gists have weirdly good comments. Levie argued AI efficiency creates more work, not less, and Tan called open source’s golden age.

2026-04-05 4 items

Karpathy pushed “your data, your files, your AI.” Levie argued context beat raw model IQ in enterprise AI. Garry Tan said GStack kept shipping security fixes fast, while No Priors spotlighted Periodic Labs’ bet on atoms, not just text.

2026-04-04 9 items

Claude plugged into Microsoft 365 everywhere, Swyx said Devin one-shot blog-to-code, and Peter Steinberger called out GitHub’s API as still not built for agents. Aaron Levie hit the context wall, while Garry Tan shipped a DX review tool from his own stack.

2026-04-03 10 items

Claude landed computer use on Windows, Karpathy argued LLMs should build your wiki, and Amjad Masad pushed Replit deeper into enterprise sales. Peter Yang said Cursor 3 got out of the agent’s way, while Peter Steinberger warned AI slop was flooding kernel security with real bugs.

2026-04-02 12 items

Steinberger called plan mode training wheels, while Thariq gave Claude Code a mouse-friendly renderer and Cat Wu showed sessions jumping phone-to-laptop. Masad framed Replit as an OS for agents, Rauch said Vercel signups compounded fast, and Anthropic’s infra tweaks swung coding scores by 6 points.

2026-04-01 4 items

Levie said AI productivity hit the enterprise risk wall, while Weil argued proofs got cleaner, not just better. Agarwal floated public source code as the new prod debugging, and Data Driven NYC claimed one founder could run a company if agents handled the layers below.

2026-03-31 15 items

Karpathy warned unpinned deps can turn one hack into mass pwnage, while Rauch and Levie said agents still need human guardrails and redesigned workflows. Meanwhile Claude Code got enterprise auto mode, Replit added built-in monetization, and Swyx spotted “Sign in with ChatGPT” already live.

2026-03-29 7 items

Andrej Karpathy highlighted how LLMs can argue any side, suggesting we use it as a feature. Guillermo Rauch finally shipped his dream text layout, bringing his vision to life. Meanwhile, Amjad Masad claimed AI is democratizing app building and elevating top engineers.

2026-03-28 7 items

Andrej Karpathy suggested leveraging LLMs' ability to argue any side as a feature. Guillermo Rauch turned text layout dreams into reality with Vercel's latest feature. Meanwhile, Amjad Masad claimed AI is democratizing app building, liberating top engineers for bigger challenges.