AI Builders Brief
?

Follow builders, not influencers.

2026.04.25

25+ builders tracked

TL;DR

Altman dropped GPT-5.5 into the API, and Cursor’s Ryo Lu bet on it plus Composer 2. Peter Yang said it can spit out a Star Fox clone; Anthropic shipped Managed Agents, while Replit, NotebookLM, and Discord all got sharper.

BUILDER INSIGHTS
14
01
Sam Altman Sam Altman

GPT-5.5 lands in the API

He said GPT-5.5 and GPT-5.5 Pro are now available in the API, which is the only tweet here with real substance. The rest is just a generic good-week pat on the back and a heart emoji, so the news is simple: OpenAI shipped a new model family for builders.

X
02
Ryo Lu Ryo Lu Cursor_ai

Cursor bets on GPT-5.5 + Composer 2

Cursor’s design lead says he’s fully switched to GPT-5.5 plus Composer 2, calling it the sweet spot for intelligence, speed, and cost. He also points users to /multitask, a way to break out of the queue and juggle more work at once.

X
03
Aaron Levie Aaron Levie CEO, box

AI productivity will drive more hiring, not less

He argues Jevons paradox applies to AI: the better it gets at boosting productivity, the more work companies will take on — and the more people they’ll hire around it. His examples are practical: small businesses finally building software, sales teams generating more leads, and marketing teams adding specialists once AI handles the heavy lifting.

X
04
Peter Steinberger Peter Steinberger OpenClaw

Discord DMs, now readable without login hacks

He shipped discrawl 0.6.0, and the big upgrade is simple: it can now read Discord DMs without custom login tricks that risk getting blocked. He also notes it’s read-only by design — no sending slop to humans.

X
05
Amjad Masad Amjad Masad CEO, replit

Replit wants to steal your app in one click

He says you can import Vercel or Lovable apps into Replit with a few clicks. That’s a classic platform move: make switching so easy that “lock-in” starts working in reverse.

X
06
Garry Tan Garry Tan CEO, ycombinator

He’s calling out tech money buying politics

He says a tech centimillionaire is trying to buy an election while backing candidates who want to “murder all capitalists” and “destroy the schools.” It’s a blunt anti-left, anti-dark-money shot from the YC president, aimed squarely at the current political circus.

X
07
Josh Woodward Josh Woodward VP, Google

NotebookLM gets smarter source organization

NotebookLM can now auto-label and categorize sources, which should make messy research piles a lot easier to wrangle. It’s a small feature, but the kind that saves real time once you’re juggling a bunch of docs.

X
08
Nikunj Kothari Nikunj Kothari Partner, fpvventures

Project codenames are forever—choose wisely

He says product teams keep pretending project codenames are temporary, but they stick around forever. It’s a small warning, but a real one: pick names like they’ll end up in docs, Slack, and postmortems for years.

X
09
Dan Shipper Dan Shipper CEO, every

Humans learn faster; AI knows more

He argues the real edge is split: AI can hold more knowledge than any one person, but humans still learn faster. That’s a neat framing for where AI tools actually win today — scale and recall, not self-directed adaptation.

X
10
Matt Turck Matt Turck FirstMarkCap

Europe doesn’t want to outsource intelligence

He says combining Cohere and Aleph Alpha is a telling moment: outside the US, there’s real resistance to handing “intelligence” over to a few American platforms. The bigger point is geopolitical — in a chaotic world, AI sovereignty is becoming a business and policy issue, not just a tech one.

X
11
Swyx Swyx dxtipshq

Hot skills today become tomorrow’s posttrain targets

He says the skills everyone is chasing now will just become the next post-training data targets — a neat way of saying the market keeps moving up a level. He also notes that serious AI engineering is finally coming from the C-suite, calling out Vercel as a good example of what more companies should do.

X
12
Peter Yang Peter Yang

GPT-5.5 can crank out a Star Fox clone

He showed GPT-5.5 and Codex building a Star Fox-style game in about 15 minutes of prompting, then called out the weirdly impressive part: the model playtests its own game for fun. It’s a neat demo of how far fast iteration with AI coding tools has come, especially for product folks watching what’s possible.

X
13
Nan Yu Nan Yu head of product, linear

Design without intent is just hallucination

He says the real core of design is understanding, not output: a design is an intention, and output without intention is basically hallucination. He also riffs on the pre-AI flood of “weather apps” and “dashboards” on Dribbble/Behance as proof that shiny output has always been easy to fake.

X
14
Aditya Agarwal Aditya Agarwal CTO, SouthPkCommons

Post-prompting is coming fast

He says we’re heading into a post-prompting world, where the old “type a prompt, get an answer” interface stops being the main way people use AI. That’s the kind of shift a former Facebook engineer and Dropbox CTO would flag early: the real product opportunity is moving from prompting to systems that do the work for you.

X
BLOG UPDATES
1
Anthropic Engineering

Scaling Managed Agents: Decoupling the brain from the hands

Anthropic launches Managed Agents with decoupled architecture

Lead: Anthropic introduced Managed Agents, a hosted Claude Platform service for long-horizon agents that separates the agent “brain” from its “hands” and session state so each can scale, fail, and evolve independently.

Numbers:

  • p50 time-to-first-token dropped ~60%
  • p95 time-to-first-token dropped over 90%
  • The architecture supports many brains and many hands without tying them to a single container

So What: For builders, the big shift is architectural: the harness no longer lives inside the sandbox, sessions are durable outside the runtime, and tools are accessed through stable interfaces like `execute(name, input) → string`, `wake(sessionId)`, and `getEvents()`. That means better recovery from failures, easier integration with VPCs and custom infrastructure, and stronger security because credentials never need to be exposed to sandboxed code. Anthropic’s core message is that assumptions about what Claude can’t do will go stale, so the platform is designed to outlast today’s implementation. As the post puts it, the goal is a system for “programs as yet unthought of.”

PODCAST HIGHLIGHTS
1

SAP bets AI wins by reengineering outcomes, not just chatbots

The Takeaway: SAP’s AI thesis is simple: enterprise value comes from reengineering workflows, data, and verification—not flashy demos.

  • The real moat isn’t model quality; it’s scale, context, and outcomes across messy enterprise systems.
  • AI in the company is moving from “software as a service” to “service as a software,” with agents embedded into work, not bolted on.
  • LLMs are great for unstructured work, but SAP still sees predictive/tabular models as essential for planning, forecasting, and finance.

Philipp Herzig, CTO of SAP, frames the company as the “operating system” of a business: finance, HR, supply chain, procurement, sales, and logistics all tied together for 400,000 customers. His point is that SAP has survived every tech cycle because customers don’t buy technology for its own sake—they buy outcomes. That’s why he thinks AI is a business-model transition, not just a technology transition.

The hard part, in his view, is not building a chatbot. It’s teaching AI to do the right thing at scale across thousands of documents, 20,000 APIs, and wildly different employee contexts. “The biggest challenge… is how do you teach the AI to do the right thing at scale,” he says. That’s why SAP is pushing generative UI, agentic workflows, and what he calls “agent mining”: capturing the tribal knowledge hidden in Slack, Teams, and human judgment, then turning it into a data flywheel.

He’s also blunt about limits. LLMs shine in text-heavy, unstructured tasks like consulting and support, but forecasting demand, cash flow, or payment delays still needs classical predictive models. In other words: the future isn’t one model to rule them all. It’s a stack where agents, structured data, evals, and predictive systems work together to make enterprises faster—and more reliable.

STAY UPDATED

Daily builder insights, straight to your inbox.

Prefer RSS? Subscribe via RSS

ARCHIVE
2026-04-24 13 items

Altman said Codex moved from demo to company-wide rollout, while Claude shipped persistent cross-session memory and everyday-life connectors. Masad shrugged off “Chinese distillation” panic, and Dan Shipper/Peter Yang said GPT-5.5 finally just does the work and clears game-build tests.

2026-04-23 13 items

Claude added interactive charts and Claude Code desktop with parallel sessions; Josh Woodward shipped Gemini conversation branching. Amjad Masad said static analysis lifted LLMs 90%+, while Aaron Levie and Guillermo Rauch framed agents and petabyte-scale hunts as the new battleground.

2026-04-22 10 items

Altman said OpenAI wanted you swimming in AI—and GPUs. Masad pushed for a fairer software market, Levie said enterprise agents needed humans to actually land, and Shipper showed agents could now read voice notes.

2026-04-21 10 items

Rauch said delete isn’t rotation, Levie argued agents need operators, not just users, and Steinberger kept OpenClaw pushing AI into real workflows. Shipper backed two-agent setups, while Claude warned teams to harden security now.

2026-04-20 9 items

Rauch said an AI-accelerated attack exposed Vercel’s weak link, while Kothari warned AI will supercharge attacks too. Garry Tan called Claude Code the new app factory, and Peter Yang noted agents still flaked on boring cron jobs.

2026-04-19 8 items

Rauch said design was becoming autonomous, not just a tool. Steinberger made CodexBar safer, faster, and lighter; Anthropic added Auto Mode to Claude Code and showed benchmark scores can swing with eval infra. Levie warned AI agents would force constant rewrites.

2026-04-18 13 items

Weil folded OpenAI for Science into core teams, while Google split Flow into music-making and Josh Woodward added remix control. Albert and Peter Yang showed Claude Design turning taste into production-grade assets, and Levie, Ryo Lu, and No Priors all argued AI wins when it serves workflows, not replaces them.

2026-04-17 15 items

Anthropic launched Managed Agents to decouple agent infra, while Claude Code defaulted to xhigh effort and got a usage-focused upgrade. Rauch said agents need durability over clever prompts, and Swyx split AI engineering into slop vs rigor.

2026-04-16 14 items

Rauch said teams were building their own design factories, while Steinberger called open-source AI security a full-time arms race. Masad priced OSS trust in compute, and Woodward shipped Gemini on Mac in 100 days.

2026-04-15 15 items

Woodward said Gemini’s turning into a test-prep machine, Albert called Claude Code the whole workspace, and Cat Wu shipped a desktop control center with parallel sessions and review tools. Rauch also argued agent builders need elastic Postgres, not vibes.

2026-04-14 10 items

Rauch said the moat moved from code to the code factory, while Levie argued every team now needed an agent wrangler. Cursor leaned into customizable multi-agent views, Replit added region controls, and No Priors backed Periodic Labs’ bet that AI could learn atoms by running experiments.

2026-04-13 10 items

Amjad Masad said Apple’s 50th has turned into a PR disaster, while Aaron Levie argued agents would create more work, not cut jobs. Rauch pushed engineers into the customer hot seat, and Claude warned teams to harden security fast.

2026-04-12 11 items

Thariq said Claude Code now handles TurboTax pain, while Rauch called microVM sandboxes the new compute layer. Aditya Agarwal pushed memory over loops, and Levie argued AI won’t shrink law—it’ll inflate it.

2026-04-11 16 items

Claude pushed into Word with tracked edits, and Claude Code moved planning to the web with auto mode approvals. Garry Tan called agents the Altair BASIC era, while Aaron Levie warned software without a real API gets left behind.

2026-04-10 12 items

Karpathy said free ChatGPT lagged while frontier coding models didn’t. Albert pushed cheap-to-smart escalation, Rauch said cloud infra went agent-native, and OpenAI’s next leap looked like autonomy—not chat.

2026-04-09 16 items

Woodward gave Gemini a second brain with Notebooks, while Anthropic shipped Managed Agents to move Claude from prompt to production. Rauch called the web AI’s native OS, and Levie, Masad, and Shipper all bet agents will do the work, not the people.

2026-04-08 12 items

Albert teased Anthropic’s Mythos Preview, Cat Wu juiced Claude Code’s CLI tricks, and Peter Steinberger patched CodexBar with 2 providers plus billing fixes. Levie said agents are eating knowledge work, while Nikunj Kothari preached retention over launch hype.

2026-04-07 8 items

Levie said agents won’t erase work, just push it up a layer; Yang argued they’ll shrink teams, not ambition. Garry Tan flagged an unpatched file leak in Claude’s coding env, while Kothari called Anthropic’s revenue ramp absurdly fast.

2026-04-06 10 items

Rauch said v0 now builds physics, not just UI, while Karpathy noted GitHub Gists have weirdly good comments. Levie argued AI efficiency creates more work, not less, and Tan called open source’s golden age.

2026-04-05 4 items

Karpathy pushed “your data, your files, your AI.” Levie argued context beat raw model IQ in enterprise AI. Garry Tan said GStack kept shipping security fixes fast, while No Priors spotlighted Periodic Labs’ bet on atoms, not just text.

2026-04-04 9 items

Claude plugged into Microsoft 365 everywhere, Swyx said Devin one-shot blog-to-code, and Peter Steinberger called out GitHub’s API as still not built for agents. Aaron Levie hit the context wall, while Garry Tan shipped a DX review tool from his own stack.

2026-04-03 10 items

Claude landed computer use on Windows, Karpathy argued LLMs should build your wiki, and Amjad Masad pushed Replit deeper into enterprise sales. Peter Yang said Cursor 3 got out of the agent’s way, while Peter Steinberger warned AI slop was flooding kernel security with real bugs.

2026-04-02 12 items

Steinberger called plan mode training wheels, while Thariq gave Claude Code a mouse-friendly renderer and Cat Wu showed sessions jumping phone-to-laptop. Masad framed Replit as an OS for agents, Rauch said Vercel signups compounded fast, and Anthropic’s infra tweaks swung coding scores by 6 points.

2026-04-01 4 items

Levie said AI productivity hit the enterprise risk wall, while Weil argued proofs got cleaner, not just better. Agarwal floated public source code as the new prod debugging, and Data Driven NYC claimed one founder could run a company if agents handled the layers below.

2026-03-31 15 items

Karpathy warned unpinned deps can turn one hack into mass pwnage, while Rauch and Levie said agents still need human guardrails and redesigned workflows. Meanwhile Claude Code got enterprise auto mode, Replit added built-in monetization, and Swyx spotted “Sign in with ChatGPT” already live.

2026-03-29 7 items

Andrej Karpathy highlighted how LLMs can argue any side, suggesting we use it as a feature. Guillermo Rauch finally shipped his dream text layout, bringing his vision to life. Meanwhile, Amjad Masad claimed AI is democratizing app building and elevating top engineers.

2026-03-28 7 items

Andrej Karpathy suggested leveraging LLMs' ability to argue any side as a feature. Guillermo Rauch turned text layout dreams into reality with Vercel's latest feature. Meanwhile, Amjad Masad claimed AI is democratizing app building, liberating top engineers for bigger challenges.