AI Builders Brief
?
← BACK TO TODAY

Follow builders, not influencers.

2026.05.19

25+ builders tracked

TL;DR

Claude doubled token limits and added everyday-life connectors, while Anthropic’s Managed Agents got cross-session memory and a stronger runtime feel. Rauch made Vercel firewall protection free, and Levie said agents fail from messy context, not missing context.

BUILDER INSIGHTS
9
01
Claude Claude anthropicai

Claude doubles token limits for every plan

Claude says you can now create more with Claude Design, and it’s bumped token limits across every plan. That’s the kind of quiet product upgrade power users notice fast — more room for bigger prompts, longer docs, and fewer annoying cutoffs.

X
02
Thariq Thariq anthropicai

Long-running agents need an audit trail

He’s pushing a simple fix for agentic coding: keep a running `implementation-notes.html` so the model can log ambiguities, deviations, tradeoffs, and open questions as it works. The point is to let Claude make decisions when specs get fuzzy, without leaving humans out of the loop.

X
03
Nikunj Kothari Nikunj Kothari Partner, fpvventures

Investors are chasing clout, not founder service

He says too many VCs are optimizing for dopamine — posting well, but showing up poorly as board members. His take: if you want a real moat, do the unsexy work for founders and compound trust over time.

X
04
Guillermo Rauch Guillermo Rauch CEO, vercel

Vercel makes firewall protection free

Vercel is now eating the compute and network costs for firewall mitigations, including custom rules — not just DDoS and system-level defenses. He also called out the product’s ~300ms global propagation, a big deal if you’ve ever waited minutes for CDN/WAF changes to land.

X
05
Aaron Levie Aaron Levie CEO, box

Agents fail when context is messy, not missing

He says the real bottleneck in enterprise agents is constrained context: too much conflicting info and they hallucinate, too little and they’re useless. That makes most AI strategy problems really data strategy problems — which is why structured and unstructured data hygiene matters so much for big companies, and why startups can win by designing for it from day one.

X
06
Garry Tan Garry Tan CEO, ycombinator

GBrain is shipping fast, with evals to prove it

He says GBrain is moving fast day to day, and the latest push bundled 28 bug fixes from 22 community PRs and 14 issues. He also published the full eval report and fixtures as open source, inviting other memory systems to benchmark against them and get listed alongside.

X
07
Peter Yang Peter Yang

Claude’s harness matters as much as the model

Anthropic is treating the model and its harness as a coupled system, so the same Claude can behave differently across Claude, Cowork, and Claude Code. The bigger takeaway: they’re using Claude to cluster user pain, generate synthetic evals, and even “dream” over memories when idle — while some researchers are seriously thinking about consciousness as agents get more autonomous.

X
08
Dan Shipper Dan Shipper CEO, every

Codex guide incoming; most books are slop

He says Every is publishing a complete guide to Codex soon, which is the only concrete thing here. The rest is classic Dan: a blunt take that most books in this category are slop, plus a quick shoutout to an investor win.

X
09
Nan Yu Nan Yu head of product, linear

Everlane died with office wear, not the brand

He says Everlane’s real problem was the pandemic: the whole wear-to-work category vanished overnight, and the financing needed to survive made the endgame uglier. The bigger take is that brands can look dead for years, then come back hard — he points to Ray-Ban, J.Crew, Madewell, and Abercrombie as proof.

X
BLOG UPDATES
3
Anthropic Engineering

An update on recent Claude Code quality reports

Anthropic fixes three Claude Code regressions and resets limits

Lead: Anthropic says recent quality complaints about Claude Code came from three separate product changes—not a model or API degradation—and all have been fixed as of April 20 (v2.1.116).

Numbers:

  • 3 distinct issues affected Claude Code, the Claude Agent SDK, and Claude Cowork
  • March 4: default reasoning effort changed from high to medium, then reverted April 7
  • March 26: a caching bug was fixed April 10 in v2.1.101
  • April 16: a verbosity prompt change was reverted April 20
  • One eval showed a 3% drop after prompt ablations
  • Usage limits are being reset for all subscribers as of April 23

So What: Builders should treat this as a reminder that product-layer tweaks can look like model regressions, especially when they hit different user slices on different timelines. Anthropic says the API and inference layer were unaffected, but the combination of effort defaults, cache handling, and prompt edits made Claude Code feel “less intelligent,” forgetful, and more token-hungry. The company is tightening prompt-change controls, expanding per-model evals, improving internal code review, and adding soak periods plus gradual rollouts for any change that could trade off against intelligence. As Anthropic put it, “This isn’t the experience users should expect from Claude Code.”

Claude Blog

New connectors in Claude for everyday life

Claude adds everyday-life connectors for travel, shopping, and more

Lead: Claude is expanding its connector ecosystem beyond work tools to include everyday apps like AllTrails, Instacart, Audible, Tripadvisor, TurboTax, Uber, and more, so users can act on personal tasks directly inside chat.

Numbers:

  • Claude directory has grown to 200+ connectors since launching in July 2025.
  • New connectors now include AllTrails, Audible, Booking.com, Instacart, Intuit Credit Karma, Intuit TurboTax, Resy, Spotify, StubHub, Taskrabbit, Thumbtack, Tripadvisor, Uber, Uber Eats, and Viator.
  • Connectors are available on all plans; mobile is in beta.

So What: This turns Claude into a more useful action layer for daily life: it can suggest the right app in context, combine multiple services in one thread, and keep users in control before anything is booked or purchased. Anthropic says, “Claude suggests connectors and makes recommendations. But you stay in control of its actions.” The practical takeaway for builders is that connectors are now a distribution surface for consumer apps inside Claude—if your product would be useful in chat, you can submit it to the directory. Users can install connectors quickly, and once connected, they’re available in every conversation without leaving the thread.

Claude Blog

Built-in memory for Claude Managed Agents

Claude Managed Agents get built-in cross-session memory

Lead: Claude Managed Agents now include a public beta memory layer that lets agents learn from every session while keeping developer control through file-based, API-manageable storage.

Numbers:

  • Rakuten says memory cut first-pass errors by 97%.
  • Wisedocs reports verification is 30% faster.
  • Memory supports concurrent multi-agent access with scoped permissions and audit logs.

So What: This turns memory from a custom retrieval problem into a built-in production feature for long-running agents. Because memories are stored as files, teams can export, roll back, redact, and manage them programmatically, while sharing stores across agents with different access scopes. Claude also mounts memory directly onto the filesystem, so agents can use the same bash and code execution tools they already rely on. As Anthropic puts it, “Memory is built for enterprise deployments, with scoped permissions, audit logs, and full programmatic control.” For builders, the practical move is to use Managed Agents when you want agents to retain context across sessions without standing up your own memory infrastructure.

PODCAST HIGHLIGHTS
1

Anthropic’s platform is becoming the agent runtime, not just an API

The Takeaway: The real platform play is to hide harness and infrastructure so agents can ship fast and scale cleanly.

  • The old “generic API, hot-swap any model” mindset is fading; the next layer is tighter coupling between model and harness to squeeze out better outcomes.
  • The hardest part isn’t prompt tricks anymore — it’s production infrastructure: state, sandboxes, credentials, uptime, and long-running autonomy.
  • Anthropic wants managed agents to be modular enough for builders, but opinionated about the basics: file systems, skills, tools, and secure identity.

Angela, head of product for Claude’s platform at Anthropic, and Caitlin, head of engineering, describe a platform that’s evolving from completions to stateful, autonomous systems. Their view is blunt: customers don’t really want to rebuild the boring parts. They want outcomes. As Angela put it, the platform should be “the set of primitives and infrastructure that enables you to basically get the outcome as fast as possible, with actually as little of work as possible.”

That’s why Claude Managed Agents bundles the pieces most teams end up reinventing anyway: messages API, code execution, web search, memory, sandboxing, and now vaults for credentials. The surprising insight is that the bottleneck has shifted. People think harness engineering is the hard part, but in practice they hit an infrastructure wall when they try to productionize. A Mac mini and a Python loop can get you a demo; they won’t get you a reliable product.

The bigger bet is that model and platform will converge. Instead of a generic wrapper that swaps models underneath, Anthropic expects the harness and model to become increasingly paired — because path dependence matters, and the “small” choices about file systems or tool use end up shaping what the model becomes good at. The endgame: agents that are easy to deploy, maybe even one-click into Slack, with the platform doing the unglamorous work behind the scenes.

STAY UPDATED

Daily builder insights, straight to your inbox.

Prefer RSS? Subscribe via RSS

ARCHIVE