How much runway did Claude-dependent wrappers actually lose this week?

Roughly 20–40% of effective runway, because subscription-tier arbitrage that delivered 70–90% gross margins on inference is gone. A wrapper paying $200/month for tokens that now cost $700–2,000 at API rates has seen unit economics flip toward negative on those workloads, and most founders haven't yet flagged the change to their boards.

Which coding-agent wrappers actually survive the Anthropic repricing?

Three archetypes: open-source distribution plays like Cline that own the anti-lock-in segment, bounded-execution and enterprise-security tools like Cursor's cloud agents that compete on trust rather than tokens, and anything accumulating proprietary workflow data the model layer can't replicate externally. Everything else is a thin subscription-arbitrage wrapper waiting to be marked down.

Why apply a reversibility discount to LLM-layer ARR?

Because enterprise AI spend lacks the structural lock-in of traditional SaaS — no SLAs, no per-user telemetry, and low switching costs mean each model release is effectively a procurement reset. ServiceNow exhausting its full-year Anthropic budget by May, with its CIO publicly calling Claude consumer-grade, is the proof point. A 20–40% discount against comparable SaaS multiples is defensible before the next LP meeting.

What's the investable window for agent infrastructure pure-plays?

Roughly 12–18 months before incumbents like SAP, ServiceNow, and Salesforce either acquire the category or ship competing fabric. The wedges that fit inside that window are vertical orchestration with institutional context, agent identity and governance, MCP tooling and knowledge-graph infrastructure, and agent-specific observability distinct from model-layer FinOps.

What does LiteLLM hitting CISA's KEV catalog mean for AI security deal flow?

It establishes AI infrastructure as a federally recognized attack surface, which creates the first defensible enterprise budget line for AI security as a standalone category rather than an AppSec feature. Combined with NSA-routed access to Anthropic's Mythos and OpenAI's Daybreak partner list, it signals pulling AI-gateway firewalls, model-artifact scanners, and LLM-runtime sandboxes forward by a quarter at Series A entry.

Edition 2026-05-25 · read as Investor

AnthropicCreditSwapVaporizesCodingWrapperMargins

Sources: 36
Words: 1,927
Read: 10min

Topics AI Capital Agentic AI LLM Inference

◆ The signal

Anthropic converted every Claude subscription into dollar-matched API credits on Monday, which is a polite way of ending the seventy to ninety percent margin arbitrage that dozens of coding-agent wrappers were quietly running. OpenAI answered within hours with two months of free Codex for enterprise switchers. Ramp also has Anthropic passing OpenAI in business adoption for the first time, 34.4 to 32.3. Every Claude-dependent name in the book lost twenty to forty percent of effective runway this week. Most of them have not noticed yet.

◆ INTELLIGENCE MAP

01
Coding-Agent Pricing War Erupts — Wrapper Economics Collapse
act now
Anthropic killed subscription arbitrage (70-90% margin) by converting plans to API credits. OpenAI countered with 2 months free Codex. Ramp confirms Anthropic at 34.4% vs OpenAI 32.3% in business spend. Third-party harness providers face two-sided squeeze as Notion ships agent hosting.
34.4%
Anthropic enterprise share
12
sources
- Anthropic biz share
- OpenAI biz share
- Wrapper margin lost
- Anthropic IPO target
1. Anthropic34.4
2. OpenAI32.3
02
Enterprise AI Revenue Quality Crisis — The SLA Gap
act now
ServiceNow exhausted its full-year Claude budget by May with zero SLAs and no granular usage telemetry. Google, OpenAI/Bain, and Salesforce are all hiring hundreds of FDEs — conceding that deployment, not model capability, is the bottleneck. AI observability/FinOps is forming as a standalone category.
May
annual budget exhausted
6
sources
- Budget exhaustion
- Modal valuation
- AI-ready enterprises
- FDE demand signal
1. Enterprises AI-ready for agentic deployment15
03
Agent Infrastructure Layer: 12-Month Window Before Incumbents Close It
monitor
Vercel production data shows 59% of token volume is agentic. a16z published GTM thesis with Stitch check — system of intelligence over system of record. SAP committed €100M. ServiceNow shipped headless Action Fabric. Incumbents moving first this cycle; pure-play window narrowing to 12-18 months.
59%
agentic token volume
8
sources
- Agentic workload share
- Anthropic spend share
- Google volume share
- SAP fund size
1. Agentic workloads59
2. Chat/completion41
04
AI-Native Security Crosses from Thesis to Budget Line
monitor
LiteLLM added to CISA KEV — first AI-infra software federally flagged as actively exploited. Microsoft MDASH shipped 16 validated CVEs in one Patch Tuesday. Anthropic Mythos cleared both AISI ranges, with NSA (not CISA) winning access. OpenAI launched Daybreak with 8 incumbent 'partners.'
16
MDASH CVEs shipped
7
sources
- MDASH CVEs/cycle
- LiteLLM exploit
- PraisonAI weaponized
- Deepfake losses 2027
1. 01Microsoft MDASH (autonomous)16 CVEs/cycle
2. 02Mozilla/Mythos (custom harness)271 bugs
3. 03Mythos (generic scan)1 CVE
4. 04DepthFirst vs Mythos cost10x cheaper
05
AI IPO Window Confirmed — First-Day Pops Set Permanent Comps
background
Cerebras closed day 1 at $311 (70% pop, $41.7B market cap). Eclipse netted 17x. Fervo Energy popped 33% to $10B+. Benchmark raised a $225M SPV, breaking its early-stage-only model. The comp set for every pre-IPO AI infra name permanently moved up this week.
70%
Cerebras first-day pop
6
sources
- Cerebras close
- Eclipse return
- Fervo pop
- Benchmark SPV
1. Cerebras70
2. Fervo Energy33
3. Nebius (Q1 rev growth)684

◆ DEEP DIVES

01
The Wrapper Margin Collapse: Anthropic's IPO-Ready Pricing Move Kills a Business Model
What Happened
On May 12–13, Anthropic and OpenAI repriced the coding-agent market on the same afternoon. Anthropic converted every Claude subscription into a dollar-matched API credit pool, meaning a $200 plan now buys exactly $200 of programmatic tokens and nothing more. That kills the 70–90% arbitrage the third-party harnesses (Cline, OpenCode, OpenClaw) had been running by funneling production workloads through subscription-tier access. OpenAI answered within hours with two months of free Codex for enterprise switchers inside a 30-day enrollment window.
Read against Ramp's April data showing Anthropic at 34.4% of business spend vs OpenAI at 32.3%, which is the first documented lead change, plus a fresh CFO hire and a likely October IPO target, the read is narrow: margin recovery dressed as developer generosity, timed to pre-IPO diligence.
What this does to wrapper margins
Any Claude-dependent portfolio company whose COGS quietly assumed subscription-rate token access just absorbed a structural margin hit. The arithmetic is unkind. A wrapper paying around $200/month for inference that would cost $700–2,000 at API rates has watched gross margin on that workload move from above 70% to potentially negative inside a single weekend. The change is four days old, and most founders have not flagged it to their boards.
Call it margin recovery, not a pricing tweak. The model layer is reclaiming the surplus that funded an entire category of startups.
Notion's External Agents API, which now hosts Claude, Codex, Cursor, Decagon, Warp, and Devin inside the same workspace, adds a second compression vector. The interface layer is being metered from above by the model vendors and aggregated from below by the workspace platforms, which leaves whoever is sitting in the middle paying retail and selling wholesale.
The Duopoly Subsidy Fight
Multiple sources describe the same dynamic. Anthropic is spending pre-IPO capital to lock in developers while OpenAI rents enterprise switching with free quarters, and the layer between them pays for both. Sources disagree on whether this is permanent or tactical, and there are at least three readings worth holding at once: the October IPO is a natural sunset for Anthropic's subsidy, OpenAI's counter expires the moment the Ramp share gap widens, and a third where neither side blinks because the developer surface is now considered strategic enough to bleed for indefinitely. This is probably wrong, but the third reading is the one I would underwrite.
The training-efficiency stack matters here too. Nous Research's Token Superposition Training shows 2–3x wall-clock speedup validated to 10B MoE, and Datology claims +11.7 points across 20 benchmarks at 17x less compute. If either replicates at scale, the capex moat sitting underneath both labs starts to compress, and the subsidy fight gets cheaper to sustain rather than shorter.
Surviving Moats
The wrappers that survive this fall into a narrow set:
1. Open-source distribution, where Cline's rebuilt SDK with subagents and scheduled jobs owns the slice of the community that refuses vendor lock-in
2. Bounded execution / enterprise security, where Cursor's cloud agents with full dev environments, rollback, and isolated secrets compete on trust rather than token economics
3. Proprietary workflow data, meaning any tool accumulating institutional context the model layer cannot replicate from the outside
Everything else is a thin wrapper running on subscription arbitrage. That is a markdown waiting to be written down.
Action items
- Call every Claude-dependent portfolio company CEO this week and request updated gross margin models assuming API-rate billing
- Kill or restructure any active term sheet for a coding-agent wrapper without one of the three surviving moats (OSS distribution, enterprise security, proprietary data)
- Accelerate Anthropic pre-IPO secondary sizing decisions before book-building begins in August
- Request Vercel AI Gateway production data as a recurring diligence benchmark for every model-layer pitch
Sources:Anthropic squeezing its pre-IPO round · Anthropic is now at thirty billion dollars of annualized revenue · Anthropic at $900B > OpenAI's $852B · Vercel's first production AI Gateway index · Anthropic has apparently overtaken OpenAI in enterprise
02
The Enterprise AI SLA Gap: $30B of Revenue Built on Consumer-Grade Plumbing
The ServiceNow Tell
ServiceNow — one of the more sophisticated enterprise software buyers anyone has ever underwritten — exhausted its full-year Anthropic budget by May 2026. Not because Claude was bad. Because Anthropic ships no granular per-user telemetry, no usage dashboards, and no SLAs worth the paper. National Life Group's CIO put it about as plainly as a CIO will: Anthropic is 'great for consumer usage but not great for companies.'
This is the asset the secondary market is being asked to mark at $900B+ on an enterprise revenue thesis. Consumer-grade plumbing does not, historically, clear enterprise-grade multiples.
The Observability Category Is Forming in Real Time
ServiceNow's internal response is the part to read twice: they built AI Control Tower to monitor what the vendor would not. That is a CDIO who watched a category form inside her own P&L and decided to be the vendor instead of the line item. Around her:
- Modal at $4.5B on inference orchestration revenue that is, by all accounts, surging
- Only 15% of enterprises have data foundations ready for agentic AI at scale, per the Fivetran index, despite spending in the millions
- Nearly half cite data quality and lineage as the actual blocker
- Four firms — Google, OpenAI/Bain, Salesforce, ServiceNow — independently concluded the margin lives in deployment, not the model
When four firms independently copy the Palantir FDE playbook in the same quarter, the margin is in deployment services. Period.
The Revenue Quality Discount
Several lines of evidence converge on a structural fact: enterprise AI spend is reversible in a way traditional SaaS never quite was. No SLAs means no contractual lock-in. No telemetry means budget blowouts happen quietly. Low switching costs mean the next model release becomes a procurement reset. Reasonable people disagree on severity — one camp says the Ramp lead is durable because enterprise buyers are sticky by nature, the other points at OpenAI's 0.3% YoY growth and concludes stickiness is not, in fact, doing the work here.
The synthesis, or rather the version of it I am willing to defend: LLM-layer ARR deserves a 20–40% reversibility discount against traditional enterprise SaaS. Apply it before the LP meeting, not during.
The FDE Land Grab
Google Cloud is hiring hundreds of forward-deployed engineers. OpenAI stood up DeployCo with Bain Capital and is openly recruiting Palantir alumni. Salesforce and ServiceNow are staffing the same function under different titles. The industry has conceded what Palantir figured out roughly twenty years ago: deployment is the bottleneck, and the bottleneck is where the margin sits. The software that productizes 60% of FDE work — context ingestion, custom eval harnesses, workflow templates — captures margin the labs cannot defend.
Action items
- Demand SLA and usage-telemetry roadmap from every model-layer company in the portfolio pitching enterprise ARR — flag those without as higher churn risk
- Launch a sourcing sprint on AI observability/FinOps at Seed-to-Series A: token-level cost attribution, per-user spend caps, SLA monitoring across model APIs
- Apply a 20-40% 'reversibility discount' to LLM-layer ARR multiples in every active deal model where SLAs and telemetry are absent
- Build a Palantir-alumni network map for FDE-layer investment targets and advisor sourcing
Sources:Anthropic has an enterprise gap · AI compute is sold out 4:1 · Anthropic's 80x growth broke its infra · DuckDB goes client-server, 85% agentic-AI readiness gap · Anthropic at $900B + Nebius +684%
03
Agent Infrastructure: The Race Between a16z's Thesis and SAP's Checkbook
The Category Is Being Named by Incumbents
Three things showed up in the same week, which is either coincidence or the kind of coincidence that isn't. a16z published its GTM software thesis arguing the system of intelligence captures the next decade's value, backed by the Stitch investment. SAP committed €100M to an Autonomous Enterprise fund wired to NVIDIA and Microsoft. ServiceNow shipped Action Fabric, a headless architecture built to host third-party AI agents via MCP servers. The autonomous enterprise category is being drawn by the incumbents before the pure-plays get to name it.
This is the opposite shape from cloud, where AWS defined infrastructure and the incumbents caught up roughly a decade late. This time the ERPs and workflow platforms moved first.
The Production Data
Vercel's AI Gateway index — the first production-grade measurement across more than 200,000 teams — delivers the numbers worth arguing about:
- Fifty-nine percent of token volume is agentic, which makes it the majority case rather than the niche
- Anthropic takes sixty-one percent of spend, on Opus, as premium reasoning
- Google takes thirty-eight percent of volume, on Flash, as commodity throughput
Two different businesses are now visible inside the phrase 'foundation models,' or rather, the more interesting version: one priced like a consultant, one priced like a utility. The spend-versus-volume split validates multi-model routing as the enterprise default and closes the argument that a single lab runs the table.
The Lemkin Proof Point
Jason Lemkin's SaaStr anecdote, which a16z is now using as the unit-economics template, runs as follows: Salesforce cut from more than ten human seats to two humans plus one API seat, while spend rose eighty-three percent, from twelve thousand to twenty-two thousand dollars, with twenty-plus agents running on top. The seat count collapsed. The bill went up.
If NRR structurally exceeds one hundred and fifty percent in this model, it justifies premium multiples. Three customers showing the same pattern is the Series A bar.
Where the Window Closes
This is probably wrong, but the sources disagree on timing in a useful way. One camp argues the incumbents absorb the intelligence layer through acquisition within eighteen months and the wedge narrows. The other points to Lemkin's observation that startups are currently moving faster than Salesforce and ServiceNow on agentic fabric. The synthesis is that twelve to eighteen months is the investable window for agent infrastructure pure-plays at reasonable multiples. After that, either the winners have compounded or the incumbents have written the check.
The wedges that fit inside that window:
1. Vertical orchestration with institutional-context accumulation, the Stitch archetype
2. Agent identity and governance, the gap Levie keeps flagging
3. MCP tooling and knowledge-graph infrastructure, now settling as enterprise default
4. AI observability, agent-specific, distinct from model-layer FinOps
Every dollar SAP deploys into this layer eventually is a dollar of Series A/B markup for the companies already on the cap table.
Action items
- Source 5+ agent infrastructure deals in MCP tooling, agent identity, agent governance, and agent observability before SAP's corp dev activation compresses entry points
- Rerank AI-GTM pipeline by orchestration moat depth — prioritize companies with narrow high-frequency workflows and measurable outputs over horizontal 'AI copilot for sales' plays
- Request consumption/agent-usage metrics in all active AI-GTM diligence — specifically agent-to-seat ratio, API call volume growth, and NRR on existing logos
- Map pipeline against 'agent-hosting platform' displacement risk — identify which deals get killed if Notion, Airtable, or Cursor absorbs the workflow
Sources:a16z has published another map of where value accrues · SAP wrote a check for one hundred million euros · Vercel's first production AI Gateway index · Anthropic at nine hundred billion dollars · Anthropic has apparently overtaken OpenAI

AI Security Crossed the KEV Line — Category Formation Is No Longer a Pitch

The Category-Defining Week

Four events crystallized AI security as a standalone budget category in the same seven days:

LiteLLM added to CISA's KEV catalog — the first LLM-routing control plane federally flagged as actively exploited
Microsoft MDASH shipped 16 validated Windows CVEs in a single Patch Tuesday — production-scale autonomous vulnerability discovery, not a demo
Anthropic's Mythos cleared both UK AISI attack ranges — first model to do so, with Congress routing access through NSA rather than CISA
OpenAI launched Daybreak with Cloudflare, Cisco, CrowdStrike, PANW, Oracle, Zscaler, Akamai, and Fortinet as 'partners'

When a platform launches with every major incumbent listed as a friend, the phase is pre-disintermediation, not partnership.

The Economics Are Bifurcating

DepthFirst's Open Defense Initiative landed FFmpeg, Envoy, and Kata as anchor tenants claiming 10x cost efficiency over Mythos — 12 memory corruption bugs for ~$1,000 vs Anthropic's ~$10,000 across several hundred scans. The AI-security market is splitting into:

Tier	Player	Economics
Specialized harness	DepthFirst, custom harness (Mozilla: 271 bugs)	10x cheaper than frontier models
Frontier capability	Mythos, MDASH	Premium pricing, capability-led
Generic wrapper	Various	Squeezed from both sides

Mozilla found 271 bugs in Firefox with a custom agentic harness on top of Claude — while Mythos found 1 real CVE in curl with a generic scan. Harness design beats model selection. That is the moat signal for any AI AppSec deal in your pipeline.

The NSA Signal

Congress routing Mythos access through NSA rather than CISA tells you which agency the first government dollars route through. This is offensive/IC-led procurement, not civilian defensive distribution. The implication for deal flow: tilt toward teams with TS/SCI talent, Five Eyes partnerships, and offensive-AI primitives wrapped in compliance.

EDR Moats Are Cracking

TrustedSec ran LLMs at five commercial EDRs and reported that reverse engineering that took weeks now takes days. All five products share identical architectural furniture — YARA rules, Lua engines, local ML classifiers. Detection-rule IP is becoming a commodity input. Daybreak's partner list is the confirmation: when the platform lists every incumbent as a friend, it is pre-disintermediation.

AI security is pricing as an AI feature and transacting as a defense-tech category. The Series B window closes on whichever framing the market agrees to first.

Action items

Pull forward AI-security diligence by one quarter — specifically AI-gateway firewalls, model-artifact scanners, and LLM-runtime sandbox plays targeting Series A entry
Apply the Mozilla-vs-curl test to every AI AppSec pitch: demand explanation of harness investment that produces 271 bugs instead of 1
Stress-test public EDR names (CRWD, PANW) assuming 30-50% erosion in detection-rule moat over 24 months; revisit any private EDR deals at 2025 prices
Run a portfolio-wide exposure sweep for LiteLLM, Traefik, Argo CD, and PAN-OS User-ID Portal — escalate any that are internet-exposed

Sources:Anthropic's Mythos cleared AISI this week · DepthFirst's Open Defense Initiative · The EDR moat is cracking · AI-infra CVEs hit KEV · Microsoft shipped sixteen validated Windows CVEs · AI infra capex hits $100B/partnership

◆ QUICK HITS

Update: Cerebras closed day 1 at $311 (70% pop) — Eclipse netted 17x, Tiger sitting on ~$1B paper gain at 3.5x in months; Benchmark's $225M SPV broke its own early-stage-only model to defend ownership
Cerebras printed a seventy percent first-day pop
Update: xAI leased entire Colossus 1 cluster (220K+ GPUs including GB200s) to Anthropic — a de facto retreat from the frontier race; reprice any xAI exposure as neocloud + X-distribution, not frontier lab
Anthropic's 80x growth broke its infra
Apple building agent governance into App Store — privacy standards, fee capture, and blocking agents from spinning up unapproved sub-apps; WWDC reveal expected, two-week window to pre-position portfolio
Anthropic has apparently overtaken OpenAI in enterprise
Abridge raised $550M at $5.3B with 250 health systems and 80M+ annual conversations — healthcare AI ambient scribe category is closed; redirect to payer-side prior auth and adjacent workflow layers
Abridge at $5.3B: the healthcare AI vertical just printed a category winner
Anthropic's June 15 third-party credit unbundling is a margin event disguised as policy — stress-test every portfolio company using Claude integrations against the new pricing before it takes effect
Vercel's first production AI Gateway index
Nebius guided to 6x revenue growth ($530M→$3.4B) with 4+ customers bidding per GPU — neocloud pricing power confirmed but unit economics ugly: $2.47B capex vs $2.26B operating cash
AI compute is sold out 4:1
a16z became #1 US political donor at $115.5M including $50M to 'Leading the Future' AI PAC — regulatory capture is now a visible, expensive portfolio-protective line item
AI governance crisis meets China decoupling
Six-month-old companies raising at $3.4B (Mind Robotics) and $4B+ (Recursive Superintelligence, Nvidia + AMD) on founder-brand alone — set IC guardrail: no AI Series A/B above 50x forward revenue
Anthropic is reportedly running at thirty billion dollars

◆ Bottom line

The take.

Anthropic killed the subscription-token arbitrage that funded a generation of coding-agent wrappers, OpenAI countered with free Codex, and Ramp confirmed the enterprise share flip (34.4% vs 32.3%) — all in the same week ServiceNow revealed it blew its annual Claude budget by May because Anthropic has no SLAs and no telemetry. The model layer is now a duopoly subsidy fight, wrapper economics are dead, and $30B of enterprise AI revenue is built on consumer-grade plumbing that procurement teams are about to discover they never agreed to. Call your Claude-dependent portfolio companies today — not next board meeting.

Frequently asked

How much runway did Claude-dependent wrappers actually lose this week?: Roughly 20–40% of effective runway, because subscription-tier arbitrage that delivered 70–90% gross margins on inference is gone. A wrapper paying $200/month for tokens that now cost $700–2,000 at API rates has seen unit economics flip toward negative on those workloads, and most founders haven't yet flagged the change to their boards.
Which coding-agent wrappers actually survive the Anthropic repricing?: Three archetypes: open-source distribution plays like Cline that own the anti-lock-in segment, bounded-execution and enterprise-security tools like Cursor's cloud agents that compete on trust rather than tokens, and anything accumulating proprietary workflow data the model layer can't replicate externally. Everything else is a thin subscription-arbitrage wrapper waiting to be marked down.
Why apply a reversibility discount to LLM-layer ARR?: Because enterprise AI spend lacks the structural lock-in of traditional SaaS — no SLAs, no per-user telemetry, and low switching costs mean each model release is effectively a procurement reset. ServiceNow exhausting its full-year Anthropic budget by May, with its CIO publicly calling Claude consumer-grade, is the proof point. A 20–40% discount against comparable SaaS multiples is defensible before the next LP meeting.
What's the investable window for agent infrastructure pure-plays?: Roughly 12–18 months before incumbents like SAP, ServiceNow, and Salesforce either acquire the category or ship competing fabric. The wedges that fit inside that window are vertical orchestration with institutional context, agent identity and governance, MCP tooling and knowledge-graph infrastructure, and agent-specific observability distinct from model-layer FinOps.
What does LiteLLM hitting CISA's KEV catalog mean for AI security deal flow?: It establishes AI infrastructure as a federally recognized attack surface, which creates the first defensible enterprise budget line for AI security as a standalone category rather than an AppSec feature. Combined with NSA-routed access to Anthropic's Mythos and OpenAI's Daybreak partner list, it signals pulling AI-gateway firewalls, model-artifact scanners, and LLM-runtime sandboxes forward by a quarter at Series A entry.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

AnthropicCreditSwapVaporizesCodingWrapperMargins

◆ INTELLIGENCE MAP

◆ DEEP DIVES

What Happened

What this does to wrapper margins

The Duopoly Subsidy Fight

Surviving Moats

The ServiceNow Tell

The Observability Category Is Forming in Real Time

The Revenue Quality Discount

The FDE Land Grab

The Category Is Being Named by Incumbents

The Production Data

The Lemkin Proof Point

Where the Window Closes