Investor daily

Edition 2026-05-20 · read as Investor

AnthropicCreditSwapEndstheClaude-WrapperArbitrage

Sources
36
Words
1,854
Read
9min

Topics AI Capital Agentic AI LLM Inference

◆ The signal

Anthropic converted Claude subscriptions into dollar-matched API credits this week, which is a polite way of ending the seventy-to-ninety percent arbitrage that quietly underwrote most Claude-wrapper business models. Any portfolio company whose unit economics assumed twenty dollars a month bought two hundred dollars of inference is now repricing its margin structure in real time. ServiceNow, meanwhile, burned its entire annual Claude budget by May without any telemetry to notice. The revenue underwriting nine hundred billion dollars of marks is being booked without the SLAs, governance, or switching costs that would make it durable. That is the trade.

◆ INTELLIGENCE MAP

  1. 01

    Enterprise AI Revenue Quality Crisis

    act now

    Anthropic's subscription-to-API credit conversion ends the 70-90% arbitrage powering Claude wrappers. ServiceNow blew its full-year budget by May with no SLAs or telemetry. Ramp confirms Anthropic at 34.4% vs OpenAI 32.3% — the enterprise flip is real, but revenue quality is consumer-grade.

    70-90%
    margin arbitrage eliminated
    9
    sources
    • Anthropic B2B share
    • OpenAI B2B share
    • Budget blowout timing
    • Wrapper margin loss
    1. Anthropic34.4
    2. OpenAI32.3
    3. Google15.2
    4. Others18.1
  2. 02

    Agent Economy Crosses Majority — Platform Wars Begin

    monitor

    Vercel's production data shows 59% of token volume is now agentic workloads. Apple is walling off iOS agent access via App Store governance. SAP committed €100M, Notion shipped a developer platform, and a16z declared system-of-intelligence as the decade's GTM thesis. The window between category formation and platform absorption is 12-18 months.

    59%
    agent token share
    7
    sources
    • Agent token volume
    • SAP agent fund
    • Anthropic spend share
    • Google volume share
    1. Agentic workloads59
    2. Chat/completion28
    3. Embeddings/other13
  3. 03

    Compute Scarcity Structural: xAI Leases to Sworn Enemy

    monitor

    Anthropic leased xAI's entire Colossus 1 cluster (220K GPUs) from Musk — rivals renting from enemies is the clearest demand signal possible. Nebius printed 684% YoY growth with capex exceeding operating cash. Fervo Energy debuted at $10B+ (33% pop) on AI power demand. Compute supply is the binding constraint through 2027.

    220K
    GPUs leased from rival
    6
    sources
    • Colossus GPUs leased
    • Nebius YoY growth
    • Fervo IPO pop
    • Customers per GPU
    1. Nebius 2026 guide3.2
    2. Nebius 2025 rev0.53
    3. Nebius Q1 capex2.47
    4. Nebius Q1 op cash2.26
  4. 04

    AI Security Hits Production: First KEV + Platform Disintermediation

    monitor

    LiteLLM became the first AI-infrastructure component on CISA's KEV (actively exploited). Microsoft's MDASH shipped 16 validated CVEs in one Patch Tuesday. OpenAI launched Daybreak with 8 incumbent 'partners' — the setup pattern before platform eats partners. EDR detection IP is being reverse-engineered by LLMs in days, not weeks.

    16
    AI-discovered CVEs shipped
    6
    sources
    • MDASH CVEs shipped
    • LiteLLM status
    • EDR teardown time
    • Daybreak partners
    1. LiteLLM on KEVFirst AI-infra actively exploited
    2. MDASH ships 16 CVEsAutonomous vuln discovery at scale
    3. OpenAI DaybreakPlatform enters security
    4. PraisonAI 4hr exploitAI agents are attacker targets
  5. 05

    Vertical AI Data Moats Print Category Winners

    background

    Abridge raised $550M at $5.3B serving 250 health systems with 80M+ annual conversations — an unreplicable data moat. Health systems compressed release cycles from quarterly to monthly. The wedge-and-expand template (save time → save money → save lives) is now the underwriting framework for vertical AI.

    $5.3B
    healthcare AI winner
    2
    sources
    • Abridge valuation
    • Health systems live
    • Annual conversations
    • Languages supported
    1. 01Abridge (clinical docs)$5.3B
    2. 02Anthropic (horizontal)$900B
    3. 03Modal (inference)$4.5B
    4. 04Recursive SI (frontier)$4B+

◆ DEEP DIVES

  1. 01

    Anthropic's Pricing Reset Kills Wrapper Economics — and Reveals the Revenue Quality Problem Underneath

    The Two-Sided Squeeze

    On May 12-13, Anthropic converted every Claude subscription into a dollar-matched API credit pool, which is a polite way of saying that a $200 plan now buys exactly $200 of programmatic tokens and not a token more. The seventy to ninety percent arbitrage that third-party harnesses (Cline, OpenCode, and a long list of portfolio-company dev tools) had been quietly running against subscription tiers is gone. OpenAI answered inside the same news cycle with two months of free Codex for enterprise switchers.

    The timing is not an accident, or rather, the more interesting version is that it lines up with a new CFO and a likely October IPO. Anthropic is cleaning up margins for public-market diligence while defending the Ramp lead — 34.4% versus OpenAI's 32.3% in business spend, the first documented leadership flip.

    Any Claude-dependent portfolio company whose COGS ran against subscription tokens has lost 20-40% of runway since Friday. The change is four days old — most founders haven't flagged it yet.

    The Revenue Quality Problem

    ServiceNow — one of the most sophisticated enterprise buyers on earthexhausted its full-year Anthropic budget by May, which tells you something about either the appetite or the meter. Anthropic offers no granular per-user telemetry, no SLAs worth the name, and no enterprise dashboard that wouldn't embarrass a mid-tier SaaS vendor circa 2014. National Life Group's CIO put it without ornament: Anthropic is great for consumer usage but not great for companies.

    This is the structural contradiction at the $900B valuation mark. The revenue justifying that number has consumer-grade stickiness: no contractual lock-in, no switching costs beyond re-prompting, and budget overruns that procurement will eventually catch. Secondaries are pricing in SaaS-grade retention. The evidence is not that.

    Where the Alpha Sits

    This is probably wrong, but multiple sources converge on the same read: the gap between Anthropic's model quality and its enterprise infrastructure creates a standalone category. ServiceNow is already selling AI Control Tower into the gap. Google, OpenAI (via Bain's DeployCo), and Salesforce are hiring hundreds of forward-deployed engineers. The Palantir playbook, where deployment services capture more margin than the product, is now consensus.

    Emerging CategorySignalStage
    AI Observability/FinOpsServiceNow budget blowout; tokenmaxxing vocabularyNo winner yet — 6-12 month window
    FDE-as-a-Product4 firms independently hiring FDEsCategory forming; Palantir is the only comp
    Multi-model routingVercel data shows 61% spend / 38% volume splitSeries A pricing still reasonable

    Action items

    • Request updated gross-margin models from every Claude-dependent portfolio company by end of week — the 70-90% arbitrage is permanently gone
    • Launch a sourcing sprint on AI observability and FinOps-for-AI companies at Seed through Series A before end of quarter
    • Apply a 20-40% 'reversibility discount' to any LLM-layer ARR multiple in active diligence where SLAs and telemetry are absent

    Sources:AINews · Laura Bratton · TLDR AI · The Pragmatic Engineer · ben's bites · Morning Brew

  2. 02

    Agent Economy at 59%: The 12-Month Window Between Category Formation and Platform Absorption

    The Majority Case

    Vercel's first production AI Gateway index, drawn from downstream data across more than 200,000 teams rather than lab benchmarks, puts agentic workloads at 59% of all token volume. So the agent era is not the upside case anymore. It is the base rate, in production, today. The interesting bit is the split underneath: Anthropic takes 61% of spend on the expensive Opus reasoning calls while Google takes 38% of volume on cheap Flash throughput. Two entirely different businesses are emerging inside the thing we keep calling 'foundation models.'

    Multi-model routing is now the enterprise default, which means switching costs at the model layer approach zero. The moat has moved one layer up — to orchestration, workflow data, and institutional context.

    Platform Owners Are Moving

    Four platform moves landed inside seven days, which is either coincidence or, more likely, incumbents absorbing the agent layer before the pure-plays can define what it is.

    • Apple — building agent governance into the App Store; gating agents through review and fee capture; possible WWDC reveal in weeks
    • SAP — €100M Autonomous Enterprise fund, wiring NVIDIA and Microsoft into the platform layer
    • Notion — developer platform with Claude and Codex as hosted teammates; External Agents API commoditizing the harness layer
    • Google — Gemini Intelligence embeds autonomous task execution directly into Android at 97%+ share in emerging markets

    a16z formalized the thesis in the same week: value migrates from system of record to system of intelligence, with the Stitch investment as the deployed check. Jason Lemkin's number is the cleanest unit-economics template anyone has put on the table — cutting from ten-plus human seats to two humans plus twenty agents while spend rose 83%, from twelve thousand to twenty-two thousand dollars. The customer paid more and was happier about it.

    The Investable Window

    This is probably wrong, but the window looks specific: 12-18 months before incumbent absorption compresses returns. The wedges that survive are narrow and vertical, not horizontal, and they are not the ones with the best demos:

    WedgeWhy It SurvivesEntry Stage
    Agent identity and governance81% bot-detection bypass creates structural needSeed/Series A — 6 month window
    Vertical orchestration with institutional contextToo narrow for SAP/Notion to prioritizeSeries A — pricing still rational
    Multi-model routing and observability59% agent volume requires neutral routingSeries A/B — Vercel is the benchmark
    iOS-independent agent runtimesApple's walled garden creates the counter-tradePre-seed/Seed — watch WWDC

    The counter-thesis, which is not crazy: Salesforce, ServiceNow and SAP absorb the orchestration layer in one quarter rather than four, and the window closes before the checks clear. Enterprise software has done this before. The SAP fund announcement makes it more likely, not less.

    Action items

    • Map every pipeline deal against the Apple WWDC forcing event — any company whose GTM assumes friction-free iOS agent install needs a contingency before the June keynote
    • Source 3-5 agent infrastructure deals in MCP tooling, agent identity, and agent observability before Q3 Series A multiples re-rate
    • Request Vercel AI Gateway production index as a recurring diligence data source; use the spend-vs-volume split as a benchmark for every model-layer pitch

    Sources:ben's bites · a16z · TLDR IT · TLDR · Simplifying AI · TLDR Design

  3. 03

    xAI's Colossus Lease Confirms Compute Scarcity Is Structural — Reprice the Neocloud Thesis

    Rivals Renting From Enemies

    Anthropic leased the entire Colossus 1 cluster — 220,000+ NVIDIA GPUs including GB200s — from Elon Musk's xAI, which is interesting because the same Musk has publicly called Anthropic 'misanthropic and evil,' and people who think you are evil do not generally rent you their best asset unless the alternative is worse. This is not a partnership. It is what a shortage looks like when demand outruns plan by a factor most plans cannot survive: Anthropic modeled 10x growth and got 80x. The silent Claude Code nerfs, the Pro revocations, the enterprise bans — capacity triage dressed up as product roadmap.

    When rivals rent compute from declared enemies, you are not in a glut. You are in a shortage that bends strategy. The 'AI compute glut' is a public-markets narrative, not a private reality.

    The Neocloud Reality Check

    Nebius gave us the cleanest public read on neocloud unit economics anyone has bothered to publish: $2.47B capex against $2.26B operating cash flow in Q1, revenue up 684 percent, four-plus customers bidding per GPU. The business is structurally cash-negative at scale. That matters for anyone marking private positions to comp:

    CompanySignalImplication
    Nebius684% growth, capex > cash flowCompress private neocloud multiples from 8-10x to 2-3x forward revenue
    Fervo Energy$10B+ IPO (+33% day-1)AI-energy is a fundable sub-category; require hyperscaler offtake for entry
    CoreWeave peersxAI lease validates scarcityNeocloud multiples hold near-term despite ugly unit economics
    Cisco$9B AI orders (+80% YoY), stock +18%Networking pull-through is the cleanest capex proxy

    The xAI Repricing

    Several independent reads converge on the same conclusion, which is rare enough to take seriously: xAI is no longer operating as a frontier lab. Leasing roughly 45 percent of current compute to a direct competitor, trailing DeepSeek and Qwen in developer surveys, and showing no B2B or B2C product traction puts the correct comp set at neoclouds plus X-distribution, not OpenAI or Anthropic. Anyone offering xAI at frontier-lab multiples in a secondary is mis-marked.

    The counter-thesis, and it deserves a hearing, is that xAI keeps the Grok roadmap and uses lease revenue to fund the next training run, at which point this was financing, not concession. That reading needs you to ignore the talent walking out the door (Cursor is already buying xAI engineers) and the benchmark lag, which is a lot to ignore.

    Where Alpha Persists

    The scarcity thesis holds. The margin structure underneath it is ugly. Alpha has migrated off GPU neoclouds, where the cash burn argues for multiple compression, into the adjacent layers: Nvidia ecosystem (networking, optics, liquid cooling), energy-for-AI with signed hyperscaler offtake, and inference silicon with model-lab design wins. The picks-and-shovels trade, or rather the more interesting version of it.

    Action items

    • Re-underwrite any xAI/Grok exposure in the book as an infrastructure-plus-distribution play, not a frontier-model play, by end of week
    • Pull up active neocloud/GPU-as-a-service deals in pipeline and re-underwrite at 2-3x forward revenue ceiling, modeling perpetual capex > operating cash flow
    • Require signed hyperscaler offtake as a Series B gating criterion for any energy-for-AI deal; discount 40-60% from Fervo comps without one

    Sources:The Pragmatic Engineer · The Information AM · Bloomberg Technology · Martin Peers · Katie Roof · Morning Brew

  4. 04

    AI Security's KEV Moment: The Category Just Got a Budget Line

    The Legitimation Event

    LiteLLM, an AI gateway routing layer that quietly sits inside a lot of enterprise AI deployments, just landed on CISA's Known Exploited Vulnerabilities catalog. This is the first time AI-specific infrastructure has been federally flagged as actively exploited. The interesting part is not the bug. It is that 'AI security' stops being a pitch slide and becomes a procurement line with regulatory backing, which is a different sales motion entirely. Add Ollama's GGUF model-loader bug (CVSS 9.1, data exfiltration via malicious model files) and six critical CVEs in OpenClaw, and the AI-infra stack is roughly where enterprise SaaS sat in 2014, security-maturity-wise. That comparison flatters nobody.

    In the same week, Microsoft's MDASH system shipped sixteen validated Windows vulnerabilities in a single Patch Tuesday, which is the first production evidence (rather than the usual demo) that multi-model AI vulnerability discovery actually works at scale. PraisonAI's auth-bypass CVE was weaponized within four hours of disclosure. AI-agent frameworks are now active attacker targets, and the discovery side is no longer theoretical either.


    Platform Disintermediation Begins

    OpenAI launched Daybreak with Cloudflare, Cisco, CrowdStrike, Palo Alto Networks, Oracle, Zscaler, Akamai, and Fortinet listed as 'partners.' This is the classic platform-eats-partners setup, or rather the version of it where the partners know exactly what is happening and show up anyway because not showing up is worse. In the same week, TrustedSec showed that LLMs can reverse-engineer all five major commercial EDRs in days, pulling YARA rules, scoring thresholds, and exclusion lists out of products whose entire moat was the opacity of their detection logic. The moat was opacity. Opacity does not survive contact with a sufficiently patient model.

    DevelopmentWhat It KillsWhat It Creates
    LiteLLM on KEV'AI security is hypothetical' objectionAI-gateway firewalls, model-artifact scanning
    MDASH 16 CVEsScanner vendors' research moatAutonomous AppSec (XBOW, ZeroPath class)
    EDR reverse engineeringDetection-rule IP as moatBehavioral/runtime detection premium
    Daybreak + 8 partnersStandalone security point solutionsPlatform-integrated security primitives

    The Bifurcation Worth Funding

    DepthFirst's Open Defense Initiative claims ten times the cost efficiency of Anthropic's Mythos on vulnerability discovery, finding twelve memory corruption bugs in FFmpeg for roughly a thousand dollars while Mythos missed them across hundreds of scans at around ten thousand. This is probably wrong in detail, but the shape is right: a specialized-harness-versus-frontier-model split that supports two viable businesses, very cheap or very expensive. The middle, as usual, gets squeezed.

    The Mythos angle has a separate thesis. Clearing both UK AISI attack ranges and earning NSA access (not CISA) routes Anthropic toward IC-led procurement, which prices and transacts on a completely different curve. Cyber-AI is being priced as an AI feature and sold as a defense-tech category, and the Series A/B window closes on whichever framing the market agrees to first. The market has not decided yet.

    Action items

    • Run a portfolio-wide exposure sweep for LiteLLM, Traefik, Argo CD, and PraisonAI by end of week — four of these are in KEV and actively exploited
    • Pull forward AI-security deal diligence by one quarter — specifically AI-gateway security, model-artifact scanning, and LLM-runtime sandboxing at Series A
    • Add OpenAI Daybreak to the competitive tracker for every security portfolio company; require founders to articulate why their product isn't a Daybreak connector in 18 months

    Sources:SANS AtRisk · CyberScoop · TLDR InfoSec · Clint Gibler · The Hacker News · The Information AM

◆ QUICK HITS

  • Update: Cerebras closed day-1 at $311 (70% pop) vs $89 Tiger entry — Eclipse nets 17x, Benchmark's $225M SPV returned ~$300M but lowered the fund multiple; first AI-hardware liquidity event sets permanent late-stage comps

    Katie Roof

  • Abridge raised $550M at $5.3B valuation with 250 health systems and 80M+ annual patient conversations — the ambient clinical documentation category is closed; alpha now sits in payer-side prior auth and nursing workflows

    Latent.Space

  • Apple executing serial creator-tools roll-up (Pixelmator → MotionVFX → Color.io) — telegraphing a vertically-integrated Creator Studio that attacks Adobe's moat from the OS layer; scout remaining indie acquisition targets

    TLDR Design

  • Only 15% of enterprises have data foundations for agentic AI despite spending millions (Fivetran index) — data quality and lineage cited as #1 blocker by nearly half of respondents; greenfield category window

    TLDR Data

  • Anthropic's Mythos cleared both UK AISI attack ranges (first model to do so) with Congress routing access through NSA, not CISA — offensive/IC-led procurement, not civilian defense distribution

    CyberScoop

  • a16z published 10-principle AI liability framework while deploying $115.5M in midterm political spend — absolute-liability proposals would make open-source model distribution uninsurable; stress-test portfolio with 15-25% litigation-reserve scenario

    a16z AI Policy Brief

  • Tech layoffs hit 103K YTD by May 14 vs 124K for all of 2025 — LinkedIn cut 5%, Cisco cut 4,000 while stock popped 15% on AI orders; the AI-driven margin expansion cycle is live in public markets

    Techpresso

  • DuckDB shipped Quack client-server protocol breaking out of embedded-only positioning — direct threat to Spark/Glue-heavy ETL vendors on sub-TB workloads; audit portfolio for displacement risk

    TLDR Data

◆ Bottom line

The take.

Anthropic killed the subscription arbitrage powering most Claude-wrapper business models the same week Ramp confirmed it overtook OpenAI in enterprise — but ServiceNow blowing its full-year budget by May reveals the revenue quality underneath that $900B mark has consumer-grade stickiness with zero governance infrastructure. The alpha has rotated: short the model-layer revenue premium, long the observability and agent-governance layer forming in the gap, and triage every Claude-dependent portfolio company's gross margins before their founders realize the economics changed four days ago.

— Promit, reading as Investor ·

Frequently asked

How should I reprice Claude-dependent portfolio companies after the API credit conversion?
Apply a 20-40% runway haircut to any company whose unit economics assumed subscription-tier arbitrage, where $20/month plans bought $200 of inference. The dollar-matched credit pool eliminates that gap entirely. Request updated gross-margin models this week, since most founders haven't flagged the change yet, and add a reversibility discount on ARR multiples where SLAs and telemetry are absent.
What does ServiceNow exhausting its annual Claude budget by May actually signal?
It exposes that Anthropic's enterprise revenue lacks the governance, telemetry, and switching costs that justify SaaS-grade retention assumptions. One of the most sophisticated buyers on earth blew through a full-year budget without procurement noticing, because there is no per-user dashboard or meaningful SLA. That is consumer-grade stickiness underwriting a $900B mark, and it creates a standalone category for AI observability and FinOps.
Why is xAI leasing Colossus 1 to Anthropic the important signal, not the headline deal?
Rivals don't rent 220,000+ GPUs from declared enemies during a compute glut — they do it during a structural shortage. Anthropic modeled 10x growth and got 80x, which is why Pro revocations and Claude Code nerfs are capacity triage, not product strategy. It also reframes xAI's correct comp set as neocloud-plus-distribution rather than frontier lab, meaning secondaries pricing it at OpenAI/Anthropic multiples are mis-marked.
Where does alpha sit in agent infrastructure given the 12-18 month absorption window?
In narrow vertical wedges that are too small for SAP, Notion, or Salesforce to prioritize: agent identity and governance, vertical orchestration with institutional context, multi-model routing, and iOS-independent agent runtimes ahead of Apple's WWDC governance reveal. Horizontal agent platforms get absorbed by incumbents within a quarter or two; the SAP €100M fund and Notion's External Agents API are the activation signals. Series A pricing in these wedges is still rational but won't be by Q3.
What changes now that LiteLLM is on CISA's KEV catalog?
AI security stops being a pitch-deck category and becomes a procurement line with federal backing, which is the first defensible enterprise budget justification for AI-gateway firewalls, model-artifact scanning, and LLM-runtime sandboxing. Pull forward Series A diligence in the category by a quarter. Also run a portfolio-wide exposure sweep immediately for LiteLLM, Traefik, Argo CD, and PraisonAI — these are confirmed actively exploited, and a single portco breach destroys more LP goodwill than a missed deal creates.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

Keep reading.