Investor daily

Edition 2026-05-17 · read as Investor

AnthropicRents220KGPUsFromxAI,KillsClaudeArbitrage

Sources
36
Words
1,832
Read
9min

Topics Agentic AI AI Capital LLM Inference

◆ The signal

Anthropic leased two hundred and twenty thousand GPUs from xAI's Colossus 1, which is to say from its sworn enemy, in the same week it passed OpenAI on enterprise spend (34.4% to 32.3%, per Ramp) and quietly converted every subscription into dollar-matched API credits. That last move closes the 70-90% arbitrage that funded most Claude-wrapper startups. Two readings: real demand forcing awkward bedfellows, or a tidy pre-IPO margin sweep. Probably both. Either way, every Claude-dependent name in the book has worse unit economics this Monday than last Friday.

◆ INTELLIGENCE MAP

  1. 01

    Enterprise AI Leadership Flipped — Revenue Quality Didn't Follow

    act now

    Ramp data shows Anthropic at 34.4% vs OpenAI's 32.3% of B2B spend — the first documented lead change. But ServiceNow blew its full-year Claude budget by May because Anthropic ships zero SLAs, zero granular telemetry. Enterprise share is real; enterprise-grade infrastructure is not.

    34.4%
    Anthropic enterprise share
    14
    sources
    • Anthropic B2B share
    • OpenAI B2B share
    • OpenAI YoY growth
    • Anthropic YoY growth
    1. Anthropic34.4
    2. OpenAI32.3
  2. 02

    Coding Agent Subsidy War Kills Wrapper Economics Overnight

    act now

    Anthropic converted subscriptions to dollar-matched API credits, eliminating the 70-90% cost arbitrage third-party harnesses exploited. OpenAI countered with 2 months free Codex for enterprise switchers. Claude Code shipped autonomous /goal mode. Standalone coding-agent startups (Cognition, Factory) just lost their pricing moat and their product differentiation in the same week.

    70-90%
    arbitrage eliminated
    5
    sources
    • Arbitrage killed
    • Codex free period
    • Anthropic Oct IPO target
    • Credit match date
    1. Wrapper COGS (before)10
    2. Wrapper COGS (after)85
  3. 03

    Agent Economy Reaches 59% Majority — Category Is Investable Now

    monitor

    Vercel production data across 200K+ teams shows agentic workloads at 59% of token volume. Anthropic captures 61% of spend (premium reasoning), Google takes 38% of volume (commodity throughput). SAP committed €100M to autonomous enterprise fund. a16z published system-of-intelligence thesis with live Stitch investment. Agent infra is the majority use case, not a speculative thesis.

    59%
    agentic token share
    8
    sources
    • Agentic workloads
    • Anthropic spend share
    • Google volume share
    • SAP agent fund
    1. Agentic workloads59
    2. Traditional chat/completion41
  4. 04

    AI Security Crosses First KEV Threshold — Category Has Budget Codes

    monitor

    LiteLLM (AI gateway) hit CISA's Known Exploited Vulnerabilities catalog — the first AI-infra component actively exploited in the wild. Separately, Mythos became the first model to clear both UK AISI attack ranges, with NSA (not CISA) winning access. DepthFirst claims 10x cost advantage over Mythos. AI security is no longer a thesis; it has KEV entries, Congressional briefings, and a $40B deepfake TAM anchor.

    $40B
    deepfake loss by 2027
    7
    sources
    • AISI ranges cleared
    • LiteLLM status
    • DepthFirst cost edge
    • Cyber task doubling
    1. LiteLLM hits KEVFirst AI-infra on exploited list
    2. Mythos clears AISIFirst model to pass both ranges
    3. NSA gets accessIC procurement, not civilian
    4. DepthFirst 10xSpecialized harness economics proven
  5. 05

    Neocloud Unit Economics Revealed — Structurally Cash-Negative

    background

    Nebius printed 684% YoY revenue growth but disclosed $2.47B capex against $2.26B operating cash — proving neoclouds are structurally cash-negative even at hypergrowth. The Colossus 1 lease (Anthropic renting from xAI) validates GPU scarcity but also confirms that neocloud multiples should compress from 8-10x to 2-3x forward revenue. The alpha has migrated to Nvidia adjacencies and hyperscaler-contracted energy.

    684%
    Nebius revenue growth
    5
    sources
    • Nebius Q1 growth
    • Capex vs cash
    • Cisco AI orders
    • Fervo IPO pop
    1. Nebius Capex2.47
    2. Nebius Op Cash2.26
    3. Nebius Revenue0.4

◆ DEEP DIVES

  1. 01

    Enterprise AI Revenue Is Real — Enterprise AI Infrastructure Is Consumer-Grade

    The Flip Is Real. The Foundation Beneath It Is Not.

    Ramp's April card-spend data puts Anthropic at 34.4% of US B2B spend versus OpenAI's 32.3%, the first documented lead change in any credible enterprise AI dataset. Over the trailing year Anthropic quadrupled business adoption while OpenAI grew three-tenths of a percent. The 0.3% deserves its own sentence. It is not a slowdown in a flat market. It is a stall while the category grew roughly four times around it.

    Anthropic also admitted it planned for 10x growth and got 80x, then leased the entire Colossus 1 cluster (220K+ NVIDIA GPUs) from Elon Musk's xAI, the same Musk who publicly called Anthropic 'misanthropic and evil.' When a competitor rents compute from a sworn enemy, the supply situation is not a glut. It is a shortage that bends strategy, and what Anthropic is not doing with that capital is building the next cluster itself.


    The Revenue Quality Problem Nobody Is Discussing

    The share number obscures a structural fragility. ServiceNow, one of the most sophisticated enterprise software buyers on earth, blew through its full-year Anthropic budget by May 2026. Not because Claude underdelivered. Because Anthropic provides:

    • Zero granular per-user, per-tool usage telemetry
    • Zero enterprise SLAs worth the name
    • No enterprise dashboard that would embarrass a mid-tier SaaS vendor from 2014

    National Life Group's CIO put it without ornament: Anthropic is 'great for consumer usage but not great for companies.'

    Enterprise AI ARR is not SaaS ARR. Switching costs are near zero, and the budget overruns are invisible until the invoice arrives.

    The Deployment Gap Creates the Alpha

    Every major player is now running the Palantir playbook. Google Cloud is hiring hundreds of forward-deployed engineers. OpenAI stood up DeployCo with Bain Capital and acquired a consulting firm for its 150-FDE starting roster. Salesforce and ServiceNow are staffing the same function. The industry has quietly conceded that deployment is the bottleneck, not model capability.

    Which means the $30B ARR figure is real but reversible. The quality of that revenue, measured by lock-in, telemetry, and contractual switching costs, is consumer-grade at best. There is a version where this gets cleaned up before a Q4 IPO, which is plausible and boring if it happens. There is a version where the gap persists long enough for an AI observability category to fill it, which is the more interesting one. This is probably wrong, but the second version is the base case. The bull case is not crazy. Read the next ServiceNow invoice before deciding which one you are underwriting.

    Action items

    • Demand SLA and usage-telemetry roadmap from every portco selling into enterprise AI buyers by end of month
    • Open sourcing sprint on AI observability / FinOps-for-AI at Seed-Series A within 30 days
    • Apply 20-40% 'reversibility discount' to any LLM-layer ARR multiple where SLAs and telemetry are absent
    • Re-underwrite xAI/Grok exposure as infrastructure + X-distribution play, not frontier lab

    Sources:The Pragmatic Engineer · Laura Bratton · TLDR AI · StrictlyVC · Morning Brew · Bloomberg Technology

  2. 02

    The Coding Agent Pricing War Just Reset Every Wrapper's Unit Economics

    What Happened

    On May 12-13, Anthropic and OpenAI repriced the coding-agent market within hours of each other, which is the sort of coincidence that isn't. Anthropic converted every Claude subscription into a dollar-matched API credit pool, so a two hundred dollar plan now buys two hundred dollars of programmatic tokens, and the seventy to ninety percent arbitrage third-party harnesses had been running quietly disappears. OpenAI answered the same afternoon with two months of free Codex for enterprise switchers inside a thirty-day window.

    Context: Ramp's April print showed Anthropic leading OpenAI for the first time, 34.4 against 32.3 percent, a new CFO is in the seat, and the IPO is probably October. The honest read is narrow. Margin recovery dressed as developer defense and timed to pre-IPO diligence.


    Who Gets Hurt

    Any coding tool running COGS against subscription tokens has lost twenty to forty percent of effective runway since Friday. The arbitrage was unsubtle: buy a two hundred dollar a month subscription, push thousands of API-equivalent calls through the UI, pay seventy to ninety percent below list. That trade is closed.

    Meanwhile Claude Code shipped /goal and Auto Mode, which removes the human-in-the-loop steps entirely. The product surface that standalone autonomous agents (Cognition's Devin, Factory, Cosine) were selling against is now a free CLI primitive. The rounds those companies raised were priced on features that just became table stakes.

    PlayerPre-Change PositionPost-Change Position
    Third-party harnesses (Cline, OpenCode)70-90% cost advantage via subscription abuseFull API-rate pricing; COGS jumps 3-9x
    Autonomous agents (Devin, Factory)Product differentiation on autonomyFeature parity with free Claude Code /goal
    Multi-model routing (Cursor)Provider-agnostic moatStrengthened — security/bounded execution is real moat

    The Squeeze From Above and Below

    This is probably wrong in the timing, but the cleanest framing is a two-sided squeeze. From above, Anthropic meters tokens and OpenAI subsidizes switches. From below, Notion's External Agents API now hosts Claude, Codex, Cursor, Decagon, Warp, and Devin inside the same workspace, which commoditizes the harness layer. Anyone thin-wrapping a frontier model now eats cost inflation and distribution compression at once.

    The coding-agent thesis is now a duopoly subsidy fight with a commoditized harness layer beneath. Every portco priced on a Claude subscription arbitrage is worth less today than it was last Friday.

    Three positions survive: proprietary workflow data, open-source distribution on Cline's model, or bounded enterprise execution on Cursor's security moat. The counter-thesis is that one of the harnesses quietly accumulates a real workflow datamoat before the subsidy ends, which is possible and not obvious. Anything that doesn't clear one of the three is a pass.

    Action items

    • Request updated gross margin models from every Claude-dependent coding tool portco by end of week — model the arbitrage as permanently gone
    • Re-underwrite any autonomous coding agent deals in pipeline against Claude Code's /goal + Auto Mode baseline
    • Accelerate Anthropic pre-IPO secondary positioning before September
    • Open diligence on training-efficiency infra (Nous TST 2-3x speedup, Datology 17x less VLM compute) at current pre-consensus valuations

    Sources:AINews · Daily Dose of DS · ben's bites · TLDR AI · The Pragmatic Engineer

  3. 03

    Agent Economy at 59% — The Infrastructure Layer Is Investable Before SAP and ServiceNow Absorb It

    The Data Point That Changes the Thesis

    Vercel published what is, to my knowledge, the first production-grade AI usage index across 200K+ teams, and the headline is that agentic workloads carry 59% of all token volume. This is not a lab benchmark or a self-reported ARR slide. It is downstream production data, which is rarer than it sounds. The chat-completion era is now a minority revenue line.

    The interesting cut is underneath. Anthropic captures 61% of spend, mostly via Opus on long, tool-using agent calls, while Google takes 38% of volume via Flash on cheap, high-throughput commodity work. Two different businesses are now visible inside the thing we have been calling 'foundation models,' and they probably should not trade on the same multiple.


    Incumbents Are Moving First This Time

    The cloud transition was defined by startups, with the incumbents catching up late and overpaying. The pattern this round looks different. The ERPs and workflow platforms are moving first, which is worth flagging because it changes the resource math for anyone underwriting a greenfield platform play:

    • SAP committed €100M to an Autonomous Enterprise partner fund wired into NVIDIA and Microsoft
    • ServiceNow shipped Action Fabric, decoupling logic from UI and exposing workflows as headless APIs for agents to consume
    • a16z published its system-of-intelligence thesis with a live Stitch investment, arguing the moat is shifting from data to orchestration

    The Lemkin case makes it concrete. Seats fell from 10+ humans to 2 humans plus 1 API seat, while spend rose 83% ($12K→$22K) with 20+ agents running on top. Seat count collapsed. The bill went up.

    Value in GTM software is migrating from data gravity to orchestration gravity, and the next 12-18 months is the last clean window to own the intelligence layer before incumbents or consensus close it.

    Where the Alpha Sits

    Three layers, three different capital dynamics, and only one of them is obviously defensible.

    LayerStageEntry Window
    Agent orchestration (standalone)Fragmented; LangChain, CrewAICompression risk — M&A exits preferred
    Agent infra picks-and-shovels (MCP, identity, governance, observability)Greenfield — Seed/A pricing rationalHighest alpha; 2-3 quarters before platform absorption
    Vertical workflow agents (narrow, structured)Category formationPrime entry window — too narrow for incumbents to prioritize

    Abridge at $5.3B is the vertical template, or rather the most legible version of it: 250 health systems, 80M+ conversations a year, wedging from documentation into prior auth and clinical decision support. The moat is proprietary interaction data per use, not model quality. Health systems compressed release cycles from quarterly to monthly for them, which is faster than regulated buyers usually move.

    This is probably wrong, but the timing question splits cleanly. a16z's thesis implies a 12-18 month window. ServiceNow's Action Fabric and SAP's fund suggest incumbents compress that to 6-9 months at the platform layer. The vertical layer holds longer, because incumbents will not prioritize narrow markets until someone embarrasses them in one.

    Action items

    • Source 3-5 agent infrastructure deals in MCP tooling, agent identity, and agent observability this quarter
    • Request Vercel AI Gateway production index as a recurring diligence data source — use spend/volume split as benchmark for every model-layer pitch
    • Update vertical AI underwriting framework: require per-use data flywheel, ≥2 monetization vectors, and explicit non-compete posture vs dominant system-of-record
    • Run portfolio revalidation on every seat-based SaaS position — model seat value at +30% and -30% scenarios using Lemkin's template

    Sources:ben's bites · TLDR AI · a16z · TLDR IT · TLDR · Laura Bratton

  4. 04

    AI Security Exits Thesis Stage — LiteLLM on KEV and Mythos at NSA Create Two Distinct Funding Opportunities

    AI security gets its first KEV entry

    LiteLLM, the LLM-routing control plane, landed on CISA's Known Exploited Vulnerabilities catalog this week. That is the first time AI infrastructure has been federally flagged as actively exploited, which matters less for what it says about LiteLLM than for what it does to procurement budgeting. Separately, Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges, and the House Homeland Security Committee routed access through NSA, not CISA. That last detail is the tell. Offensive and IC-led procurement, not civilian defense.

    Stack on Ollama's GGUF model-loader bug at CVSS 9.1, OpenClaw shipping six critical CVEs in a single cycle, and a PraisonAI auth-bypass weaponized within four hours of disclosure. The AI-infra stack is roughly where enterprise SaaS sat in 2014.


    Two markets under one label

    Sources describe a bifurcation in AI security economics that most investors are still pricing as one category:

    SegmentEconomics SignalProcurement PathEntry Timing
    AI-native vulnerability discoveryDepthFirst: 10x cost advantage over Mythos ($1K vs $10K for same FFmpeg bugs)Enterprise / DevSecOpsSeries A window open now
    AI-powered offensive/IC toolsMythos NSA access; $40B deepfake TAM by 2027IC/DoD — TS/SCI requiredDual-use Series A/B
    AI infrastructure defenseLiteLLM on KEV; PraisonAI 4-hour exploitEnterprise CISO budgetsCategory-defining — pre-consensus
    EDR incumbentsAll 5 tested share identical building blocks; AI extracts detection logic in daysLegacy renewal cyclesMultiples should compress

    OpenAI's Daybreak launch with 8 incumbent 'partners' (Cloudflare, Cisco, CrowdStrike, PANW, Oracle, Zscaler, Akamai, Fortinet) is the setup pattern of a platform preparing to disintermediate its partners. That move has played out in four prior waves of enterprise software. This is probably wrong, but the partners usually sign anyway, because the alternative is sitting outside the platform while a competitor sits inside it.

    Cyber-AI is pricing as an AI feature and transacting as a defense-tech category. The Series B window closes on whichever of those framings the market agrees to first.

    The Mozilla-vs-curl test

    One quality signal worth keeping. Mozilla found 271 bugs in Firefox using a custom agentic harness on Mythos Preview + Claude Opus 4.6. Mythos alone found 1 real CVE in curl, and the maintainer called it "primarily marketing." Harness design beats model selection, or rather, the more honest version of that claim: any AI AppSec founder who cannot explain why their harness investment produces 271 bugs instead of 1 is selling the model, not the product. Use it as a diligence filter.

    Action items

    • Run portfolio-wide exposure sweep for LiteLLM, Ollama, Traefik, and Argo CD versions — escalate any internet-exposed instances this week
    • Build target list of 5-8 AI-native identity/deepfake defense companies at Series A/B and initiate conversations before the $40B TAM number enters consensus
    • Request DepthFirst data room and validate the 10x cost claim against Mythos on 2-3 non-FFmpeg codebases before the round re-rates
    • Add 'Daybreak feature absorption' to the competitive tracker for every security portfolio company — require 2-sentence answer on defensibility by next board meeting

    Sources:CyberScoop · TLDR InfoSec · Clint Gibler · The Information AM · The Hacker News · SANS AtRisk

◆ QUICK HITS

  • Update: Cerebras closed day one at $311 vs $89 last private round (70% pop, ~$41.7B market cap) — Eclipse netted 17x, Tiger sitting on $1B paper gain; use as permanent comp for Modal ($4.5B), but note 70% first-day gaps historically retrace before lock-up expiry

    Katie Roof

  • Nebius guided $3.0-3.4B 2026 revenue (~6x YoY) with 4+ customers bidding per GPU — but $2.47B capex exceeds $2.26B operating cash, proving neoclouds are structurally cash-negative; compress private multiples from 8-10x to 2-3x forward revenue

    Martin Peers

  • Abridge raised $550M at $5.3B serving 250 health systems with 80M+ annual conversations — health systems compressed release cycles from quarterly to monthly; the ambient scribe category is closed, but payer-side prior auth and nursing workflows remain investable

    Latent.Space

  • Benchmark raised a $225M SPV to defend Cerebras ownership into Tiger's round — early-stage purity is dead in AI; 93% of Benchmark's cost basis was late-stage adds that lowered the fund multiple but added $300M cash

    Katie Roof

  • Google's Gemini Intelligence embeds autonomous task execution directly into Android OS (97%+ share markets like India, summer 2026 rollout) — stress-test every mobile AI agent portco against 'ships as OS feature' scenario before WWDC

    Simplifying AI

  • CLARITY Act passage probability collapsed from 80% to 55% on Polymarket after Armstrong called the draft 'unworkable' — structure binary hedge across any stablecoin/COIN exposure; vol is underpriced given the dislocation

    TLDR Crypto

  • Only 15% of enterprises have data foundations ready for agentic AI at scale (Fivetran index) despite spending millions — data quality and lineage cited as #1 blocker by nearly half; this is the cleanest picks-and-shovels TAM signal since early Snowflake era

    TLDR Data

  • DuckDB shipped Quack client-server protocol — directly targets single-node workloads that Spark/Glue currently over-serve; ask every sub-TB ETL portco what percentage of ARR is vulnerable to displacement

    TLDR Data

◆ Bottom line

The take.

Anthropic overtook OpenAI in enterprise spend this week (34.4% vs 32.3%), then killed the 70-90% cost arbitrage that funded most Claude-wrapper startups while simultaneously leasing 220K GPUs from its sworn enemy because it grew 80x instead of 10x. The three trades to execute this week: re-underwrite every Claude-dependent portco's gross margins (they're worse), open AI observability sourcing (ServiceNow blew its annual budget by May because no one built the monitoring layer), and shift agent-infra capital from standalone orchestration into picks-and-shovels before SAP's €100M fund and Notion's dev platform absorb the category.

— Promit, reading as Investor ·

Frequently asked

What does Anthropic's subscription-to-API-credit conversion actually do to Claude wrapper economics?
It eliminates the 70-90% arbitrage that wrappers were running by pushing API-equivalent traffic through subscription UIs. A $200/month plan now buys exactly $200 of programmatic tokens at list rates, so any coding tool whose COGS was based on subscription abuse just saw effective runway shrink 20-40% and gross margins compress 3-9x overnight.
Why would Anthropic lease 220K GPUs from xAI, a company whose founder publicly attacks it?
Because Anthropic planned for 10x growth and got 80x, and Colossus 1 was the only cluster of that size available on the timeline required. The strategic read is that compute scarcity is bending strategy harder than rivalry — and that Anthropic is choosing to rent rather than build the next cluster itself, which is a signal about capital allocation ahead of the IPO.
How should the 34.4% vs 32.3% Ramp number be weighted given enterprise switching costs?
It should be weighted as real but reversible. Anthropic provides minimal usage telemetry, weak SLAs, and a dashboard well below 2014-era SaaS standards, and ServiceNow blew through its full-year Anthropic budget by May. Apply a 20-40% reversibility discount to LLM-layer ARR multiples where SLAs and telemetry are absent.
What's the cleanest way to position around an October Anthropic IPO?
Treat the May repricing as classic pre-book margin cleanup and move on secondary positioning before September book-building begins. The combination of a new CFO, the credit conversion closing the arbitrage, and OpenAI's defensive Codex giveaway all point to a deliberate margin sweep being staged for diligence.
Where does the alpha sit if incumbents are moving first in the agent economy?
In agent infrastructure picks-and-shovels — MCP tooling, agent identity, governance, and observability — where Seed/Series A pricing is still rational and SAP's €100M fund hasn't yet activated corp dev consensus. Vertical workflow agents with proprietary per-use interaction data, on the Abridge template, are the second viable lane; standalone orchestration is compression risk.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

Keep reading.