Edition 2026-05-31 · read as Investor
AnthropicPricingResetBreakstheClaudeWrapperThesis
- Sources
- 36
- Words
- 2,011
- Read
- 10min
Topics Agentic AI AI Capital LLM Inference
◆ The signal
Anthropic's June 15 pricing change closed the seventy-to-ninety percent subscription arbitrage the third-party Claude tools were quietly running on, which is to say every Claude-dependent wrapper in the portfolio woke up last Friday with a different unit economics deck. ServiceNow, separately, burned its full-year Anthropic budget by May with no enterprise telemetry or SLAs to slow it down — call that reversible spend rather than SaaS ARR, or at least the more interesting version of that argument. The thesis could be wrong if margins re-expand at the model layer, but eighty times revenue marks assume a structure that, as of last week, no longer exists.
◆ INTELLIGENCE MAP
01 Enterprise AI Revenue Quality Is Structurally Fragile
act nowServiceNow exhausted its full-year Anthropic budget by May due to zero usage telemetry. Anthropic's June 15 credit unbundling kills 70-90% arbitrage for Claude wrappers. No SLAs, no per-user dashboards, reversible spend. Enterprise AI ARR ≠ SaaS ARR — apply 20-40% reversibility discount.
- Anthropic B2B share
- OpenAI B2B share
- Budget blown by
- Pricing change date
- Anthropic B2B34.4
- OpenAI B2B32.3
02 Agentic Workloads Hit 59% — Platform Absorption Accelerates
monitorVercel's production gateway shows agentic workloads now carry 59% of all token volume. Anthropic captures 61% of spend while Google takes 38% of volume — two different businesses inside 'foundation models.' Notion, SAP (€100M fund), and Airtable ($10M credits) are absorbing the agent-hosting layer, compressing standalone orchestration startups.
- Anthropic spend share
- Google volume share
- SAP agent fund
- Hyperagent credits
03 AI Infrastructure Security Crosses KEV Threshold
monitorLiteLLM's AI gateway landed on CISA's KEV catalog — first LLM-routing control plane flagged as actively exploited. Microsoft MDASH shipped 16 validated CVEs in one Patch Tuesday. DepthFirst claims 10x cost efficiency over Anthropic Mythos. PraisonAI exploited within 4 hours. AI-sec is now a budget line, not a pitch slide.
- LiteLLM status
- PraisonAI exploit time
- DepthFirst cost advantage
- Microsoft patches
04 Vertical AI Data Moats Validated — Horizontal Wrappers Compress
monitorAbridge raised $550M at $5.3B with 250 health systems and 80M+ conversations — an unreplicable data moat. Claude for Small Business ships QuickBooks/HubSpot connectors, killing generic SMB AI. a16z's GTM thesis explicitly says orchestration gravity > data gravity. The bar for new AI deals is proprietary per-use data flywheels, not model access.
- Abridge health systems
- Annual conversations
- Lemkin seat reduction
- Lemkin spend increase
05 GTM Software Value Migrates from Record to Intelligence Layer
backgrounda16z published its thesis and backed Stitch: system of intelligence captures majority of next decade's GTM enterprise value. Lemkin's proof point — 2 human seats replacing 10+, spend up 83%, 20+ agents running — is the unit-economics template. Salesforce ($140B) becomes infrastructure consumed at API layer. Investable window: 12-18 months before incumbents or consensus close it.
- Salesforce market cap
- Seats reduced
- Agents deployed
- Window remaining
- Human seats2
- AI spend22000
◆ DEEP DIVES
01 Enterprise AI ARR Is Not SaaS ARR — The Revenue Quality Problem Nobody Is Pricing
The Structural Problem
ServiceNow, which is one of the more sophisticated enterprise software buyers on the planet, burned through its full-year Anthropic budget by May 2026. Not because Claude underdelivered. Because Anthropic offers no per-user telemetry, no granular usage dashboards, and no SLAs worth printing. The National Life Group CIO put it about as plainly as a CIO ever puts anything: Anthropic is 'great for consumer usage but not great for companies.'
Then on May 12 Anthropic converted every Claude subscription into a dollar-matched API credit pool, which is to say it killed the 70-90% arbitrage the wrapper crowd (Cline, OpenCode, dozens more) had been quietly running. OpenAI countered inside the same news cycle with two months of free Codex for enterprise switchers. The coding-agent category is now in an open subsidy war, conveniently timed to Anthropic's likely October IPO.
Enterprise AI spend reverses quickly once cost-efficient alternatives land. No SLAs + no telemetry + no switching costs = a cliff-shaped revenue risk profile.
What This Means for Your Book
Ramp has Anthropic at 34.4% of business spend against OpenAI at 32.3%, which is the first documented lead change and is being read by some as a regime shift. It is not. Billing share is not the same as durable revenue, and three things make enterprise AI ARR structurally unlike SaaS:
- Zero contractual lock-in: no multi-year contracts, no SLAs, no data gravity in most deployments
- Budget opacity: customers cannot see per-user or per-workflow spend until the bill arrives
- Instant reversibility: the Ramp data itself shows vendor share flipping on each model release cycle
The FDE land-grab is the tell. Google is hiring hundreds of forward-deployed engineers, OpenAI stood up DeployCo with Bain, Salesforce and ServiceNow are staffing the same function. When four firms independently conclude the margin sits in deployment rather than the model, the margin sits in deployment rather than the model.
The Arbitrage That Just Died
Any portfolio company running COGS against Claude subscription tokens lost somewhere between 20% and 40% of effective runway since Friday. The June 15 change is explicit: programmatic usage bills at API rates. Founders may not have flagged it yet because the change is days old. This is a triage call for this week, not a line item for next quarter's board deck.
Where the Alpha Moves
The displacement trade is unusually clean, or rather, the more interesting version of it is: short the model layer's revenue-quality premium, long the observability and deployment layer that fixes what Anthropic will not build. This is probably wrong in at least one direction, but here is the shape.
- AI observability and FinOps-for-AI: token-level cost attribution, per-user spend caps, SLA monitoring. ServiceNow's AI Control Tower is the first comp and there is no independent category winner yet.
- Deployment services tooling: productize 60% of FDE work, meaning context ingestion, custom eval harnesses, workflow templates. Palantir alumni are the sourcing pool.
- Vertical AI with contractual lock-in: in a world where horizontal spend is reversible, vertical AI with data moats and compliance integration becomes structurally more valuable.
Action items
- Request updated gross-margin models from every Claude-dependent portfolio company assuming API-rate billing replaces subscription arbitrage
- Build AI observability/FinOps sourcing sprint targeting Seed-Series A companies with token-level attribution
- Apply a 20-40% 'reversibility discount' to any portfolio LLM-layer ARR where SLAs and telemetry are absent
- Demand SLA and usage-telemetry roadmap from Anthropic/OpenAI in next board cycle for any model-dependent portco
Sources:Anthropic has an enterprise gap · Anthropic squeezing its pre-IPO round is interesting · Anthropic is now at thirty billion dollars of annualized revenue · Anthropic at $900B > OpenAI's $852B: your AI cap table just got repriced
02 Agent Infrastructure Is Consolidating Into Platforms — 12-Month Window Before Absorption
The Data Point That Changes the Frame
Vercel's first production-grade AI Gateway index covers more than 200,000 teams, which is a sample large enough that the headline number stops being a vendor anecdote: agentic workloads now carry 59% of all token volume. The majority case in production is no longer human-in-the-loop chat. The bifurcation has been visible in earnings transcripts for two quarters. The index just makes it numerical, and the spend-volume split shows two businesses hiding inside the phrase 'foundation models'.
Metric Anthropic Google Share of spend 61% (via Opus) ~15% Share of volume ~25% 38% (via Flash) Business model Premium reasoning Commodity throughput Anthropic is being paid a premium for the long tool-using calls that agents generate, the ones that happen to be expensive. Google absorbs cheap high-throughput work that looks excellent in a volume chart and considerably less excellent in a gross margin table. Multi-model routing is now the enterprise default, which validates the orchestration layer and quietly retires any thesis still underwriting a single-model moat.
Platforms Are Moving First
Three platform moves landed in one quarter. Anthropic's June 15 third-party credit unbundling recovered margin the resellers had been sitting on. Notion's developer platform now hosts Claude, Codex, Cursor, Decagon, Warp, and Devin as teammates inside a surface enterprises already pay for. SAP's €100M Autonomous Enterprise fund wires NVIDIA and Microsoft into the platform layer with capital the standalone vendors do not have. Airtable's $10M Hyperagent credit program rounds out the picture: workflow incumbents are absorbing the agent-hosting surface before pure-play orchestration startups can define it.
When SAP underwrites a thesis with €100M and ServiceNow ships headless APIs for agents to consume, the standalone agent-ops category has 12-18 months before it becomes a feature inside an existing renewal.
Where the Investable Window Remains
a16z's GTM thesis lines up with Lemkin's proof point, where 2 human seats replaced more than 10 and spend rose 83% on the back of 20+ agents in production. That confirms the value migration is real. The system of intelligence layer is where the moat now sits, built on orchestration gravity and the accumulated workflow context that competitors cannot replay cheaply. The window is narrower than it sounds.
- Agent infrastructure picks-and-shovels: MCP gateways, agent identity and auth, knowledge-graph tooling, observability. Series A pricing is still rational. Expect a reset inside 2 quarters as corp dev activates.
- Vertical orchestration with institutional context: narrow high-frequency workflows like research, prospecting, and structured note-taking, with measurable outputs and NRR exceeding 150%. The Stitch archetype.
- Consumption-pricing proof points: 3+ customers showing 10x agent deployment, 80% seat reduction, and 80%+ spend increase clears the bar for premium multiples.
What to Avoid
Horizontal agent-ops companies face distribution asymmetry they cannot outspend. This is probably wrong in one or two specific cases we will eventually have to write about, but the pattern of Notion, Cursor, and Airtable absorbing agent hosting into existing workflow surfaces is not a problem standalone orchestration startups solve with capital alone. Vertical beats horizontal in agent ops from here.
Action items
- Map pipeline against 'agent-hosting platform' displacement risk — identify which deals get absorbed if Notion/SAP/Airtable ship the feature
- Source 3-5 agent infrastructure deals in MCP tooling, agent identity, and agent observability before SAP's corp dev team does
- Request Vercel AI Gateway production index as recurring data source; use spend/volume split as diligence benchmark
- Stress-test portfolio cos using third-party Claude integrations against June 15 pricing — model gross margin impact if API-rate billing kicks in
Sources:Vercel's first production AI Gateway index · a16z has published another map of where value accrues in GTM software · SAP put one hundred million euros into an Autonomous Enterprise fund · Anthropic squeezing its pre-IPO round is interesting
03 AI Security Crosses from Pitch Deck to Budget Line — Fund the Category Before Series B Reprices
The KEV Moment
LiteLLM, the LLM-routing control plane sitting inside thousands of enterprise AI deployments, landed on CISA's Known Exploited Vulnerabilities catalog this cycle, which is the first time the federal government has flagged an AI-infrastructure component as actively exploited in the wild. Ollama shipped a CVSS 9.1 data-exfiltration bug via malicious model files in the same window, and PraisonAI was weaponized within 4 hours of disclosure. The AI-infra stack is, in security maturity terms, somewhere around where enterprise SaaS was in 2014.
Meanwhile Microsoft's MDASH multi-model system quietly shipped 16 validated Windows CVEs in a single Patch Tuesday, which is the first auditable evidence that autonomous vulnerability discovery works at production scale rather than in a demo. DepthFirst's Open Defense Initiative claims 10x cost efficiency over Anthropic's Mythos, finding 12 memory corruption bugs in FFmpeg for roughly a thousand dollars where Mythos missed the same at ten thousand. Take the cost ratio with a grain of salt; the directional point survives the discount.
When rivals rent compute from enemies and AI gateways land on KEV, you are not in a security narrative. You are in a category formation event with a 6-12 month pricing window.
The Bifurcation
AI security is splitting into two businesses with economics that share almost nothing:
Segment Signal Window AI gateway security LiteLLM KEV; Ollama CVSS 9.1 Series A now Autonomous vuln discovery MDASH 16 CVEs; DepthFirst 10x Series A-B, 12 months LLMjacking defense 113K requests/month on honeypot; 175 hijack attempts/week Pre-category seed Agentic SOC/GRC Exaforce, Drata, Teleport all shipped same week Saturating — raise bar Identity/deepfake defense $40B projected US losses 2027 Series B before consensus That Mythos Congressional briefing routed through NSA rather than CISA is the tell. The first federal dollars in this category flow through intelligence community procurement, not civilian defense, which means weight toward teams with TS/SCI cleared talent and offensive-AI primitives wrapped in compliance. The capital is not going where the press releases imply.
What Gets Repriced Down
TrustedSec pointed LLMs at five commercial EDRs and reported reverse engineering now takes days, not weeks. All five products rest on identical architectural furniture, YARA rules, Lua engines, local ML classifiers, which is fine until the cost of pulling them apart drops by an order of magnitude. OpenAI's Daybreak launched with CrowdStrike, PANW, Zscaler, and Fortinet as 'partners', which is the pre-disintermediation seating chart that has played out in four prior waves of enterprise software. This is probably wrong, but: detection-rule IP is becoming a commodity input, and current EDR multiples are priced for the old moat.
The Harness Is the Moat
Mozilla's custom agentic harness on top of Claude found 271 bugs in Firefox. Anthropic's Mythos, running a generic scan, found 1 real CVE in curl, which the maintainer publicly called 'primarily marketing.' The delta is harness quality, not model quality, which is the more interesting version of the bull case. Fund the orchestration and CI-integration layers. The generic 'LLM finds bugs' pitch is already cheap.
Action items
- Pull forward AI-security diligence by one quarter — specifically AI-gateway firewalls, model-artifact scanners, and LLM-runtime sandboxing
- Request DepthFirst data room and validate 10x cost claim against Mythos on 2-3 additional codebases before next round prices up
- Run portfolio-wide exposure sweep for LiteLLM, Ollama, Traefik, Argo CD, and PraisonAI — escalate any internet-exposed instances
- Apply the 'Mozilla vs curl' test to every AI AppSec pitch — if founder can't explain 271 bugs vs 1, they're selling the model not the product
Sources:Cybersec alpha: AI-infra CVEs hit KEV · DepthFirst's Open Defense Initiative · The EDR moat is cracking · Anthropic's Mythos cleared AISI this week · Microsoft's MDASH producing 16 patched Windows flaws
04 Vertical AI Moats Are Built from Data, Not Models — The Abridge Template
The Category Winner Print
Abridge raised $550M in 2025 alone ($250M early year, $300M at a $5.3B mark in June), services 250 large US health systems, and processes 80M+ patient-clinician conversations annually across 28 languages and 50 specialties. That corpus is unreplicable — no foundation model or new entrant can buy or scrape it. The category is effectively closed for ambient clinical documentation in US large health systems.
The more under-appreciated signal: health systems — historically the slowest enterprise buyers on earth — compressed release cadence from quarterly/biannual to monthly for Abridge. This rewrites velocity assumptions in every healthcare SaaS DCF.
The Template for Vertical AI Underwriting
Abridge's three-act structure is the playbook to generalize:
- Save Time (documentation) → wedge into CMIO budget
- Save Money (prior auth, RCM) → expand to CFO budget
- Save Lives (clinical decision support) → unlocked by Jan 2026 FDA guidance
Each act monetizes the same proprietary conversation asset against a different stakeholder. The non-compete posture with Epic/Cerner — framing itself as a 'clinical intelligence layer' rather than a workflow replacement — is the existential-risk mitigation that makes the story investable at scale.
The investable wedge in vertical AI isn't better models — it's per-use data flywheels that compound with every customer interaction and can't be accessed by competitors.
Why Horizontal AI Wrappers Are Getting Killed
Claude for Small Business shipped with connectors into QuickBooks, PayPal, HubSpot, Google Workspace, and Microsoft 365. That is a horizontal land grab where the connectors are the product. Survivors need vertical depth, regulated workflow access, or proprietary data the connectors cannot replicate. Most pitches do not clear this bar.
The contrast with pre-product thesis bets is stark: Recursive Superintelligence raised $650M at $4B+ on seven co-founders and a thesis. Mind Robotics hit $3.4B at six months old. These are late-cycle optionality plays, not category-defining moats. The Abridge template — per-use data flywheel + ≥2 monetization vectors + explicit non-compete with system-of-record — is what separates durable vertical AI from expensive bets.
Adjacent Opportunities
The category-leader print at $5.3B doesn't close the healthcare AI book — it opens three chapters:
- Payer-side prior authorization: Abridge attacks from provider side; payer counterparty remains greenfield
- Nursing and short-form clinical workflows: 30-second visits are a fundamentally different product surface
- International and specialty verticals: Vet, dental, behavioral health — markets Abridge won't prioritize for 24+ months
Action items
- Kill or de-prioritize any active deals on direct ambient-scribe competitors targeting US large health systems
- Update vertical AI underwriting framework to require per-use data flywheel, ≥2 monetization vectors, and explicit non-compete posture vs. dominant system-of-record
- Build thesis memo on payer-side prior-auth automation with target list of 8-12 startups to meet this quarter
- Diligence SMB vertical AI pipeline against Claude for Small Business connector roadmap — kill or double down
Sources:Abridge at $5.3B: the healthcare AI vertical just printed a category winner · Anthropic is now at thirty billion dollars of annualized revenue · a16z has published another map of where value accrues in GTM software
◆ QUICK HITS
Update: Cerebras closed first day at $311 (+70% from IPO price), implying $41.7B market cap — Eclipse netted 17x, Tiger sitting on $1B paper gain; use as live comp for all AI-infra late-stage marks but note 180-day lock-up before declaring real returns
Cerebras printed a seventy percent first-day pop
Update: Nebius printed 684% YoY revenue growth ($399M Q1, guiding $3.0-3.4B for 2026) with 4+ customers bidding per GPU — but capex ($2.47B) exceeds operating cash ($2.26B), confirming neocloud model is structurally cash-negative
AI compute is sold out 4:1
Benchmark raised a $225M SPV to defend Cerebras ownership — the firm most committed to small early-stage funds broke its own model, signaling SPV infrastructure is now table stakes for AI-era GPs
Cerebras closed at three hundred eleven dollars
Anthropic leased entire Colossus 1 cluster (220K+ GPUs) from Musk's xAI — confirming xAI is retreating from frontier race and monetizing idle compute; reprice any xAI secondary as infra+distribution, not frontier lab
Anthropic's 80x growth broke its infra
Apple building agent governance directly into App Store — gating third-party agents through review queue and fee table ahead of likely WWDC reveal; stress-test any portfolio company whose GTM assumes friction-free iOS agent distribution
Anthropic has apparently overtaken OpenAI in the B2B enterprise segment
Only 15% of enterprises have data foundations for agentic AI despite spending millions (Fivetran index) — data quality and lineage cited as #1 blocker by nearly half of respondents; cleanest picks-and-shovels setup since early Snowflake era
DuckDB goes client-server, 85% agentic-AI readiness gap
Shai-Hulud supply-chain attack framework open-sourced under MIT — documents Sigstore provenance forgery and GitHub Actions OIDC token theft; mandate credential rotation and npm/PyPI provenance audit across portfolio
The EDR moat is cracking
a16z's AI liability framework warns of absolute-liability proposals that would make open-source model releases uninsurable — add 15-25% litigation reserve to unit economics for any deal releasing model weights
a16z is spending partner attention on AI liability policy
◆ Bottom line
The take.
Enterprise AI revenue isn't SaaS revenue — ServiceNow blew its Anthropic budget by May with no telemetry to stop it, Anthropic's June 15 pricing change just killed the 70-90% wrapper arbitrage, and Ramp's enterprise share flip (34.4% vs 32.3%) proves vendor loyalty in AI is effectively zero. The three highest-conviction plays this week: triage every Claude-dependent portfolio company's unit economics before month-end, fund AI observability and agent infrastructure before platforms absorb the category in 12 months, and concentrate vertical AI bets on companies with unreplicable per-use data moats (the Abridge template) while killing horizontal wrappers that Anthropic's own connector roadmap makes redundant.
Frequently asked
- What changed with Anthropic's June 15 pricing and why does it matter for portfolios?
- Anthropic converted Claude subscriptions into dollar-matched API credit pools, eliminating the 70-90% subscription arbitrage that third-party wrappers like Cline and OpenCode had been running. Any portfolio company with COGS tied to Claude subscription tokens effectively lost 20-40% of runway overnight, since programmatic usage now bills at API rates. Founders may not have flagged it yet — this is a triage call, not a next-quarter line item.
- Why isn't ServiceNow's Anthropic spend a bullish signal for enterprise AI ARR?
- ServiceNow burned its full-year Anthropic budget by May with no per-user telemetry, no granular dashboards, and no SLAs — that's reversible experimentation spend, not contracted SaaS ARR. Enterprise AI billing share flips on each model release cycle because there's zero contractual lock-in and budget opacity until invoices arrive. Apply a 20-40% reversibility discount to model-layer ARR lacking SLAs and telemetry.
- Where does the alpha move if the model layer is being repriced?
- Three displacement trades: AI observability and FinOps tooling for token-level cost attribution (ServiceNow's AI Control Tower validated demand, no category winner yet); productized deployment-services tooling targeting the FDE land-grab Google, OpenAI, Salesforce, and ServiceNow are all staffing; and vertical AI with proprietary data flywheels and compliance lock-in, where horizontal reversibility doesn't apply.
- What does the Vercel AI Gateway data reveal about agent workloads?
- Across 200,000+ teams, agentic workloads now carry 59% of token volume — production AI is no longer human-in-the-loop chat. The data also exposes a bifurcation: Anthropic captures 61% of spend at ~25% of volume via premium reasoning (Opus), while Google takes 38% of volume at ~15% of spend via commodity throughput (Flash). Multi-model routing is the enterprise default, retiring single-model moat theses.
- What makes Abridge the underwriting template for vertical AI?
- Abridge combines a per-use data flywheel (80M+ annual patient-clinician conversations across 250 health systems, unreplicable by any foundation model), at least two monetization vectors (documentation, then RCM/prior-auth, then clinical decision support post-FDA guidance), and explicit non-compete posture with Epic/Cerner as a 'clinical intelligence layer.' Any vertical AI memo without all three components fails the comp test.
◆ Same day, different angle
Read this day as…
◆ Recent in investor
Keep reading.
- SpaceX is pricing June 12 at one-point-seven-five trillion, roughly a hundred times revenue, into the worst tape we have seen for a listing…
- SpaceX is quietly collecting $2.17B/month in AI compute rent from Anthropic and Google — a $26B annualized run-rate that isn't in secondary…
- Anthropic edged OpenAI in enterprise billing on Ramp last week, 34.4 percent to 32.3, in the same week ServiceNow admitted it had burned its…
- ServiceNow burned its full-year Anthropic budget by May, with no SLAs, no per-user telemetry, no enterprise dashboard.
- Anthropic leased 220,000 GPUs from Elon Musk's xAI, a sworn enemy, after eighty-times growth broke its infrastructure plan.