Investor daily

Edition 2026-05-28 · read as Investor

CerebrasIPOPopMasksAnthropicBudgetBlowupsatSaaSCo's

Sources
36
Words
1,600
Read
8min

Topics Agentic AI AI Capital LLM Inference

◆ The signal

Cerebras popped 70% on day one to $311 while ServiceNow quietly blew its full-year Anthropic budget by May — with zero SLAs and no usage telemetry. Your AI marks are going up on proven exit liquidity while the revenue quality underneath the application layer is silently deteriorating. Anthropic's June 15 credit conversion eliminates the 70-90% token arbitrage that Claude-dependent wrappers relied on. The window to sell into strength and stress-test portfolio revenue quality is this week, not next quarter.

◆ INTELLIGENCE MAP

  1. 01

    Enterprise AI Revenue Is Not SaaS Revenue

    act now

    ServiceNow exhausted its full-year Anthropic budget by May with no SLAs, no per-user telemetry, and no contractual lock-in. Anthropic's June 15 credit conversion kills the 70-90% subscription arbitrage Claude wrappers depend on. OpenAI countered with 2-month free Codex — both labs buying developer lock-in ahead of Anthropic's likely October IPO.

    70-90%
    arbitrage eliminated
    5
    sources
    • Budget blown by
    • Credit conversion
    • Likely IPO
    • Wrapper margin hit
    1. ServiceNow budget blownMay 2026
    2. Credit conversion liveJune 15
    3. OpenAI Codex promo endsJuly 2026
    4. Anthropic IPO targetOctober 2026
  2. 02

    AI Infra Exit Window: First-Day Proof Points

    monitor

    Cerebras closed at $311 (70% pop), delivering Eclipse 17x and Tiger $1B paper gain. Fervo popped 33% to $10B+ on AI power demand. Benchmark raised a $225M SPV to defend Cerebras ownership — the first institutional admission that SPVs are now structural, not optional. Two of the next three filers will clear; the third tests whether the bid is selective.

    17x
    Eclipse return
    4
    sources
    • Cerebras day-1 pop
    • Fervo day-1 pop
    • Benchmark SPV
    • Tiger paper gain
    1. Eclipse (2016)17
    2. Tiger (late '25)3.5
    3. Benchmark SPV1.1
  3. 03

    Agent Economy Crosses 59% — Category Repricing Live

    monitor

    Vercel's production AI Gateway shows agentic workloads hit 59% of token volume — no longer a thesis, the majority case. Anthropic captures 61% of spend (premium reasoning), Google takes 38% of volume (commodity throughput). SAP committed €100M, ServiceNow shipped headless Action Fabric, and Notion launched a developer platform. The agent infrastructure window is 2-3 quarters before incumbents absorb it.

    59%
    agentic token share
    5
    sources
    • Anthropic spend share
    • Google volume share
    • SAP agent fund
    • Absorption window
    1. Agentic workloads59
    2. Chat/completion41
  4. 04

    AI Security Crosses KEV Threshold — Budget Line Confirmed

    monitor

    LiteLLM hit CISA's KEV catalog — first AI-infra component flagged as actively exploited. MDASH produced 16 validated Windows CVEs in one Patch Tuesday. DepthFirst claims 10x cost advantage over Mythos on vulnerability discovery. OpenAI's Daybreak launched with 8 incumbent 'partners' in the pre-disintermediation pattern. Series A window for AI-native security is open now.

    16
    MDASH CVEs shipped
    5
    sources
    • LiteLLM status
    • DepthFirst vs Mythos
    • Daybreak partners
    • PraisonAI exploit time
    1. DepthFirst (FFmpeg)1000
    2. Mythos (FFmpeg)10000
  5. 05

    GTM Value Migration: Record to Intelligence Layer

    background

    a16z published its GTM thesis with a live Stitch check: system of intelligence captures majority value over the system of record. Lemkin proof point — 80% fewer human seats, 83% higher spend ($12K→$22K), 20+ agents running. Salesforce and HubSpot ($149B combined) face ARPU expansion but seat-count erosion. The investable window for vertical orchestration is 12-18 months before incumbents or consensus close it.

    83%
    spend increase per account
    3
    sources
    • Seat reduction
    • CRM TAM at risk
    • Agent-to-seat ratio
    • Window remaining
    1. Before (10 seats)12000
    2. After (2 seats + agents)22000

◆ DEEP DIVES

  1. 01

    Enterprise AI Revenue Quality Is Structurally Fragile — Reprice the Application Layer

    The Revenue Looks Great Until You Read the Contract

    The least comfortable fact in AI investing this week is that enterprise AI ARR does not behave like SaaS ARR, and the evidence keeps arriving from buyers who would prefer it didn't. ServiceNow, arguably the most sophisticated enterprise buyer alive, blew through its full-year Anthropic budget by May 2026. National Life Group's CIO described Anthropic as 'great for consumer usage but not great for companies.' The reason is banal: there is no granular per-user telemetry, no SLAs worth the name, and no contractual switching costs anywhere in the stack.

    This matters because the thirty billion dollar ARR figure everyone is marking Anthropic against assumes enterprise-grade durability, and the plumbing underneath it is not enterprise-grade.


    The June 15 Margin Event

    Anthropic's decision to convert every Claude subscription into a dollar-matched API credit pool eliminates the 70-90% arbitrage that third-party harnesses (Cline, OpenCode, the various Claude wrappers) have been quietly running for months. A $200/month plan now buys exactly $200 of programmatic tokens, which means no more leveraging subscription-tier access for production-scale inference at a fraction of API rates.

    OpenAI answered within days with two months of free Codex for enterprise switchers. Set against Ramp's April data showing Anthropic at 34.4% of business spend versus OpenAI at 32.3%, this looks like a subsidy war timed to pre-IPO margin recovery. Or rather, the more interesting version: a subsidy war both sides need to be seen losing money on, for different reasons.

    Every Claude-dependent portfolio company whose unit economics were built on subscription-tier token access has lost 20-40% of runway since last Friday, a change that is four days old and which most founders have not yet flagged.

    The Deployment Services Arms Race Confirms It

    The industry has quietly conceded what Palantir proved twenty years ago, which is that deployment is the bottleneck, not model capability. Google is hiring hundreds of forward-deployed engineers, OpenAI stood up DeployCo with Bain, and Salesforce and ServiceNow are staffing the same function under different names. When four firms independently decide the margin lives in deployment services, the margin lives in deployment services.

    The second-order effect is a new category — AI observability and FinOps — validated by ServiceNow's own AI Control Tower selling into the same customers panicking about their Anthropic bills. The company that burned its budget is now selling the budget-monitoring tool, which is not coincidence so much as a CDIO who watched the category form inside her own P&L and decided to charge for the lesson.

    What This Means for the Book

    The under-discussed risk is the reversibility of enterprise AI spend: absent SLAs there are no switching costs, absent usage telemetry there is no early warning, and FOMO-driven procurement (which describes most 2025-2026 AI spend) produces a cliff-shaped risk profile rather than the smooth retention curves SaaS multiples assume. This is probably wrong in a handful of names, but the working rule is to apply a 20-40% reversibility discount to any LLM-layer ARR where SLAs and telemetry are absent.

    Action items

    • Request updated gross-margin models from every Claude-dependent portfolio company reflecting the June 15 credit conversion — any wrapper running COGS against subscription tokens needs a revised cohort model by end of month
    • Demand SLA + usage-telemetry roadmap from any portfolio company pitching enterprise AI ARR at next board meeting
    • Build a sourcing sprint on AI observability / FinOps (token-cost-attribution, per-user spend caps, SLA monitoring) at Seed-Series A
    • Apply 20-40% reversibility discount to any LLM-layer ARR in your marks where SLAs/telemetry/contractual lock-in are absent

    Sources:Anthropic has an enterprise gap · Anthropic squeezing its pre-IPO round · Anthropic is reportedly running at thirty billion dollars · Anthropic's 80x growth broke its infra

  2. 02

    Agent Workloads Are the Majority Case — Infrastructure Window Is 2-3 Quarters

    59% Is Not a Thesis. It's the Base Rate.

    Vercel published the first production-grade AI Gateway index this week — not a lab benchmark, not self-reported ARR, but downstream production data across 200K+ teams. The headline: agentic workloads now carry 59% of all token volume. The chat-completion era is officially the minority case in production.

    More revealing is the spend-versus-volume bifurcation. Anthropic captures 61% of spend via Opus on premium reasoning calls. Google takes 38% of volume via Flash as commodity throughput. These are two different businesses emerging inside what everyone has been calling 'foundation models.' The pricing power lives with Anthropic. The scale lives with Google. Multi-model routing is not a sophistication — it's the default architecture.


    The Incumbents Are Moving First This Cycle

    Unlike the cloud transition where AWS and Azure defined infrastructure while ERPs caught up late, this time the incumbents are moving first:

    • SAP committed €100M to an Autonomous Enterprise fund with Nvidia and Microsoft wired in
    • ServiceNow shipped Action Fabric — headless APIs designed for agents to consume, decoupling logic from UI
    • Notion launched a developer platform hosting Claude, Codex, Cursor, Decagon, Warp, and Devin as teammates
    • Airtable launched Hyperagent with a $10M inference credit program targeting 500 founders

    This means the question of where agent-layer value accrues looks materially different from the cloud parallel. Pure-play agent orchestration layers face absorption risk from both below (model labs climbing up) and above (workflow platforms climbing down).

    The window for agent infrastructure at reasonable Series A pricing is 2-3 quarters. After that, SAP's corp dev team, Notion's platform, and Anthropic's own tooling absorb the category into existing surfaces.

    The a16z GTM Thesis Sharpens the Allocation

    a16z published its full GTM thesis with a live check (Stitch): the system of intelligence layer captures majority value over the system of record. The Lemkin proof point makes the economics concrete: Salesforce from 10+ human seats to 2 humans + 1 API seat, while spend rose 83% ($12K → $22K) and 20+ agents run on top. The seat count collapsed. The bill went up.

    The investable wedge is narrow: vertical-specific orchestration layers with institutional-context accumulation. The moat has migrated from data gravity to orchestration gravity — institutional context, workflow memory, and multi-system synthesis are the new lock-in. Horizontal 'AI copilot for sales' plays walk straight into Salesforce's API-first counter-punch.

    What Survives Absorption

    Three characteristics define what the incumbents cannot replicate in a quarter: (1) proprietary action surfaces — a regulated workflow, a dataset, an integration graph; (2) accumulated institutional context that compounds with usage; (3) vertical depth the incumbent's horizontal platform cannot be bothered to defend. Everything else is a feature waiting to be shipped.

    Action items

    • Source 3-5 agent infrastructure deals in MCP tooling, agent identity/auth, agent observability, and knowledge-graph governance this quarter — before SAP's corp dev activation compresses entry points
    • Map pipeline against 'agent-hosting platform' absorption risk — identify which deals get displaced if Notion/Airtable/ServiceNow absorb the workflow
    • Request consumption/agent-usage metrics in all active AI-GTM diligence — specifically agent-to-seat ratio, API call volume growth, and NRR on existing logos
    • Rerank AI-GTM pipeline by orchestration moat depth — prioritize narrow high-frequency workflows with measurable outputs over horizontal 'AI copilot for sales' plays

    Sources:a16z has published another map · Vercel's first production AI Gateway index · SAP put one hundred million euros · Anthropic shipped Claude Code

  3. 03

    AI Security Hit Its KEV Moment — The Series A Window Opens Now

    LiteLLM Lands on the KEV Catalog

    CISA added LiteLLM, an AI Gateway routing control plane, to its Known Exploited Vulnerabilities catalog this week. This is the first time an LLM-routing component has been federally flagged as actively exploited. Combine that with Ollama's GGUF model-loader bug (CVSS 9.1, data exfiltration via malicious model files) and OpenClaw shipping six critical CVEs in a single cycle, and the AI-infra stack sits at roughly the security maturity of enterprise SaaS in 2014.

    This looks like a vulnerability story and is not. Or rather, the more interesting version is that it is the regulatory validation event that turns "AI security" from a pitch slide into an enterprise budget line with procurement authority attached.


    Concurrent Disruption Signals

    The category is being disrupted from three directions at once, and the wedges do not all have to clear for the thesis to work.

    Economics: DepthFirst's Open Defense Initiative landed FFmpeg, Envoy, and Kata as anchor tenants, claiming 12 memory corruption bugs for ~$1,000 in compute versus Anthropic's Mythos missing the same bugs at ~$10,000 across hundreds of scans. If the 10x claim holds at scale, wrapper economics collapse for anyone building security workloads on rented frontier-model time.

    Platform absorption: OpenAI's Daybreak launched with Cloudflare, Cisco, CrowdStrike, Palo Alto Networks, Oracle, Zscaler, Akamai, and Fortinet listed as "partners." That is the pre-disintermediation pattern, every incumbent named as a friend in the quarter before the platform eats them.

    Autonomous discovery at scale: Microsoft's MDASH produced 16 validated Windows CVEs in a single Patch Tuesday, the first auditable evidence that multi-model vulnerability discovery works at hyperscaler production scale rather than in conference demos.

    When attackers weaponize AI-framework CVEs in 4 hours (PraisonAI) and defenders need weeks to patch, the economics of selling security on a human-speed cadence break. The fundable layer is the one that matches machine speed.

    The Investment Map

    The category splits into four fundable wedges, and the diligence test on each is whether the moat is harness design or merely model access.

    1. AI-gateway security: MCP firewalls, LLM posture management, model-artifact scanning. LiteLLM on KEV is the "MongoDB ransomware moment" that creates enterprise procurement authority.
    2. Autonomous vulnerability discovery with specialized harnesses: the Mozilla-versus-curl delta is the diligence test. Mozilla found 271 Firefox bugs with a custom agentic harness; Mythos found 1 real CVE in curl with a generic scan. Harness design is the moat, not model access.
    3. Agent runtime security: an 81% bot-detection bypass rate plus 4-hour exploit windows on AI frameworks equals a defender-empty category with near-certain demand.
    4. Supply-chain provenance: Shai-Hulud's leaked source documents industrial-grade Sigstore forgery. The current trust chain is known-breakable.

    EDR Moat Erosion

    TrustedSec ran LLMs against five commercial EDRs and compressed reverse engineering from weeks to days; all five products rest on identical architectural furniture (YARA rules, Lua engines, local ML classifiers). Current multiples on CrowdStrike, Palo Alto, and SentinelOne are priced for the old moat. Detection-rule IP is becoming a commodity input. This is probably wrong on timing, but the durable positions look like distribution, data scale, and harness-heavy integration layers.

    Action items

    • Pull forward AI-security diligence by one quarter — specifically AI-gateway firewalls, model-artifact scanners, and LLM-runtime sandbox companies at Series A
    • Request DepthFirst data room and validate 10x cost claim against Mythos with independent benchmark on 2-3 additional codebases
    • Add OpenAI Daybreak to competitive tracker for every security portfolio company — require founders to articulate why their product isn't a Daybreak connector in 18 months
    • Run portfolio-wide exposure sweep for LiteLLM (1.81.16–1.83.7), Traefik, Argo CD, and MOVEit Automation — escalate any that are internet-exposed

    Sources:Cybersec alpha: AI-infra CVEs hit KEV · DepthFirst's Open Defense Initiative · Anthropic's Mythos cleared AISI · The EDR moat is cracking · Microsoft's MDASH producing 16 patched Windows flaws

◆ QUICK HITS

  • Update: Cerebras closed day one at $311 (70% pop) — Eclipse netted 17x from 2016 entry, Tiger sits on ~$1B paper from $89 entry months ago; Modal's $4.5B round is now the cleanest private read-through

    Cerebras printed a seventy percent first-day pop

  • Benchmark raised a $225M SPV for Cerebras despite being 'the patron saint of small early-stage funds' — late-stage adds were 93% of total cost basis and lowered the fund multiple; SPV infrastructure is now structurally required for top-5 winners

    Cerebras printed a seventy percent first-day pop

  • Abridge raised $550M at $5.3B servicing 250 health systems with 80M+ conversations — the ambient clinical scribe category is closed; alpha moved to payer-side prior auth, nursing workflows, and specialty verticals

    Abridge at $5.3B: the healthcare AI vertical just printed a category winner

  • Google's Gemini Intelligence embeds autonomous task execution directly into Android (97%+ share in key markets, summer 2026 rollout) — every mobile AI assistant startup whose wedge is 'agent for your phone' is now a feature, not a company

    Gemini becomes the OS: your agent-layer portfolio just got disintermediated

  • Fivetran readiness index confirms only 15% of enterprises have data foundations for agentic AI despite spending millions — data quality and lineage cited as #1 blocker by nearly half; picks-and-shovels window open at Series A/B

    Data infra thesis update: DuckDB goes client-server, 85% agentic-AI readiness gap

  • Figure ran an 8-hour fully autonomous humanoid shift at human-parity speed with fleet coordination and self-maintenance — the first credible 'labor-hour' proof point; expect 1X, Physical Intelligence, and Apptronik rounds to mark against this comp

    Anthropic squeezing its pre-IPO round

  • a16z's AI liability framework is coordinated lobbying against absolute-liability proposals — if developer liability for downstream misuse gets established, open-weight foundation models become uninsurable; stress-test deals at 15-25% revenue absorbed by litigation reserves

    a16z is spending partner attention on AI liability policy

  • Nebius guided to 6x revenue growth ($530M→$3.4B) with 4+ customers bidding per GPU and $2.47B capex against $2.26B operating cash — neoclouds are structurally cash-negative even at 684% growth; compress private multiples from 8-10x to 2-3x forward

    AI compute is sold out 4:1

◆ Bottom line

The take.

Enterprise AI revenue is growing at headline rates while the plumbing underneath it — no SLAs, no telemetry, no contractual lock-in — means that ARR is reversible in ways SaaS ARR never was. The AI exit window is proven liquid (Cerebras 70% pop, Eclipse 17x), agentic workloads are now 59% of production tokens, and AI-infra security just hit CISA's actively-exploited list for the first time. The trade this week: sell model-layer exposure into the strongest primary demand you'll see, stress-test every wrapper's unit economics before June 15 kills their margin arbitrage, and concentrate new capital where incumbents cannot absorb the category in two quarters — agent infrastructure governance and AI-native security.

— Promit, reading as Investor ·

Frequently asked

How does Anthropic's June 15 credit conversion change wrapper economics?
It eliminates the 70-90% token arbitrage that Claude-dependent wrappers ran by leveraging subscription-tier access for production inference. After June 15, a $200/month plan buys exactly $200 of programmatic tokens. Any portfolio company whose COGS model assumed subscription-tier token rates has effectively lost 20-40% of runway, and most founders have not yet flagged it.
What discount should be applied to LLM-layer ARR without SLAs or usage telemetry?
A 20-40% reversibility discount is the working rule. ServiceNow's full-year Anthropic budget blowout by May, combined with CIO sentiment that Claude is 'not great for companies,' demonstrates that enterprise AI spend reverses quickly absent contractual lock-in. Without SLAs, telemetry, or switching costs, retention curves look cliff-shaped rather than SaaS-smooth.
Why is the agent infrastructure investment window only 2-3 quarters?
Incumbents are moving first this cycle, unlike the cloud transition. SAP committed €100M with Nvidia and Microsoft, ServiceNow shipped Action Fabric, Notion launched a developer platform hosting Claude/Codex/Cursor, and Airtable launched Hyperagent. Pure-play orchestration layers face absorption from both model labs climbing up and workflow platforms climbing down, compressing Series A entry points fast.
What makes LiteLLM's KEV listing an investable inflection rather than a vulnerability story?
It is the first time a federal catalog has flagged an LLM-routing component as actively exploited, which converts AI security from a pitch slide into an enterprise budget line with procurement authority. Combined with Ollama's CVSS 9.1 model-loader bug and OpenClaw's six critical CVEs, it functions as the 'MongoDB ransomware moment' that triggers re-rating of AI-gateway firewalls and model-artifact scanners at Series A.
What characteristics let an agent-layer company survive incumbent absorption?
Three: proprietary action surfaces (regulated workflows, integration graphs, exclusive datasets), accumulated institutional context that compounds with usage, and vertical depth horizontal platforms cannot be bothered to defend. The moat has migrated from data gravity to orchestration gravity. Horizontal 'AI copilot for sales' plays walk into Salesforce's API-first counter-punch; narrow high-frequency workflows with measurable outputs do not.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

Keep reading.