Leader daily

Edition 2026-05-27 · read as Leader

AnthropicMythosClearsUKAISIAttackRanges,ResetsBar

Sources
36
Words
1,494
Read
7min

Topics Agentic AI AI Capital LLM Inference

◆ The signal

A reasonable skeptic would say one model clearing two ranges is one model clearing two ranges. The skeptic is correct, and also a quarter behind. Anthropic's Mythos is the first model through both UK AISI simulated attack ranges. EDR internals that used to cost skilled reversers weeks now resolve in days. Exploit weaponization on PraisonAI is down to four hours. Security architectures calibrated last year were calibrated against a different adversary.

◆ INTELLIGENCE MAP

  1. 01

    Security Architecture Invalidated: AI Achieves Full Network Takeover

    act now

    Mythos cleared both AISI hardest ranges — first model ever. EDR reverse engineering collapsed from weeks to days across all 5 tested products. PraisonAI went from disclosure to exploitation in 4 hours. Foxconn lost 8TB of Apple/Intel/Nvidia designs. The defender response window is now shorter than the patch cycle.

    4hrs
    exploit weaponization window
    8
    sources
    • EDR reversal time
    • Foxconn IP exfiltrated
    • KEV in AI tools
    • Mozilla AI bugs found
    1. Traditional exploit dev90
    2. AI-assisted (2025)7
    3. PraisonAI weapon.0.17
  2. 02

    Agent Execution Layer Being Claimed — 59% of Traffic Is Agentic

    monitor

    Vercel production data confirms 59% of AI token volume is now agentic. SAP and ServiceNow are colliding on who owns the execution layer — SAP with a €100M Knowledge Graph, ServiceNow via MCP servers. Notion launched as an agent-hosting platform. Intercom rebranded entirely to its AI agent 'Fin.' The UI-centric era is ending.

    59%
    AI traffic that's agentic
    7
    sources
    • SAP fund
    • Agent bot bypass rate
    • Seat reduction (Lemkin)
    • Spend increase
    1. Agentic workloads59
    2. Human chat/query41
  3. 03

    Anthropic's June 15 Pricing Restructure Opens a 6-Month Arbitrage Window

    act now

    Anthropic disclosed 80x demand against 10x planning. ServiceNow blew its full-year Anthropic budget by May. June 15 brings programmatic metering — third-party tools lose 70-90% discounts. OpenAI responded with 2 months free Codex. Both vendors are paying enterprises to switch simultaneously. This subsidy window lasts ~6 months before lock-in hardens.

    80x
    demand vs. plan
    9
    sources
    • Anthropic ARR
    • Growth in 4 months
    • B2B share lead
    • Pricing change date
    1. Anthropic B2B share34.4
    2. OpenAI B2B share32.3
  4. 04

    Compute Supply Locked Up at $10-100B Scale — Cerebras IPO Confirms

    monitor

    Microsoft's $100B OpenAI commitment surfaced in court filings. Cerebras priced 16% above range at $56B, popped 70% day one on a $20B OpenAI anchor. Nebius at 684% revenue growth with 4:1 demand-to-supply ratio. Fervo Energy debuted at $10B+ (up 33%) on AI power demand. Google optioned 3GW from Fervo alone — enough for 60+ facilities.

    $100B
    Microsoft-OpenAI commitment
    6
    sources
    • Cerebras valuation
    • Nebius revenue growth
    • GPU demand ratio
    • Google power option
    1. Microsoft→OpenAI100
    2. OpenAI→Cerebras20
    3. Fervo Energy IPO10
    4. Cisco AI orders9
  5. 05

    AI Liability Regime Being Written — 18-Month Window to Shape the Rules

    background

    a16z published the most comprehensive AI liability framework yet, proposing user-liability defaults and damages caps. Courts are actively deciding cases that could impose strict developer liability before any legislation passes. Developer-liability regimes would make open-source AI economically unviable. The framework that prevails determines which companies can afford to exist in 5 years.

    $115M
    a16z political spend 2026
    4
    sources
    • Passage odds (Clarity)
    • Active jurisdictions
    • Window to influence
    • a16z midterm spend
    1. NowCourts setting precedent
    2. Q3 2026Safe harbor proposals mature
    3. Q1 2027Federal framework likely
    4. 2028Enforcement begins

◆ DEEP DIVES

  1. 01

    Your Security Architecture Was Built for Last Year's Adversary — The Adversary Just Got Replaced

    The Discontinuity

    The temptation with this week's results is to file them under incremental capability and move on. That is the wrong file. Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges, and the qualifying task was full network takeover, not persistence. OpenAI's GPT-5.5-cyber completed one of the two. Independent assessors agree that frontier models can now find and chain exploits in something close to real time. The UK AISI reports AI cyber task completion is doubling every few months, and the latest numbers broke above the trend line.

    The vulnerability disclosure and patch cycle the industry runs on is measured in days to weeks. The adversary's capability cycle is now measured in hours.

    EDR Is Now a Glass Box

    TrustedSec ran LLMs against five commercial EDR products and found the same architecture in every one: YARA-style rules, behavioral logic, allowlists, prefilters, Lua scripting engines readable after a single decryption pass, and local ML classifiers. The reverse engineering that used to take a skilled human weeks now takes days with AI assistance. The endpoint detection category was running on security-through-obscurity. The obscurity is gone.

    A reasonable skeptic would point out that EDR was never the only control, and that is correct. The controls that matter over the next eighteen months are identity, network telemetry, and behavioral analytics above the endpoint. Teams that keep treating the agent on the box as the load-bearing control will learn what load-bearing means when the adversary can read the agent.


    The Exploit Window Collapsed

    PraisonAI went from disclosure to active exploitation in 4 hours. Microsoft's MDASH system found 16 exploitable flaws in a single Patch Tuesday using multi-model AI analysis. Mozilla's custom harness found 271 real bugs in Firefox, including sandbox escapes, against curl's single low-severity CVE under generic scanning. The variable is not the model. The variable is the harness.

    Patch SLAs written for a thirty-day window were calibrated for a world where weaponization was the slow step. Weaponization is no longer the slow step. Procurement is.

    Supply Chain Under Active Exploitation

    Foxconn lost 8TB of confidential designs belonging to Apple, Intel, Google, and Nvidia to Nitrogen ransomware. CISA added 5 AI infrastructure tools to the Known Exploited Vulnerabilities catalog, with LiteLLM, Ollama, and OpenClaw among them. The AI tooling adopted last year for speed is now being targeted in production before most organizations have finished inventorying what they deployed.

    Attack SurfaceOld AssumptionCurrent Reality
    EDR bypassWeeks of expert workDays with AI
    Exploit developmentDays to weeks4 hours
    Vuln discoveryExpensive human research271 bugs/model cycle
    AI infra securityExperimental layer5 KEV entries, actively exploited

    Action items

    • Commission red team exercise targeting your EDR with AI-assisted reverse engineering to quantify actual detection gap
    • Rewrite patch SLAs to 72 hours for critical internet-facing assets and establish automated containment for the gap
    • Inventory all AI infrastructure tooling (LiteLLM, Ollama, model registries) adopted without security review in the past 12 months
    • Evaluate building custom AI vulnerability scanning harnesses for your 3 most critical codebases this quarter

    Sources:Clint Gibler · The Information AM · CyberScoop · The Hacker News · SANS AtRisk · TLDR InfoSec

  2. 02

    The Agent Execution Layer Is Being Claimed — Your Platform Is Either the Surface or the Plumbing

    The 59% Number Deserves a Second Look

    Vercel's AI Gateway production index is the closest thing this market has to ground truth, and it now reports that 59% of all AI token volume is agentic workloads. A reasonable skeptic would point out that one vendor's gateway is not the whole market. The reasonable skeptic is correct. The reasonable skeptic also has to explain why every other production telemetry source we looked at this year tells the same story. A roadmap built around adding a chatbot is now optimizing for the minority case.

    The question is no longer which model is best. It is which layer coordinates the models, tools, memory, and handoffs, and who owns that layer.

    SAP and ServiceNow Picked Different Architectures

    Both companies told the market in the same quarter that the UI-centric era is ending. They disagree on what replaces it.

    • ServiceNow adopted MCP servers as the communication standard for its Action Fabric. The message to the ecosystem is that agents talk to ServiceNow through an open protocol.
    • SAP is building a vertically integrated €100M Knowledge Graph so its own agents are contextually superior inside SAP's data universe. The message is the inverse.

    These are two competing theories of how the agent economy organizes: open interoperability against data-moat integration. Both can win in different segments at the same time. The enterprise running both vendors has to decide which one owns the execution layer for processes that cannot stop, and that decision is harder than either slide deck makes it look.

    Orchestration Is Where Lock-In Actually Lives

    Notion launched a developer platform built for agents to sync data and trigger workflows. Intercom rebranded the entire company to Fin, where the agent is the product. Amazon killed Rufus and embedded the capability into Alexa's shopping flow. Three different companies, one shared conclusion: the platform that hosts where agents live captures the distribution and switching cost that model vendors are quietly giving up.

    Vercel's data shows Anthropic captures 61% of spend on the expensive reasoning side, while Google captures 38% of volume on cheap throughput. Model selection has stopped being a strategic decision and started being a routing problem. The strategic decision is whether a company owns the orchestration surface or becomes the commodity infrastructure underneath someone else's agent.

    The Pricing Signal

    The Lemkin number is the one worth keeping on a sticky note: 80% fewer human seats, 83% higher total spend, 20+ agents running. Consumption pricing is accretive against seat pricing in a way the seat-pricing incumbents have not absorbed yet. SAP is not charging per-seat for autonomous finance agents. ServiceNow's headless posture implies consumption pricing on agent API calls. The per-seat era is ending in production contracts being signed this quarter, which is a more useful place to watch than the analyst-day deck.

    Action items

    • Conduct agent-readiness audit of your platform — determine if third-party AI agents can discover, invoke, and orchestrate your workflows without a human UI
    • Decide whether your product is the orchestration surface or the infrastructure underneath it, and allocate engineering accordingly
    • Model consumption-based pricing scenarios and pilot with 3-5 customers if you currently charge per-seat for workflows agents will consume
    • Stand up AI governance function with authority over tool/vendor rationalization before Q3 budgeting

    Sources:TLDR IT · a16z · TLDR · ben's bites · Simplifying AI · Lenny's Newsletter

  3. 03

    Both Frontier Vendors Are Paying You to Switch — The 6-Month Window to Lock Optionality

    The Subsidy Phase Has a Clock

    Anthropic and OpenAI are running the full platform competition playbook at the same time, which means customers are being paid to move. Anthropic concedes it grew 80x against a planned 10x, which left it at roughly 12% of required capacity for extended periods. OpenAI answered Anthropic's June 15 pricing change with two months free Codex for enterprise switchers inside the same news cycle. ServiceNow's CDIO says they blew their full-year Anthropic budget by May, against a vendor that offers no SLAs, no usage telemetry, and no enterprise-grade cost governance.

    The window where both vendors are paying customers to move is roughly six months. After that, the lock-in calcifies, and the sticker price is the price.

    What June 15 Actually Changes

    Anthropic's programmatic metering separates first-party usage from third-party usage, which is the part of the announcement worth reading twice. Third-party tools that were enjoying 70-90% effective discounts get capped credits, then API rates. This is an IPO-driven margin play. Anthropic has hired a CFO and is likely targeting an October listing, and the 50% rate limit increase for two months is sugar on a structural price increase.

    A reasonable skeptic would point out that taking incentives from both vendors is not a strategy. The reasonable skeptic is half right. The other half is the arbitrage opportunity sitting in plain view: build a thin abstraction over both, accept the engineering tax, and use the next eighteen months to keep two vendors honest.

    The Capacity Fragility Problem

    A provider that planned for 10x and got 80x was operating at roughly 12% of required capacity for extended stretches. Developers on the platform were getting degraded service, and probably lower-quality model responses, without disclosure. xAI leasing 45% of its compute (220,000 GPUs) to Anthropic, after Musk publicly called them 'misanthropic and evil,' tells you that financial logic has overwhelmed competitive logic. The inference market is financializing.

    The Enterprise Cost Governance Gap

    ServiceNow is already building the workarounds, calling it the 'AI Control Tower,' and selling it to other enterprises. That is not partnership. That is the market routing around a vendor deficiency, and every major AI player now concedes it cannot deploy without an expensive human services layer. If forward-deployed engineers cost $300-500K loaded each and a meaningful deployment needs five to ten of them, the true cost of the program is 3-5x the model fees. The board-deck version of the AI budget is the model fees. The complete version is the services layer underneath, and it is where next year's overrun lives.

    DecisionCost NowCost in 18 Months
    Single vendor, no abstractionLowestNo negotiating leverage at renewal
    Thin abstraction, dual vendor10-15% engineering taxFull pricing leverage, swap in weekend
    Defer decisionZeroLock-in from vendor you didn't choose

    Action items

    • Negotiate aggressive terms with both OpenAI and Anthropic before June 15 — use each vendor's switching offers as leverage against the other
    • Build or validate a model abstraction layer that enables provider swap within 48 hours for production workloads
    • Audit all AI model consumption spend with per-team and per-use-case attribution — determine your actual total cost including services layers
    • Evaluate building internal 'AI Control Tower' capability for cost governance, or acquire emerging observability tooling

    Sources:TLDR AI · Laura Bratton · The Pragmatic Engineer · AINews · StrictlyVC · Techpresso

◆ QUICK HITS

  • Update: Cerebras IPO priced 16% above raised range at $56B fully diluted, popped 70% day one — anchored by OpenAI's $20B commitment, confirming compute supply is being pre-sold in bilateral blocks that exclude the spot market

    Katie Roof

  • Fervo Energy IPO debuted at $10B+ valuation (up 33%) driven by AI datacenter demand — Google holds an option for 3GW (enough for 60+ facilities), making geothermal power a platform business

    StrictlyVC

  • a16z published comprehensive AI liability framework proposing user-liability defaults and damages caps — simultaneously deployed $115.5M into 2026 midterms, making them the largest disclosed political donor of the cycle

    a16z AI Policy Brief

  • Lovable dissolved its growth management layer 5 months ago, replaced it with autonomous parallel 'High-Impact IC' contributors — model is expanding, not retreating, and attracting VP-level talent who prefer autonomy over authority

    Lenny's Newsletter

  • Google's Gemini Intelligence ships summer 2026 on Android flagships — positions the OS as an autonomous agent platform on 3B+ devices, turning apps into infrastructure the agent calls on the user's behalf

    Simplifying AI

  • Only 15% of organizations have data foundations adequate for agentic AI — the PDC survey shows 95.2% of the gap is organizational (ownership, training, requirements) not tooling (4.8%)

    TLDR Data

  • Update: Microsoft court filings reveal $100B total commitment to OpenAI by June 2026 against only $30B in direct revenue — the most audacious infrastructure bet since late-90s fiber, with demand looking real but returns still unproven

    The Information AM

  • ODNI vs Commerce fight over AI model pre-release evaluation authority is active — IC-led regime would function as licensing for frontier AI, Commerce-led stays voluntary; resolution expected in quarters, not years

    Risky.Biz

◆ Bottom line

The take.

AI models achieved full autonomous network takeover this week while EDR agents became transparent to AI reverse engineering in days — your security architecture is fighting the wrong adversary. Simultaneously, both Anthropic and OpenAI are paying enterprises to switch providers in a subsidy window that closes in roughly 6 months (Anthropic's June 15 pricing change is the first deadline). The companies that build thin abstraction layers and negotiate from dual-vendor positions now will hold pricing leverage through 2027. The companies that locked in single-vendor and single-security-model assumptions 18 months ago are about to discover what both of those choices cost at the same time.

— Promit, reading as Leader ·

Frequently asked

Why does one model clearing two AISI ranges matter beyond a single benchmark result?
Because the underlying capability curve — AI cyber task completion doubling every few months — just broke above its trend line, and Mythos clearing both UK AISI simulated attack ranges (full network takeover, not just persistence) signals frontier models can now find and chain exploits in close to real time. The benchmark is a marker, not the story.
How should patch SLAs change given a 4-hour weaponization window?
Critical internet-facing assets need 72-hour patch SLAs with automated containment covering the gap. The PraisonAI disclosure-to-exploitation cycle was 4 hours, which means a 7-30 day SLA is no longer a patch window — it is an active exposure window. Weaponization used to be the slow step; procurement now is.
Should we pick between SAP's knowledge graph approach and ServiceNow's MCP approach?
Not necessarily — they represent two viable theories (data-moat integration vs. open interoperability) and both can win in different segments. The decision that matters is which vendor owns the execution layer for processes that cannot stop in your environment, and that is a per-workflow call, not a company-wide standard.
Is dual-vendor AI sourcing actually worth the engineering overhead?
For the next 6 months, yes. Both Anthropic and OpenAI are subsidizing switchers, and a thin abstraction layer costs roughly 10-15% in engineering tax while preserving full pricing leverage at renewal. After the subsidy phase ends, lock-in calcifies and sticker price becomes the price.
What is the real total cost of an enterprise AI deployment?
Roughly 3-5x the model fees once the forward-deployed engineering layer is included. Loaded FDE cost runs $300-500K each, and meaningful deployments need five to ten of them. Board decks typically show only the model spend, which is why ServiceNow blew its full-year Anthropic budget by May and why cost governance tooling is becoming a category.

◆ Same day, different angle

Read this day as…

◆ Recent in leader

Keep reading.