Why does one model clearing two AISI ranges matter beyond a single benchmark result?

Because the underlying capability curve — AI cyber task completion doubling every few months — just broke above its trend line, and Mythos clearing both UK AISI simulated attack ranges (full network takeover, not just persistence) signals frontier models can now find and chain exploits in close to real time. The benchmark is a marker, not the story.

How should patch SLAs change given a 4-hour weaponization window?

Critical internet-facing assets need 72-hour patch SLAs with automated containment covering the gap. The PraisonAI disclosure-to-exploitation cycle was 4 hours, which means a 7-30 day SLA is no longer a patch window — it is an active exposure window. Weaponization used to be the slow step; procurement now is.

Should we pick between SAP's knowledge graph approach and ServiceNow's MCP approach?

Not necessarily — they represent two viable theories (data-moat integration vs. open interoperability) and both can win in different segments. The decision that matters is which vendor owns the execution layer for processes that cannot stop in your environment, and that is a per-workflow call, not a company-wide standard.

Is dual-vendor AI sourcing actually worth the engineering overhead?

For the next 6 months, yes. Both Anthropic and OpenAI are subsidizing switchers, and a thin abstraction layer costs roughly 10-15% in engineering tax while preserving full pricing leverage at renewal. After the subsidy phase ends, lock-in calcifies and sticker price becomes the price.

What is the real total cost of an enterprise AI deployment?

Roughly 3-5x the model fees once the forward-deployed engineering layer is included. Loaded FDE cost runs $300-500K each, and meaningful deployments need five to ten of them. Board decks typically show only the model spend, which is why ServiceNow blew its full-year Anthropic budget by May and why cost governance tooling is becoming a category.

Edition 2026-05-27 · read as Leader

AnthropicMythosClearsUKAISIAttackRanges,ResetsBar

Sources: 36
Words: 1,494
Read: 7min

Topics Agentic AI AI Capital LLM Inference

◆ The signal

A reasonable skeptic would say one model clearing two ranges is one model clearing two ranges. The skeptic is correct, and also a quarter behind. Anthropic's Mythos is the first model through both UK AISI simulated attack ranges. EDR internals that used to cost skilled reversers weeks now resolve in days. Exploit weaponization on PraisonAI is down to four hours. Security architectures calibrated last year were calibrated against a different adversary.

◆ INTELLIGENCE MAP

01
Security Architecture Invalidated: AI Achieves Full Network Takeover
act now
Mythos cleared both AISI hardest ranges — first model ever. EDR reverse engineering collapsed from weeks to days across all 5 tested products. PraisonAI went from disclosure to exploitation in 4 hours. Foxconn lost 8TB of Apple/Intel/Nvidia designs. The defender response window is now shorter than the patch cycle.
4hrs
exploit weaponization window
8
sources
- EDR reversal time
- Foxconn IP exfiltrated
- KEV in AI tools
- Mozilla AI bugs found
1. Traditional exploit dev90
2. AI-assisted (2025)7
3. PraisonAI weapon.0.17
02
Agent Execution Layer Being Claimed — 59% of Traffic Is Agentic
monitor
Vercel production data confirms 59% of AI token volume is now agentic. SAP and ServiceNow are colliding on who owns the execution layer — SAP with a €100M Knowledge Graph, ServiceNow via MCP servers. Notion launched as an agent-hosting platform. Intercom rebranded entirely to its AI agent 'Fin.' The UI-centric era is ending.
59%
AI traffic that's agentic
7
sources
- SAP fund
- Agent bot bypass rate
- Seat reduction (Lemkin)
- Spend increase
1. Agentic workloads59
2. Human chat/query41
03
Anthropic's June 15 Pricing Restructure Opens a 6-Month Arbitrage Window
act now
Anthropic disclosed 80x demand against 10x planning. ServiceNow blew its full-year Anthropic budget by May. June 15 brings programmatic metering — third-party tools lose 70-90% discounts. OpenAI responded with 2 months free Codex. Both vendors are paying enterprises to switch simultaneously. This subsidy window lasts ~6 months before lock-in hardens.
80x
demand vs. plan
9
sources
- Anthropic ARR
- Growth in 4 months
- B2B share lead
- Pricing change date
1. Anthropic B2B share34.4
2. OpenAI B2B share32.3
04
Compute Supply Locked Up at $10-100B Scale — Cerebras IPO Confirms
monitor
Microsoft's $100B OpenAI commitment surfaced in court filings. Cerebras priced 16% above range at $56B, popped 70% day one on a $20B OpenAI anchor. Nebius at 684% revenue growth with 4:1 demand-to-supply ratio. Fervo Energy debuted at $10B+ (up 33%) on AI power demand. Google optioned 3GW from Fervo alone — enough for 60+ facilities.
$100B
Microsoft-OpenAI commitment
6
sources
- Cerebras valuation
- Nebius revenue growth
- GPU demand ratio
- Google power option
1. Microsoft→OpenAI100
2. OpenAI→Cerebras20
3. Fervo Energy IPO10
4. Cisco AI orders9
05
AI Liability Regime Being Written — 18-Month Window to Shape the Rules
background
a16z published the most comprehensive AI liability framework yet, proposing user-liability defaults and damages caps. Courts are actively deciding cases that could impose strict developer liability before any legislation passes. Developer-liability regimes would make open-source AI economically unviable. The framework that prevails determines which companies can afford to exist in 5 years.
$115M
a16z political spend 2026
4
sources
- Passage odds (Clarity)
- Active jurisdictions
- Window to influence
- a16z midterm spend
1. NowCourts setting precedent
2. Q3 2026Safe harbor proposals mature
3. Q1 2027Federal framework likely
4. 2028Enforcement begins

◆ DEEP DIVES

Your Security Architecture Was Built for Last Year's Adversary — The Adversary Just Got Replaced

The Discontinuity

The temptation with this week's results is to file them under incremental capability and move on. That is the wrong file. Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges, and the qualifying task was full network takeover, not persistence. OpenAI's GPT-5.5-cyber completed one of the two. Independent assessors agree that frontier models can now find and chain exploits in something close to real time. The UK AISI reports AI cyber task completion is doubling every few months, and the latest numbers broke above the trend line.

The vulnerability disclosure and patch cycle the industry runs on is measured in days to weeks. The adversary's capability cycle is now measured in hours.

EDR Is Now a Glass Box

TrustedSec ran LLMs against five commercial EDR products and found the same architecture in every one: YARA-style rules, behavioral logic, allowlists, prefilters, Lua scripting engines readable after a single decryption pass, and local ML classifiers. The reverse engineering that used to take a skilled human weeks now takes days with AI assistance. The endpoint detection category was running on security-through-obscurity. The obscurity is gone.

A reasonable skeptic would point out that EDR was never the only control, and that is correct. The controls that matter over the next eighteen months are identity, network telemetry, and behavioral analytics above the endpoint. Teams that keep treating the agent on the box as the load-bearing control will learn what load-bearing means when the adversary can read the agent.

The Exploit Window Collapsed

PraisonAI went from disclosure to active exploitation in 4 hours. Microsoft's MDASH system found 16 exploitable flaws in a single Patch Tuesday using multi-model AI analysis. Mozilla's custom harness found 271 real bugs in Firefox, including sandbox escapes, against curl's single low-severity CVE under generic scanning. The variable is not the model. The variable is the harness.

Patch SLAs written for a thirty-day window were calibrated for a world where weaponization was the slow step. Weaponization is no longer the slow step. Procurement is.

Supply Chain Under Active Exploitation

Foxconn lost 8TB of confidential designs belonging to Apple, Intel, Google, and Nvidia to Nitrogen ransomware. CISA added 5 AI infrastructure tools to the Known Exploited Vulnerabilities catalog, with LiteLLM, Ollama, and OpenClaw among them. The AI tooling adopted last year for speed is now being targeted in production before most organizations have finished inventorying what they deployed.

Attack Surface	Old Assumption	Current Reality
EDR bypass	Weeks of expert work	Days with AI
Exploit development	Days to weeks	4 hours
Vuln discovery	Expensive human research	271 bugs/model cycle
AI infra security	Experimental layer	5 KEV entries, actively exploited

Action items

Commission red team exercise targeting your EDR with AI-assisted reverse engineering to quantify actual detection gap
Rewrite patch SLAs to 72 hours for critical internet-facing assets and establish automated containment for the gap
Inventory all AI infrastructure tooling (LiteLLM, Ollama, model registries) adopted without security review in the past 12 months
Evaluate building custom AI vulnerability scanning harnesses for your 3 most critical codebases this quarter

Sources:Clint Gibler · The Information AM · CyberScoop · The Hacker News · SANS AtRisk · TLDR InfoSec

02
The Agent Execution Layer Is Being Claimed — Your Platform Is Either the Surface or the Plumbing
The 59% Number Deserves a Second Look

Vercel's AI Gateway production index is the closest thing this market has to ground truth, and it now reports that 59% of all AI token volume is agentic workloads. A reasonable skeptic would point out that one vendor's gateway is not the whole market. The reasonable skeptic is correct. The reasonable skeptic also has to explain why every other production telemetry source we looked at this year tells the same story. A roadmap built around adding a chatbot is now optimizing for the minority case.

The question is no longer which model is best. It is which layer coordinates the models, tools, memory, and handoffs, and who owns that layer.

SAP and ServiceNow Picked Different Architectures

Both companies told the market in the same quarter that the UI-centric era is ending. They disagree on what replaces it.
- ServiceNow adopted MCP servers as the communication standard for its Action Fabric. The message to the ecosystem is that agents talk to ServiceNow through an open protocol.
- SAP is building a vertically integrated €100M Knowledge Graph so its own agents are contextually superior inside SAP's data universe. The message is the inverse.
These are two competing theories of how the agent economy organizes: open interoperability against data-moat integration. Both can win in different segments at the same time. The enterprise running both vendors has to decide which one owns the execution layer for processes that cannot stop, and that decision is harder than either slide deck makes it look.

Orchestration Is Where Lock-In Actually Lives

Notion launched a developer platform built for agents to sync data and trigger workflows. Intercom rebranded the entire company to Fin, where the agent is the product. Amazon killed Rufus and embedded the capability into Alexa's shopping flow. Three different companies, one shared conclusion: the platform that hosts where agents live captures the distribution and switching cost that model vendors are quietly giving up.

Vercel's data shows Anthropic captures 61% of spend on the expensive reasoning side, while Google captures 38% of volume on cheap throughput. Model selection has stopped being a strategic decision and started being a routing problem. The strategic decision is whether a company owns the orchestration surface or becomes the commodity infrastructure underneath someone else's agent.

The Pricing Signal

The Lemkin number is the one worth keeping on a sticky note: 80% fewer human seats, 83% higher total spend, 20+ agents running. Consumption pricing is accretive against seat pricing in a way the seat-pricing incumbents have not absorbed yet. SAP is not charging per-seat for autonomous finance agents. ServiceNow's headless posture implies consumption pricing on agent API calls. The per-seat era is ending in production contracts being signed this quarter, which is a more useful place to watch than the analyst-day deck.
Action items
- Conduct agent-readiness audit of your platform — determine if third-party AI agents can discover, invoke, and orchestrate your workflows without a human UI
- Decide whether your product is the orchestration surface or the infrastructure underneath it, and allocate engineering accordingly
- Model consumption-based pricing scenarios and pilot with 3-5 customers if you currently charge per-seat for workflows agents will consume
- Stand up AI governance function with authority over tool/vendor rationalization before Q3 budgeting
Sources:TLDR IT · a16z · TLDR · ben's bites · Simplifying AI · Lenny's Newsletter

Both Frontier Vendors Are Paying You to Switch — The 6-Month Window to Lock Optionality

The Subsidy Phase Has a Clock

Anthropic and OpenAI are running the full platform competition playbook at the same time, which means customers are being paid to move. Anthropic concedes it grew 80x against a planned 10x, which left it at roughly 12% of required capacity for extended periods. OpenAI answered Anthropic's June 15 pricing change with two months free Codex for enterprise switchers inside the same news cycle. ServiceNow's CDIO says they blew their full-year Anthropic budget by May, against a vendor that offers no SLAs, no usage telemetry, and no enterprise-grade cost governance.

The window where both vendors are paying customers to move is roughly six months. After that, the lock-in calcifies, and the sticker price is the price.

What June 15 Actually Changes

Anthropic's programmatic metering separates first-party usage from third-party usage, which is the part of the announcement worth reading twice. Third-party tools that were enjoying 70-90% effective discounts get capped credits, then API rates. This is an IPO-driven margin play. Anthropic has hired a CFO and is likely targeting an October listing, and the 50% rate limit increase for two months is sugar on a structural price increase.

A reasonable skeptic would point out that taking incentives from both vendors is not a strategy. The reasonable skeptic is half right. The other half is the arbitrage opportunity sitting in plain view: build a thin abstraction over both, accept the engineering tax, and use the next eighteen months to keep two vendors honest.

The Capacity Fragility Problem

A provider that planned for 10x and got 80x was operating at roughly 12% of required capacity for extended stretches. Developers on the platform were getting degraded service, and probably lower-quality model responses, without disclosure. xAI leasing 45% of its compute (220,000 GPUs) to Anthropic, after Musk publicly called them 'misanthropic and evil,' tells you that financial logic has overwhelmed competitive logic. The inference market is financializing.

The Enterprise Cost Governance Gap

ServiceNow is already building the workarounds, calling it the 'AI Control Tower,' and selling it to other enterprises. That is not partnership. That is the market routing around a vendor deficiency, and every major AI player now concedes it cannot deploy without an expensive human services layer. If forward-deployed engineers cost $300-500K loaded each and a meaningful deployment needs five to ten of them, the true cost of the program is 3-5x the model fees. The board-deck version of the AI budget is the model fees. The complete version is the services layer underneath, and it is where next year's overrun lives.

Decision	Cost Now	Cost in 18 Months
Single vendor, no abstraction	Lowest	No negotiating leverage at renewal
Thin abstraction, dual vendor	10-15% engineering tax	Full pricing leverage, swap in weekend
Defer decision	Zero	Lock-in from vendor you didn't choose

Action items

Negotiate aggressive terms with both OpenAI and Anthropic before June 15 — use each vendor's switching offers as leverage against the other
Build or validate a model abstraction layer that enables provider swap within 48 hours for production workloads
Audit all AI model consumption spend with per-team and per-use-case attribution — determine your actual total cost including services layers
Evaluate building internal 'AI Control Tower' capability for cost governance, or acquire emerging observability tooling

Sources:TLDR AI · Laura Bratton · The Pragmatic Engineer · AINews · StrictlyVC · Techpresso

◆ QUICK HITS

Update: Cerebras IPO priced 16% above raised range at $56B fully diluted, popped 70% day one — anchored by OpenAI's $20B commitment, confirming compute supply is being pre-sold in bilateral blocks that exclude the spot market
Katie Roof
Fervo Energy IPO debuted at $10B+ valuation (up 33%) driven by AI datacenter demand — Google holds an option for 3GW (enough for 60+ facilities), making geothermal power a platform business
StrictlyVC
a16z published comprehensive AI liability framework proposing user-liability defaults and damages caps — simultaneously deployed $115.5M into 2026 midterms, making them the largest disclosed political donor of the cycle
a16z AI Policy Brief
Lovable dissolved its growth management layer 5 months ago, replaced it with autonomous parallel 'High-Impact IC' contributors — model is expanding, not retreating, and attracting VP-level talent who prefer autonomy over authority
Lenny's Newsletter
Google's Gemini Intelligence ships summer 2026 on Android flagships — positions the OS as an autonomous agent platform on 3B+ devices, turning apps into infrastructure the agent calls on the user's behalf
Simplifying AI
Only 15% of organizations have data foundations adequate for agentic AI — the PDC survey shows 95.2% of the gap is organizational (ownership, training, requirements) not tooling (4.8%)
TLDR Data
Update: Microsoft court filings reveal $100B total commitment to OpenAI by June 2026 against only $30B in direct revenue — the most audacious infrastructure bet since late-90s fiber, with demand looking real but returns still unproven
The Information AM
ODNI vs Commerce fight over AI model pre-release evaluation authority is active — IC-led regime would function as licensing for frontier AI, Commerce-led stays voluntary; resolution expected in quarters, not years
Risky.Biz

◆ Bottom line

The take.

AI models achieved full autonomous network takeover this week while EDR agents became transparent to AI reverse engineering in days — your security architecture is fighting the wrong adversary. Simultaneously, both Anthropic and OpenAI are paying enterprises to switch providers in a subsidy window that closes in roughly 6 months (Anthropic's June 15 pricing change is the first deadline). The companies that build thin abstraction layers and negotiate from dual-vendor positions now will hold pricing leverage through 2027. The companies that locked in single-vendor and single-security-model assumptions 18 months ago are about to discover what both of those choices cost at the same time.

Frequently asked

Why does one model clearing two AISI ranges matter beyond a single benchmark result?: Because the underlying capability curve — AI cyber task completion doubling every few months — just broke above its trend line, and Mythos clearing both UK AISI simulated attack ranges (full network takeover, not just persistence) signals frontier models can now find and chain exploits in close to real time. The benchmark is a marker, not the story.
How should patch SLAs change given a 4-hour weaponization window?: Critical internet-facing assets need 72-hour patch SLAs with automated containment covering the gap. The PraisonAI disclosure-to-exploitation cycle was 4 hours, which means a 7-30 day SLA is no longer a patch window — it is an active exposure window. Weaponization used to be the slow step; procurement now is.
Should we pick between SAP's knowledge graph approach and ServiceNow's MCP approach?: Not necessarily — they represent two viable theories (data-moat integration vs. open interoperability) and both can win in different segments. The decision that matters is which vendor owns the execution layer for processes that cannot stop in your environment, and that is a per-workflow call, not a company-wide standard.
Is dual-vendor AI sourcing actually worth the engineering overhead?: For the next 6 months, yes. Both Anthropic and OpenAI are subsidizing switchers, and a thin abstraction layer costs roughly 10-15% in engineering tax while preserving full pricing leverage at renewal. After the subsidy phase ends, lock-in calcifies and sticker price becomes the price.
What is the real total cost of an enterprise AI deployment?: Roughly 3-5x the model fees once the forward-deployed engineering layer is included. Loaded FDE cost runs $300-500K each, and meaningful deployments need five to ten of them. Board decks typically show only the model spend, which is why ServiceNow blew its full-year Anthropic budget by May and why cost governance tooling is becoming a category.

◆ Same day, different angle

Read this day as…

◆ Recent in leader