Edition 2026-05-16 · read as Leader
AnthropicMythosBreakstheCost-of-AttackAssumption
- Sources
- 36
- Words
- 1,504
- Read
- 8min
Topics Agentic AI LLM Inference AI Capital
◆ The signal
Anthropic's Mythos became the first model to achieve full autonomous network takeover — not persistence, full compromise — while separate research confirmed all five tested commercial EDR products can be reverse-engineered in days using AI. OpenAI simultaneously launched Daybreak with eight major security vendors. Your defensive stack's implicit assumption — that attacking it costs more than it's worth — expired this week across three independent vectors at once. Compensating controls in identity, behavioral analytics, and network telemetry are now load-bearing, not supplementary.
◆ INTELLIGENCE MAP
01 Security Operating Model Collapses: Full Autonomous Takeover + EDR Transparency
act nowMythos cleared both UK AISI attack ranges (GPT-5.5 cleared one). AI reverses all 5 tested EDRs in days vs weeks. PraisonAI exploited within 4 hours of disclosure. NSA, not CISA, gets Mythos access — offense over defense. OpenAI's Daybreak with 8 vendors opens AI security platform war.
- AISI ranges cleared
- EDR reversal time
- Mozilla AI bugs found
- Daybreak launch partners
- Monthly AI infra attacks
02 Agent Execution Layer War: SAP, ServiceNow, Apple, and Notion Race to Own the Surface
monitor59% of all AI token volume is now agentic workloads. SAP bet €100M on Knowledge Graph vertical integration. ServiceNow adopted MCP for headless Action Fabric. Apple is gating agent distribution on 3B+ devices this summer. Notion launched a developer platform positioning as agent-hosting infrastructure. The layer agents route through becomes the next system of record.
- Agentic token share
- SAP agent fund
- Android market share
- Lemkin: seats down
- Lemkin: spend up
03 Enterprise AI Governance Vacuum: Budgets Blown, Foundations Missing
act nowServiceNow blew its full-year Anthropic budget by May. Anthropic grew 80x against 10x planning — operating at ~12% of required capacity. 85% of organizations spending millions on agentic AI lack adequate data foundations. Every major AI vendor now converging on forward-deployed engineer model at $300-500K loaded cost each, making true AI deployment 3-5x model fees.
- Anthropic demand vs plan
- Budget blown by
- FDE loaded cost
- Data pain: org vs tool
- Anthropic ARR
04 AI Liability Architecture Being Written — Courts Moving Faster Than Congress
monitora16z published a comprehensive lobbying blueprint advocating user-liability defaults and damages caps, spending $115.5M on 2026 midterms. Active litigation could impose penalties on developers for downstream misuse before any framework exists. ODNI and Commerce fighting over who evaluates models — intelligence community wants pre-release gating. Developer-liability regimes threaten open-source model availability.
- a16z midterm spend
- Clarity Act odds
- Window to influence
- Competing frameworks
- Clarity Act passage odds55
05 Workforce Compression Wave Hits Infrastructure Companies
background103K tech layoffs by mid-May approaching 2025's full-year 124K. Cloudflare cut 20% citing 'agentic AI era.' LinkedIn cut 5% explicitly for AI reshaping. Lovable dissolved growth management, replaced with autonomous HI-C operators — attracting VP-level talent who prefer craft over coordination. Cisco stock up 15% on same day as 4,000 cuts.
- 2025 cuts YTD
- 2025 full-year prior
- Cloudflare cut
- Cisco AI orders
- LinkedIn cut
- 2025 full year124
- YTD (mid-May)103
◆ DEEP DIVES
01 Your Security Stack Just Became Transparent — Three Vectors, One Quarter to Respond
The Capability Discontinuity
The honest read on this week's results is that the curve broke upward, not that it continued. Anthropic's Mythos became the first model to clear both of the UK AI Security Institute's simulated attack ranges — full autonomous network takeover, not persistence. OpenAI's GPT-5.5 cleared one. Both are outperforming a curve that already doubled AI cyber task completion every few months. The researcher consensus, confirmed across multiple intelligence sources, is that models now find and chain exploits in something close to real time.
The security posture assumptions written into the last board pack were drafted against a threat model that no longer describes the ground. Rewriting them now is cheaper than defending them later.
Three Independent Failures
Vector 1: EDR Architecture. TrustedSec ran LLMs against five commercial EDR products and found the same internals in all five — YARA-style rules, behavioral logic, allowlists, Lua scripted engines readable after a single decryption pass. Work that took a skilled reverser weeks now takes days. The category ran on security-through-obscurity. The obscurity left.
Vector 2: Exploit Velocity. PraisonAI was actively targeted within 4 hours of disclosure. Microsoft's MDASH found 16 exploitable flaws in a single Patch Tuesday using multi-model AI analysis. Patch SLAs written for 30-day windows are now being measured against 4-hour weaponization. A honeypot dressed as an AI stack was indexed by Shodan in 3 hours and absorbed 113,000+ attacks per month.
Vector 3: Platform Restructuring. OpenAI launched Daybreak with CrowdStrike, Palo Alto Networks, Cisco, Cloudflare, Oracle, Zscaler, Akamai, and Fortinet. The board-deck version is that this is a partnership. The complete version is that within 3-5 years, today's security vendors risk becoming feature providers on OpenAI's platform.
The Defender's Dilemma
The Foxconn breach, with 8TB exfiltrated from a single contract manufacturer holding Apple, Google, Intel, and Nvidia designs, settles the question of whether this is theoretical. The AI infrastructure layer — LiteLLM, Ollama, OpenClaw — already carries 5 KEV entries and was adopted faster than security review could keep pace. An 18-year-old RCE in NGINX's rewrite module confirms that foundational infrastructure auditing has systematic gaps.
Mozilla found 271 real bugs in Firefox using custom Claude harnesses. The same model scanning curl produced 1 low-severity CVE. The variable is the harness, not the model. Organizations building target-specific AI scanning infrastructure get real outcomes. Those buying generic AI scanning get slide decks.
The NSA Signal
Congress is holding closed-door Mythos demos, and access routes through NSA, not CISA. The government is prioritizing offensive and intelligence operations over civilian defense, which means the private sector is on its own for several years. The same hearings mark the leading edge of a multi-year federal buying cycle.
Action items
- Commission a red team exercise specifically targeting your EDR with AI-assisted reverse engineering — scope the actual detection gap within 60 days
- Compress critical vulnerability patch SLAs from 30+ days to 72 hours for internet-facing assets
- Build custom AI vulnerability scanning harnesses for your 3 most critical codebases by end of Q3, following Mozilla's pattern
- Map your strategic position relative to Daybreak — determine whether OpenAI becomes your security vendor or your security vendor's vendor
- Inventory all AI infrastructure tooling (LiteLLM, Ollama, model registries) adopted without security review — bring under standard governance immediately
Sources:Clint Gibler · The Information AM · CyberScoop · AINews · SANS AtRisk · The Hacker News
02 The Agent Execution Layer Is Being Claimed — Positioning Decisions Have an 18-Month Window
The Collision
SAP and ServiceNow stopped talking past each other this week. Both are explicitly pitching themselves as the execution layer — the surface where AI agents touch systems of record and actually do things. SAP bet €100M and a Knowledge Graph on vertical integration. ServiceNow adopted MCP servers as the headless communication standard for Action Fabric. These are incompatible theories of how the agent economy organizes: open interoperability vs. data-moat integration.
Agents that act across finance, HR, IT, and procurement need one authoritative place to reconcile state. Two authoritative places is zero authoritative places.
The Data That Settles The Debate
Vercel's AI Gateway production index confirms that 59% of all token volume is now agentic workloads. More than half of production AI is agents taking actions, not humans having conversations. A product strategy still built around 'add a chatbot' is optimizing for the minority case.
Anthropic captures 61% of spend (expensive Opus reasoning). Google captures 38% of volume (cheap Flash throughput). That bifurcation is structural. Model selection is now a routing optimization, not a strategic choice. The strategic choice moved one layer up — to who owns the orchestration surface.
Platform Moves This Week
Company Move Bet ServiceNow MCP-based Action Fabric Open interop wins SAP Knowledge Graph + €100M fund Data moat wins Apple Agent App Store gating Distribution control Google Gemini Intelligence on Android OS-level agent layer Notion Developer platform for agents Workspace as host Intercom Rebrand to 'Fin' Agent IS the company Apple's Constraint Layer
Apple is inserting itself at the agent layer on 3 billion+ devices this summer. The framing tells you everything: Apple is addressing agents that 'spin up smaller apps on the spot after approval.' That language treats agent sub-spawning as both a safety risk and a revenue leak. For any product shipping consumer-facing agents on iOS, this is a new constraint that needs pricing into unit economics before WWDC turns it into a fait accompli.
The Value Migration
The a16z thesis quantifies it: $150B of GTM value migrating from CRM to orchestration layer. Lemkin's working data: 80% fewer human seats, 83% higher total spend, 20+ agents running. The CRM stops being where work happens and becomes where work is recorded. Switching costs migrate from data lock-in to workflow/reasoning lock-in — which is stickier because institutional context is prohibitively expensive to rebuild.
Action items
- Conduct an 'agent readiness' audit of your platform — can third-party agents discover, invoke, and orchestrate your workflows without a human UI? Report findings by end of Q3
- Evaluate MCP as a strategic investment for your platform roadmap — build or integrate MCP server capabilities within 90 days
- Model per-action/per-outcome pricing scenarios and pilot with 3-5 customers this quarter
- Audit your iOS agent roadmap against Apple's likely fee/approval structure — model into unit economics before WWDC
Sources:TLDR IT · a16z · Simplifying AI · TLDR · ben's bites · Techpresso
03 Enterprise AI Economics Are Structurally Broken — The Governance Gap Is Now the Strategy Gap
The Budget Problem Is Structural, Not Cyclical
ServiceNow burned through its full-year Anthropic budget by May, five months into a twelve-month plan. That is not one buyer's miscalculation. Anthropic does not offer SLAs, does not provide usage telemetry, and had nothing to say when an enterprise customer publicly described the blowout. A company valued at hundreds of billions is deliberately optimizing for capability over enterprise readiness, and the customers are funding that choice.
The capacity math explains the experience. Anthropic grew 80x against a planned 10x, which means operating at roughly 12% of required capacity for extended stretches. Developers in that window were getting degraded service, rate-limited output, and probably lower-quality responses without disclosure. Productivity gains measured in that period understate what adequate provisioning would deliver.
We are in the equivalent of cloud computing circa 2014: powerful capabilities, wildly unpredictable economics, and a governance vacuum that creates real financial exposure.
The Hidden Cost Multiplier
Every major AI provider has now converged on Palantir's forward-deployed-engineer model. Google is hiring hundreds of FDEs. OpenAI acquired a 150-person consulting firm. ServiceNow and Salesforce are building FDE teams. The market has collectively admitted that AI deployment is a human-intensive problem and stopped pretending otherwise.
At $300-500K loaded cost per FDE, and 5-10 needed for meaningful deployment, the true cost of an AI program lands at 3-5x the model fees. Boards approving AI envelopes off token costs are approving a fraction of the actual spend.
The Data Foundation Gap
The board-deck version is that 85% of organizations are spending millions on agentic AI without adequate data foundations. The complete version is more useful. In the PDC survey of 334 practitioners asked what they need most, 4.8% said better tools. The remaining 95.2% asked for training, clearer requirements, more time, and dedicated ownership. This is an organizational problem being treated as a technology purchase.
Netflix and Meta independently converged on identity-based, team-owned data governance, replacing brittle ACLs and human-owned identities with durable app identities. That is the prerequisite for letting an AI agent read and write data without tying permissions to a human's next job change. Most organizations have not started that migration.
The FOMO Dynamic
A reasonable skeptic would point out that companies always overspend when they fear a competitor will figure out the economics first, and that this looks like classic bubble psychology. The skeptic is correct, with one saving grace: AI spend is uniquely reversible. Unlike a cloud migration or an ERP implementation, token consumption can be cut to zero overnight. The enterprise AI revenue base carries a fragility that model-company valuations do not price in. For buyers, that optionality is an asset, provided the workflows underneath can still function without the model.
The xAI Signal
Elon Musk agreed to lease 220,000 GPUs (45% of Colossus) to Anthropic, a company he has publicly called 'misanthropic and evil.' When financial logic overwhelms competitive logic at that scale, GPU supply has become a financial instrument first and a strategic moat second. Excess infrastructure is moving onto the lease market, and that should reshape compute economics for enterprises over the next 12-18 months.
Action items
- Conduct an immediate audit of all AI model consumption spend vs. budget with per-team and per-use-case attribution — deliver findings to CFO within 30 days
- Renegotiate AI vendor contracts to include SLAs, committed pricing tiers, and usage telemetry requirements before next renewal
- Commission an agentic AI readiness audit focused on data quality, lineage, and governance maturity — restructure data ownership with dedicated modeling roles
- Model true AI deployment cost at 3-5x model fees and present revised investment envelope to board before Q3 budget planning
Sources:Laura Bratton · The Pragmatic Engineer · TLDR AI · TLDR Data · StrictlyVC · Martin Peers
◆ QUICK HITS
Update: Anthropic ARR hit $30B (from $9B in ~4 months) while raising at $900B+ valuation — the displacement is accelerating, not stabilizing
StrictlyVC
Update: xAI leased 45% of Colossus (220K GPUs) to Anthropic — compute financialization means the GPU lease market could meaningfully alter enterprise pricing within 12-18 months
The Pragmatic Engineer
Fervo Energy IPO at $10B+ with 33% first-day pop — Google holds option for 3GW (60+ data centers) from single supplier, validating power as the binding AI infrastructure constraint
The Information AM
a16z published comprehensive AI liability lobbying blueprint advocating user-liability defaults and damages caps — open-source model availability directly threatened by developer-liability regimes
a16z AI Policy Brief
Duolingo CEO publicly admits blanket AI mandate failed — quantified at ~20% 'slop tax' on AI-generated content requiring human QC, validating performative adoption concerns
TLDR Marketing
Abridge raised at $5.3B valuation on 80-100M+ medical conversations — rebranded from 'ambient scribe' to 'clinical intelligence layer,' compressing prior auth from 45 days to minutes
Latent.Space
ODNI vs Commerce fight for AI model evaluation authority — intelligence community proposal would function as licensing regime for frontier AI, extending release timelines by months
Risky.Biz
Amazon killed Rufus standalone to embed AI into Alexa shopping — 'Buy for Me' completes purchases on third-party sites from Amazon's surface, claiming the transaction layer of the open web
TLDR Design
◆ Bottom line
The take.
The security operating model, the enterprise software stack, and AI cost governance all broke this week from multiple directions simultaneously. Anthropic's Mythos achieved full autonomous network takeover while every tested commercial EDR became transparent to AI-assisted reverse engineering. Meanwhile, 59% of AI traffic is now agentic — and SAP, ServiceNow, Apple, and Google are all racing to own the execution layer your agents route through. ServiceNow blowing its full-year Anthropic budget by May while 85% of organizations lack data foundations for the agents they're deploying means the gap between AI ambition and AI governance is now a first-order financial risk. The decisions being made this quarter about where detection lives, which execution layer to commit to, and how to govern AI spend will define competitive position for the next two years.
Frequently asked
- What should leaders do first given the new AI-driven attack capabilities?
- Compress patch SLAs for internet-facing assets from 30 days to 72 hours and commission an AI-assisted red team specifically targeting your EDR within 60 days. The combination of 4-hour weaponization windows and reverse-engineerable EDR architectures means existing assumptions about defender economics no longer hold. Identity, behavioral analytics, and network telemetry should be treated as load-bearing controls rather than supplementary.
- Why did ServiceNow exhaust its full-year Anthropic budget by May?
- Anthropic grew 80x against a planned 10x and is operating at roughly 12% of required capacity, with no SLAs, no usage telemetry, and a deliberate prioritization of capability over enterprise readiness. Customers funding that choice get degraded service and unpredictable consumption curves. The fix is contractual: demand SLAs, committed pricing tiers, and per-team attribution before renewal.
- What is the real cost of deploying AI in the enterprise?
- Roughly 3-5x the model fees once forward-deployed engineers are factored in. Google, OpenAI, ServiceNow, and Salesforce have all converged on the Palantir FDE model at $300-500K loaded cost per engineer, with 5-10 needed for meaningful deployment. Boards approving AI envelopes based on token costs are approving a fraction of actual spend.
- How should platform companies respond to the SAP vs. ServiceNow execution-layer fight?
- Audit whether third-party agents can discover, invoke, and orchestrate your workflows without a human UI, and treat MCP server compatibility as mandatory rather than optional. With 59% of token volume now agentic and $150B of GTM value migrating from CRM to orchestration, being bypassed by agents is a worse outcome than being disrupted because it leaves no seat at the table.
- Is the enterprise AI spending surge a bubble?
- It has bubble characteristics, but with one structural difference: AI spend is uniquely reversible. Unlike cloud migrations or ERP implementations, token consumption can be cut to zero overnight, which makes the enterprise revenue base far more fragile than model-company valuations imply. For buyers, that optionality is an asset provided the underlying workflows can still function without the model.
◆ Same day, different angle
Read this day as…
◆ Recent in leader
Keep reading.
- Princeton's ICML 2026 paper finds that GPT 5.5, Gemini 3.1 Pro, and Claude Opus 4.7 are no more reliable on agent tasks than their predecess…
- GitHub disclosed 17 million agent-authored pull requests in a single month while Anthropic confirmed Claude writes 90%+ of its own code — an…
- Anthropic's Mythos cleared both UK AISI simulated attack ranges this week, a first, while TrustedSec demonstrated that all five major commer…
- Your EDR became structurally transparent this week.
- Anthropic's Mythos became the first AI model to fully take over both UK AISI attack ranges autonomously, and a parallel study showed AI reve…