Leader daily

Edition 2026-05-23 · read as Leader

AIReverseEngineeringJustBrokeYourEDR'sDefensiveMoat

Sources
36
Words
1,786
Read
9min

Topics Agentic AI AI Capital AI Regulation

◆ The signal

Your EDR's defensive moat evaporated this week. AI-assisted reverse engineering made all five tested commercial endpoint products architecturally transparent in days instead of weeks, CISA added AI infrastructure tools (LiteLLM, Ollama, OpenClaw) to its actively-exploited vulnerability catalog, and Anthropic's Mythos became the first model to clear both UK government simulated attack ranges. Congress is routing access through NSA, not CISA — the government has decided this is an offensive weapon first and a defensive one second. Your security operating model was built on the premise that understanding your agents would remain expensive. Commission a red team exercise using AI-assisted reversing against your own EDR this quarter, not next year.

◆ INTELLIGENCE MAP

  1. 01

    Security Operating Model Failure — Three Layers at Once

    act now

    EDR architectures transparent to AI reversing in days. AI infra tools (LiteLLM, Ollama) now in CISA KEV — actively exploited before most firms inventoried them. Mythos first model to clear both AISI attack ranges. PraisonAI exploited 4 hours after disclosure. Foxconn lost 8TB of Apple/Google/Nvidia/Intel IP. The entire stack failed simultaneously.

    4 hrs
    disclosure-to-exploit
    7
    sources
    • EDR products exposed
    • AISI ranges cleared
    • AI infra KEV entries
    • Foxconn data stolen
    • Honeypot attacks/month
    1. Old exploit dev90
    2. Current exploit dev7
    3. PraisonAI exploit0.17
  2. 02

    AI Infrastructure Capital Scale Revealed

    monitor

    Microsoft's OpenAI commitment disclosed at $100B via lawsuit. Cerebras IPO at $56B with 70% pop on $20B OpenAI contract. xAI leasing 220K GPUs (45% of Colossus) to Anthropic — financial logic now overrides competitive logic in compute. Fervo Energy IPO at $10B+ validates power as platform business. Nebius at 684% growth, 4:1 demand ratio.

    $100B
    Microsoft-OpenAI commitment
    8
    sources
    • Cerebras IPO valuation
    • Anthropic valuation
    • Anthropic ARR
    • Fervo Energy IPO
    • GPU demand ratio
    1. Microsoft→OpenAI100
    2. Anthropic ARR30
    3. Cerebras market cap56
    4. Fervo Energy IPO10
    5. Cisco AI orders9
  3. 03

    Execution Layer War — Four Platforms Move in One Week

    monitor

    SAP (€100M fund + Knowledge Graph), ServiceNow (MCP-based Action Fabric), Apple (App Store agent framework with fee gates), and Google (Gemini Intelligence on 3B+ Android devices this summer) all claimed the agent execution layer simultaneously. 59% of AI traffic is now agentic. The question is no longer 'which model' but 'which platform does the agent call through.'

    59%
    agentic token share
    7
    sources
    • Android market share
    • SAP investment fund
    • Agent bot bypass rate
    • a16z GTM value shift
    1. Agentic workloads59
    2. Conversational28
    3. Other AI calls13
  4. 04

    AI Governance Gap — From Budget Blowouts to Legal Exposure

    act now

    ServiceNow blew its full-year Anthropic budget by May — no SLAs, no telemetry, no predictable pricing from the model provider. a16z published the industry's liability blueprint while active courts could impose strict liability on developers for user misuse. Only 15% of orgs have data foundations for agentic AI. Duolingo quantified the 'slop tax' at ~20%.

    85%
    orgs unready for agents
    6
    sources
    • ServiceNow budget blown
    • Duolingo slop tax
    • Data problem org vs tool
    • FDE loaded cost
    1. Enterprise AI readiness15
  5. 05

    Workforce Architecture Fracture

    background

    VPs are voluntarily taking IC roles at AI-native startups — the economic case for coordination-layer management is inverting. 103K tech layoffs by mid-May approaching 2025's full-year total. Lovable dissolved its growth management layer and found it attracts elite talent. One operator ships in hours what cross-functional squads took weeks to produce.

    103K
    tech layoffs YTD
    4
    sources
    • 2025 full-year layoffs
    • Cloudflare cut
    • LinkedIn cut
    • HI-C high-value time
    1. H1 2024 layoffs68
    2. H1 2025 layoffs103

◆ DEEP DIVES

  1. 01

    Your Security Stack Failed on Three Axes This Week — Architecture Response Required

    The Convergence

    The frame to use this week is architecture, not budget. The security operating model built over the past decade no longer describes the threat environment it was purchased to defend against, and the gap is now visible across endpoint detection, AI routing infrastructure, and offensive tooling at the same time.

    The cost curve on reverse engineering a commercial endpoint agent has collapsed in a way the defensive stack's threat model never priced in.

    Layer 1: Endpoint Detection Becomes Transparent

    TrustedSec ran LLMs against five commercial EDR products and found the same internals across all five: YARA-style rules, behavioral logic, allowlists, prefilters, scripted engines (some readable as Lua after a single decryption pass), and local ML classifiers. Work that used to require a skilled reverser and weeks of effort now takes days with AI assistance. The category was running on security-through-obscurity, and the obscurity just left.

    Layer 2: AI Infrastructure Under Active Exploitation

    CISA added LiteLLM, Ollama, and OpenClaw to the Known Exploited Vulnerabilities catalog, which means adversaries are already targeting AI routing infrastructure in production. A Raspberry Pi honeypot dressed as an AI stack was indexed by Shodan in 3 hours and absorbed 113,000 attacks per month. The AI tooling layer went from experiment to production without the security review traditional enterprise software gets on the way in.

    Layer 3: Offensive AI Crosses the Discontinuity

    Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges, with full network takeover rather than mere persistence. Congress is holding closed-door demos and routing access through NSA rather than CISA, which signals offensive and intelligence priorities over civilian defense. Microsoft's MDASH found 16 exploitable flaws in a single Patch Tuesday using multi-model AI, and the PraisonAI framework was exploited 4 hours after disclosure.


    The Compound Effect

    Each layer compounds the others. An adversary with transparent EDR visibility, AI-speed exploit development, and access to unpatched AI infrastructure tools is operating against an attack surface measured in hours rather than months, not the one the current security posture was calibrated for. The NGINX 18-year RCE, present since 2008 and affecting nearly every modern web application, shows how long a latent defect waits for the discovery economics to reach it.

    Defenders are still patching on a monthly cycle while attackers have compressed exploit development from months to hours, which means the window between disclosure and exploitation is now shorter than most enterprise change-control processes.

    Action items

    • Commission red team exercise specifically using AI-assisted reverse engineering against your EDR within 30 days
    • Conduct emergency inventory of all AI infrastructure tools (LiteLLM, Ollama, model registries, AI gateways) by end of week
    • Rewrite patch SLAs from 30-day to 72-hour for internet-facing critical vulns by end of Q3
    • Invest in compensating controls above the endpoint — identity, network telemetry, behavioral analytics — as the new load-bearing detection layer

    Sources:Clint Gibler · The Information AM · CyberScoop · SANS AtRisk · The Hacker News · TLDR InfoSec

  2. 02

    The True Capital Scale of AI Just Became Visible — Compute Is Being Financialized

    The Numbers That Changed This Week

    The financial architecture of AI became legible this week through a court filing and a lease deal whose competitive logic does not survive scrutiny. The IPO print in between simply confirmed what the other two already implied:

    • Microsoft → OpenAI: $100B committed by June 2026 (revealed via Musk lawsuit), with $30B in direct revenue against it. OpenAI committed another $280B to Microsoft servers going forward.
    • Cerebras IPO: $56B fully diluted, priced 16% above the raised range, 70% first-day pop. Backstopped by a $20B procurement commitment from OpenAI.
    • xAI → Anthropic: 220,000 GPUs (45% of Colossus 1) leased to a competitor Musk publicly called 'misanthropic and evil.' Financial logic overwhelmed competitive logic.
    • Anthropic: $30B ARR, up from $9B roughly four months ago. Raising at $900B+ valuation.
    • Fervo Energy: $10B+ IPO, shares jumped 33%. Google contracted option for 3GW (≈60 data centers).
    When Elon Musk leases 45% of his compute to the competitor he publicly calls misanthropic and evil, financial logic has overwhelmed competitive logic. Compute now behaves like a financial instrument, not a strategic moat.

    What This Means for Vendor Strategy

    The xAI lease is the structurally significant signal of the week. It confirms that Grok never achieved meaningful traction, because lease revenue from Colossus almost certainly exceeds what Grok itself could generate from those GPUs. The population of viable frontier labs is contracting, and excess infrastructure is moving onto the lease market. That could meaningfully alter compute economics for enterprises over the next 12-18 months.

    A reasonable skeptic would file Anthropic's capacity admission as an operations footnote. The skeptic would be wrong. 80x demand growth against 10x planning means Anthropic operated at roughly 12% of required capacity for extended periods, and developers received degraded service without disclosure. The productivity gains measured during that window are understated against what adequate provisioning would have produced.

    The Concentration Paradox

    Fewer than five entities can play at the $100B level: Microsoft/OpenAI, Google, Amazon, Meta, and the Nvidia ecosystem. But within that constraint, the vendor hierarchy is demonstrably unstable. Anthropic quadrupled enterprise share while OpenAI grew 0.3%. The window for extracting favorable multi-vendor terms is open now, while both sides are still competing for lock-in.

    Action items

    • Evaluate multi-year compute commitments this quarter — model cost of 12-month lock-in versus spot pricing exposure against a 4:1 demand ratio
    • Negotiate aggressively with both Anthropic and OpenAI during this 6-month subsidy window while both compete for enterprise lock-in
    • Build a thin model abstraction layer enabling provider swap within 48 hours
    • Explore whether GPU capacity financialization creates procurement advantages via lease-market access

    Sources:The Information AM · Martin Peers · StrictlyVC · Bloomberg Technology · The Pragmatic Engineer · Katie Roof

  3. 03

    The Execution Layer War Just Started — Four Platforms Made Incompatible Claims in Seven Days

    The Collision

    The interesting question used to be which model an enterprise should standardize on. That question is settled. The question now is which platform agents call through when they act on a user's behalf, and four of the largest technology platforms answered it in the same week with architectures that do not compose.

    PlatformStrategyArchitecture Bet
    SAP€100M fund + Knowledge GraphVertical integration — own agents are contextually superior inside SAP's data universe
    ServiceNowAction Fabric via MCP serversOpen interoperability — any agent talks to ServiceNow via standard protocol
    AppleApp Store agent framework + fee gatesDistribution control — agents that spawn sub-apps require Apple approval and 30% cut
    GoogleGemini Intelligence on AndroidOS-level agent platform — 3B+ devices, ships summer 2026 on flagship hardware

    Amazon moved the same direction with Buy for Me, an agent that completes purchases on third-party sites from inside Amazon's surface. A reasonable reading is that this is a convenience feature. The more useful reading is that it is a claim on the transaction layer of the open web.

    Why This Forces a Decision

    Vercel's production telemetry shows 59% of all AI API tokens are now agentic workloads. The interface layer of software is unbundling from the application layer underneath it. The a16z thesis puts $150B of GTM value migrating from CRM systems of record to the AI orchestration layer above them.

    Agents that act across finance, HR, IT, and procurement need one authoritative place to reconcile state. Two authoritative places is zero authoritative places.

    The SAP and ServiceNow collision is the one that forces the enterprise decision. SAP's claim is strongest where the process is the transaction: order to cash, record to report. ServiceNow's claim is strongest where the process is the workflow across systems, the connective tissue that was never going to live in an ERP. Most enterprises have both shapes today. The run-both instinct will not survive contact with agents that need to commit writes.

    The Platform Tax Problem

    Apple is not entering the AI game. It is setting the rules for the surface where consumer AI interactions happen, and framing agent sub-spawning as both a safety risk and a revenue leak. For any company shipping AI agents on iOS, that is a new constraint layer on product economics, not a footnote. Google's 97%+ Android share in markets like India means Gemini Intelligence ships as a default rather than a choice. Products that are technically API-addressable but sit outside the default agent's routing table will be bypassed without anyone noticing they were bypassed.

    Action items

    • Conduct an agent-readiness audit — determine whether third-party AI agents can discover, invoke, and orchestrate your platform's workflows without a human UI
    • Decide whether SAP or ServiceNow owns your execution layer for the processes that cannot stop — this sets 3 years of licensing leverage
    • Model Apple's likely agent fee/approval structure into iOS product unit economics before WWDC
    • Evaluate MCP server capabilities as a strategic investment for your platform roadmap

    Sources:TLDR IT · Techpresso · TLDR · Simplifying AI · TLDR Design · a16z

  4. 04

    Enterprise AI Spending Is Running Without Guardrails — And the Liability Clock Is Ticking

    The Budget Problem

    ServiceNow's CDIO disclosed that the company blew its full-year Anthropic budget by May. The cause is structural rather than accidental. Anthropic offers no SLAs, no usage telemetry, and no predictable pricing to enterprise customers, which is a deliberate choice: they are optimizing for model capability over enterprise readiness. ServiceNow's response was to build AI Control Tower internally and then sell it to other enterprises. That is not partnership. That is the market routing around a vendor deficiency.

    The problem generalizes. Every major AI vendor — Google, OpenAI, Anthropic — now concedes they cannot deploy without expensive forward-deployed engineering teams at $300-500K loaded cost per FDE. If meaningful deployment takes 5-10 of them, the true cost of an AI program is 3-5x the model fees. Most board-approved investment envelopes do not reflect that math.

    The Data Foundation Gap

    Only 15% of organizations have adequate data foundations for agentic AI. The remaining 85% are spending millions to tens of millions building agents on data that was never governed for autonomous decision-making. A practitioner survey of 334 data professionals found that 95.2% cite organizational problems — training, ownership, requirements clarity — versus 4.8% citing tooling gaps. This is not a category of problem money solves.

    A dashboard tolerates a dirty field. A human analyst routes around it. An agent executing a chain of decisions acts on it, then acts on what it just did, and the error compounds through the chain until someone notices the invoice went to the wrong entity.

    The Liability Framework Forming Now

    a16z published what is, on honest reading, the most comprehensive lobbying blueprint the AI industry has produced on liability. The core proposal is user-liability defaults with damages caps. A reasonable skeptic would point out that lobbying documents rarely become law on the lobbyist's terms. The reasonable skeptic is correct, but is answering the wrong question. Courts are already deciding cases that could impose strict liability on developers for downstream user misuse before any legislative framework exists. Precedent-setting rulings will arrive before comprehensive federal law, and a patchwork of judicial standards will set the ceiling the legislation eventually negotiates against.

    If developer liability becomes the default, open-sourcing a model becomes an uninsurable risk. Product strategies that quietly assume continued access to open weights are carrying an unpriced dependency on regulatory outcomes the P&L does not show.

    The Duolingo Warning

    Duolingo quantified what others will not. At scale, AI-generated output runs ~20% unusable and requires human QC. Their blanket AI mandate produced performative adoption: the metrics climbed while the underlying work did not. The mandate was walked back.

    Action items

    • Conduct immediate audit of all AI model consumption spend vs. budget with per-team and per-use-case attribution
    • Renegotiate AI vendor contracts to include SLAs, committed pricing tiers, and usage telemetry requirements before next renewal
    • Commission agentic AI readiness audit focused on data quality, lineage, and governance maturity across top 3 AI investment areas
    • Begin building audit-ready AI governance infrastructure (model cards, safety docs, incident protocols) that would satisfy proposed safe harbor requirements

    Sources:Laura Bratton · a16z AI Policy Brief · TLDR Data · TLDR Marketing · TLDR Dev · Brian Ardinger

◆ QUICK HITS

  • Update: Cerebras IPO closed at $56B (16% above raised range, 70% first-day pop) backed by $20B OpenAI contract — the AI compute market is now being allocated through bilateral commitments large enough to underwrite public offerings

    Katie Roof

  • xAI leasing 45% of Colossus cluster (220K GPUs) to Anthropic signals Grok has effectively conceded the frontier race — and secondary compute markets are forming for enterprise buyers

    The Pragmatic Engineer

  • Abridge raised at $5.3B valuation with 80-100M+ medical conversations — the 'intelligence layer above the EHR' pattern is replicable in any regulated vertical with an entrenched system of record

    Latent.Space

  • AI liability regime being drafted in three jurisdictions simultaneously — a16z's $115.5M political spend makes them largest 2026 midterm donor, explicitly targeting AI and crypto regulation

    a16z AI Policy Brief

  • Google Gemini Intelligence ships on Android this summer across 3B+ devices — converting 97%+ market share into an OS-level agent platform that routes around apps the agent doesn't call

    Simplifying AI

  • Anthropic's June 15 pricing change caps third-party tool access at plan-value credits then bills API rates — every Cursor/Zed user faces a material cost increase, and OpenAI offered 2 months free Codex to switchers

    AINews

  • Microsoft shopping for startup acquisitions as hedge against OpenAI — Nadella expressed explicit fear of OpenAI 'supplanting' Microsoft, signaling the partnership may fracture within 12-18 months

    The Download from MIT Technology Review

  • Lovable dissolved its growth management layer in December 2025, replaced it with autonomous parallel contributors, and is expanding the model 5 months later — elite VPs are choosing IC craft over coordination authority

    Lenny's Newsletter

◆ Bottom line

The take.

The AI security operating model, the AI vendor hierarchy, and the AI execution layer ownership question all broke open in the same week. EDR architectures are now transparent to AI-assisted reversing, Microsoft's $100B OpenAI commitment establishes the cost floor of frontier participation at a level only 4-5 entities can sustain, and four major platforms (SAP, ServiceNow, Apple, Google) made incompatible claims on where agents live — while 85% of enterprises lack the data foundations to make any of it work. The decisions being made this quarter about security architecture, vendor diversification, and platform positioning will compound over the next two years in ways that cannot be reversed at renewal time.

— Promit, reading as Leader ·

Frequently asked

Why should I red-team my own EDR with AI-assisted reverse engineering this quarter?
Because the obscurity your EDR relied on has collapsed. TrustedSec demonstrated that LLMs can reverse five major commercial endpoint products in days rather than weeks, exposing YARA rules, behavioral logic, allowlists, and local ML classifiers. If an attacker can read your detection logic in days, you need to know what they see before they prove it on your network — and compensating controls in identity, network telemetry, and behavioral analytics need to become the new load-bearing detection layer.
What does CISA listing LiteLLM, Ollama, and OpenClaw in its KEV catalog actually mean for me?
It means AI infrastructure tools your teams adopted without standard security review are being actively exploited in production right now. A honeypot dressed as an AI stack absorbed 113,000 attacks per month after Shodan indexed it in three hours, and PraisonAI was exploited four hours after disclosure. You need an emergency inventory of every AI gateway, model registry, and routing tool in your environment by end of week, plus a patch SLA rewritten from 30 days to 72 hours for internet-facing critical vulns.
Why is Congress routing offensive AI access through NSA instead of CISA significant?
It signals the U.S. government has classified frontier offensive AI as an intelligence and military capability first, and a civilian defense tool second. Anthropic's Mythos cleared both UK AISI attack ranges with full network takeover, and access is now being demoed behind closed doors via NSA channels. The defensive sharing and advisories enterprises rely on from CISA will lag the actual capability curve, which means your threat model can no longer assume government-led defense will warn you in time.
How should I think about AI vendor budgets when ServiceNow blew its full-year Anthropic spend by May?
Treat current AI vendor relationships as ungoverned consumption until proven otherwise. Anthropic and peers offer no SLAs, no usage telemetry, and no predictable pricing, and meaningful deployment also requires forward-deployed engineering teams at $300–500K loaded cost each, pushing true program cost to 3–5x model fees. Audit consumption with per-team and per-use-case attribution now, and force SLAs, committed pricing tiers, and telemetry into the next renewal.
With SAP, ServiceNow, Apple, and Google making incompatible agent platform claims, how do I avoid picking wrong?
You don't avoid picking — you pick deliberately for the processes that cannot stop. SAP's claim is strongest where the process is the transaction (order to cash, record to report); ServiceNow's is strongest where the process is workflow across systems. Two authoritative places to reconcile agent state is zero authoritative places, so decide which platform owns your execution layer this quarter, and make every product MCP-addressable so third-party agents can discover and invoke your workflows without a human UI.

◆ Same day, different angle

Read this day as…

◆ Recent in leader

Keep reading.