Leader daily

Edition 2026-05-17 · read as Leader

EDRStackGoesTransparentasLLMsCollapseExploitTimelines

Sources
36
Words
1,825
Read
9min

Topics Agentic AI AI Regulation AI Capital

◆ The signal

Your endpoint detection stack is now transparent to AI. TrustedSec demonstrated that all five major commercial EDR products share identical architectural patterns — YARA rules, Lua scripting engines, local ML classifiers — and can be fully reverse-engineered by LLMs in days instead of weeks. In the same window, Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges (full network takeover), and exploit weaponization timelines collapsed to 4 hours. The security model was built on adversaries needing months. They now need an afternoon.

◆ INTELLIGENCE MAP

  1. 01

    Security Architecture Discontinuity: Endpoint to Network

    act now

    EDR reverse-engineering went from weeks to days via AI. Mythos cleared both AISI attack ranges (first ever). PraisonAI was weaponized in 4 hours. AI infra tools (LiteLLM, Ollama) now on CISA KEV. The entire defensive posture — from endpoint to patch cadence to infrastructure — was calibrated for an adversary that no longer exists.

    4 hrs
    exploit weaponization window
    7
    sources
    • EDR reverse time
    • AISI ranges cleared
    • AI honeypot attacks/mo
    • MDASH flaws/cycle
    1. Traditional reverser90
    2. AI-assisted (now)3
    3. Weaponization window0.17
  2. 02

    Agent Execution Layer War: Who Mediates the Machine

    monitor

    SAP (€100M fund + Knowledge Graph), ServiceNow (headless Action Fabric via MCP), Apple (App Store agent gating), Google (Gemini Intelligence on 3B+ devices), and Amazon (Buy for Me cross-retailer agent) all moved in the same window to own where AI agents execute. The decision of which platform your agents route through sets licensing leverage for the next 3 years.

    59%
    token volume from agents
    8
    sources
    • Android market share
    • Agentic token share
    • Bot bypass rate
    • SAP fund
    1. 01Google (Android/Gemini)3B+ devices
    2. 02ServiceNow (MCP)IT/HR/CS workflows
    3. 03SAP (Knowledge Graph)ERP transactions
    4. 04Apple (App Store)iOS gating
    5. 05Amazon (Buy for Me)Commerce agent
  3. 03

    AI Liability: Courts Deciding Before Congress Acts

    monitor

    a16z published the industry's most comprehensive liability blueprint (user-liability defaults, damages caps) while courts are actively imposing penalties on AI developers for downstream misuse. ODNI vs Commerce fight over model evaluation will resolve in quarters. If developer-liability wins, open-source AI becomes uninsurable. The framework locks in this year.

    $115M
    a16z political spend
    5
    sources
    • a16z 2026 midterms
    • Clarity Act odds
    • Regime decision
    • Competing frameworks
    1. Commerce-led (voluntary)45
    2. IC-led (pre-release eval)55
  4. 04

    Enterprise AI True Cost: 3-5x the Model Fee

    monitor

    ServiceNow blew its full-year Anthropic budget by May. Google, OpenAI, and Anthropic all now require expensive human FDE layers ($300-500K loaded per engineer, 5-10 needed per deployment). Only 15% of organizations have data foundations for agentic AI. True deployment cost is 3-5x model fees — and AI spend is uniquely reversible, creating ecosystem fragility.

    85%
    orgs unprepared for agents
    5
    sources
    • FDE loaded cost
    • FDEs per deployment
    • Data readiness
    • Org vs tool problem
    1. Model/API fees25
    2. FDE deployment layer40
    3. Data governance gap20
    4. Integration/tooling15
  5. 05

    Org Design Inflection: The Coordination Layer Dissolves

    background

    AI-native startups are hiring VPs into IC roles — and VPs are saying yes. Lovable dissolved its growth management layer, replaced it with autonomous parallel contributors, and found it attracts elite talent. One operator ships in hours what a squad shipped in weeks. The economic case for coordination-only management is collapsing.

    90%
    time on high-value work
    4
    sources
    • VPs choosing IC
    • HI-C time on value
    • Model maturity
    • Tech layoffs 2025
    1. Traditional squad (15 people)1
    2. HI-C operator (1 person)3

◆ DEEP DIVES

  1. 01

    Your Security Perimeter Just Became Transparent — The 12-Month Rebuild Starts Now

    The Collapse Is Measured, Not Hypothetical

    TrustedSec pointed LLMs at five commercial EDR products and found the same architecture under each one: YARA-style rules, behavioral logic, allowlists, Lua scripting engines that decrypt in a single pass, and local ML classifiers. Work that used to require a skilled reverse engineer for weeks now takes days with AI assistance. The reasonable skeptic will note that endpoint detection was never supposed to be the last line. The reasonable skeptic is correct. What the skeptic does not explain is why the entire category was priced and deployed as if obscurity were a control, because the obscurity just evaporated for an order of magnitude more adversaries.

    The UK AI Security Institute confirmed that Anthropic's Mythos cleared both simulated attack ranges, the first model to achieve full autonomous network takeover. OpenAI's GPT-5.5-cyber cleared one. These are not benchmarks. They are operational demonstrations that a model can now find, chain, and exploit end-to-end.


    The Timeline Has Compressed Beyond Recovery

    PraisonAI was exploited in the wild 4 hours after disclosure. Microsoft's MDASH found 16 exploitable flaws in a single Patch Tuesday using multi-model analysis. Mozilla found 271 real bugs in Firefox 150 using Claude Mythos with a custom harness, including sandbox escapes and use-after-free vulnerabilities that fuzzers had missed.

    A patch window measured in months because procurement needs months is a window calibrated for a world where weaponization was the slow step. Weaponization is no longer the slow step.

    The AI infrastructure layer is now actively targeted. LiteLLM was added to CISA's Known Exploited Vulnerabilities catalog. A single Raspberry Pi honeypot dressed as an AI stack was indexed by Shodan in 3 hours and absorbed 113,000+ requests in one month, with 175 active hijacking attempts in the final week. 23% of the traffic targeted AI-specific endpoints. The toolchain evolved mid-experiment to detect and evade the honeypot.


    Where Detection Must Move

    The compensating controls that will matter over the next 18 months are identity, network telemetry, and behavioral analytics above the endpoint. Two universal Linux LPEs, Dirty Frag and Copy Fail, compound the problem. Copy Fail modifies in-memory file copies without touching disk, which makes it invisible to every file integrity monitoring product currently deployed. It has affected every major Linux distribution since 2017.

    The board-deck reading is that the government will catch up. The complete reading is that Congress is routing Mythos access through NSA rather than CISA, which says offensive and intelligence use is the priority and civilian defense is not. The private sector is on its own for several years.

    Action items

    • Commission AI-assisted red team exercise specifically targeting your EDR's detection logic within 30 days
    • Reduce critical vulnerability patch SLA to 8 hours for internet-facing assets by end of Q3
    • Audit all AI infrastructure tooling (LiteLLM, Ollama, model registries) for security governance gaps — many entered production without security review
    • Shift detection investment from endpoint to identity/network behavioral analytics over next 2 quarters

    Sources:Clint Gibler · The Information AM · CyberScoop · The Hacker News · SANS AtRisk · TLDR InfoSec

  2. 02

    Five Platforms Moved This Week to Own Where Agents Execute — Your Architecture Decision Is This Quarter

    The Execution Layer Is the New Control Point

    SAP, ServiceNow, Apple, Google, and Amazon all moved in the same window to claim the surface where AI agents actually commit writes to systems of record. The temptation is to read this as five product launches that happened to overlap. The more useful reading is that five companies with very different distribution models all concluded, independently, that the execution layer is the architectural decision enterprises cannot defer.

    The positions are now explicit:

    PlatformStrategyMoat Thesis
    ServiceNowHeadless Action Fabric via MCPOpen interoperability across IT/HR/CS
    SAP€100M fund + Knowledge GraphVertically integrated data-moat in ERP
    GoogleGemini Intelligence on Android (3B+ devices)OS-level default on consumer hardware
    AppleApp Store agent gating + reviewDistribution control + 30% fee structure
    AmazonBuy for Me cross-retailer agentCommerce relationship ownership

    Why "Run Both" No Longer Works

    A reasonable skeptic would point out that enterprises run multiple systems of record today and have for decades. The skeptic is correct about the past. The skeptic is wrong about agents. Agents acting across finance, HR, IT, and procurement need one authoritative place to reconcile state. Two authoritative places is zero authoritative places. ServiceNow adopting MCP as the communication standard is the most consequential signal in the set, because a company with workflow gravity across IT, HR, and customer service declaring that agents talk to it via MCP pulls the ecosystem toward that protocol whether or not the rest of the ecosystem agrees.

    SAP is playing the other game. The Knowledge Graph is a bet that vertically integrated context makes SAP's own agents contextually superior inside SAP's data universe. These are two competing theories of how the agent economy organizes. Open interoperability is one. Data-moat integration is the other. They do not converge.

    Being bypassed is not the same as being disrupted. Disruption leaves a seat at the table. Bypass does not.

    Vercel's production data names the clock: 59% of all AI API tokens are now agentic workloads. More than half of production AI usage is agents taking actions, not humans having conversations. Any platform whose roadmap still assumes a human-in-the-UI has a 12-18 month window before agent orchestrators route around it rather than through it.


    The Security Dimension Is Already Failed

    The number that should be on every CISO's first slide is 81% of AI agents successfully bypass legacy bot detection. Every WAF, CAPTCHA, and rate-limiting system built to flag automated access by behavioral pattern is now ineffective against LLM-driven agents that mimic human interaction. The board-deck version is that the security stack needs an upgrade. The complete version is that an architecture designed for human adversaries is the wrong architecture for the agent era, and upgrading it does not change what it was designed to detect.

    Action items

    • Conduct an 'agent readiness' audit of your platform — determine if third-party agents can discover, invoke, and orchestrate your workflows without a human UI by end of Q3
    • Evaluate MCP server capability as a strategic build-or-integrate decision before Q3 budgeting
    • Model per-action/per-outcome pricing scenarios for your revenue if agents replace human seat consumption
    • Reassess security stack against AI agent adversaries — specifically test bot detection against LLM-driven agents

    Sources:TLDR IT · Simplifying AI · TLDR · Techpresso · TLDR Design · a16z

  3. 03

    AI Liability Is Being Decided in Courtrooms This Quarter — Your Posture Determines Your Options

    Three Jurisdictions, Same Window, Divergent Outcomes

    a16z has published the most comprehensive AI liability blueprint the industry has produced: user-liability defaults, damages caps, and federal preemption of the state patchwork. At the same time, courts are actively deciding cases that could impose substantial penalties on general-purpose AI developers for downstream misuse. The sequence is the point. Precedent-setting rulings will arrive before any comprehensive federal framework, producing judicial standards that subsequent legislation has to work around rather than replace.

    Inside the Trump administration, ODNI and Commerce are fighting over who evaluates frontier models. CAISI published voluntary testing agreements with Google, Microsoft, and xAI, then retracted them inside the same week. The outcome decides whether AI operates under an intelligence-community-led pre-release evaluation regime or a Commerce-led voluntary one.

    The companies filing AI liability under 'future problem' are the ones that will find out, in a courtroom, that it was this quarter's.

    The Open-Source Kill Switch

    If developer liability for downstream use becomes the standard, the economic logic of releasing an open-source model stops working entirely. No rational actor open-sources a model that generates unbounded liability for every downstream application. The supply chain restructures toward proprietary foundation models and a handful of providers. Product strategies that quietly assume continued access to open weights — which is most of them — carry an unpriced dependency on regulatory outcomes that does not appear on the P&L.


    The 'Liability Cartel' Dynamic

    A reasonable skeptic would point out that liability rules are a poor proxy for competitive dynamics. The skeptic is half right. Excessive AI liability functions as a moat for incumbents with thousand-lawyer teams, and a16z names this explicitly. A strict-liability regime with developer-side presumption changes the unit economics of every foundation model provider. A negligence regime with deployer-side presumption changes the unit economics of every company using one. Deep pockets prefer strict liability for the same reason they prefer any rule that prices out the challenger.

    a16z's $115.5M in 2026 midterm spending makes the firm the largest disclosed political donor, and the $50M 'Leading the Future' PAC backing AI-friendly candidates of either party is the delivery mechanism. Companies not engaged in this process will comply with rules written by firms that had a hand in writing them.

    Action items

    • Commission legal exposure audit of AI products against three competing liability frameworks (absolute, safe harbor, rebuttable user-liability) within 60 days
    • Begin building audit-ready AI governance infrastructure — model cards, safety testing docs, incident protocols — that would satisfy proposed safe harbor requirements
    • Evaluate open-source AI dependencies and develop contingency plans for a world where model availability contracts due to developer liability
    • Join or establish industry policy coalition on federal preemption — align voice with a16z framework or develop alternative before defaults harden

    Sources:a16z AI Policy Brief · Risky.Biz · Morning Brew · The Download from MIT Technology Review · Bloomberg Technology

  4. 04

    Enterprise AI's True Cost Is 3-5x the Budget Line — And 85% of Orgs Aren't Ready for What They're Buying

    The Budget Blowout Pattern

    ServiceNow's CDIO said publicly that the company blew its full-year Anthropic budget by May. Anthropic's answer was no SLAs, no usage telemetry, no comment. A reasonable skeptic would say this is what every fast-growing infrastructure vendor looks like before the admin layer arrives. The reasonable skeptic is partly correct. What the skeptic does not explain is why ServiceNow is already shipping AI Control Tower and selling it to other enterprises. That's not partnership. That's the market routing around a vendor deficiency.

    Every major AI provider now requires expensive forward-deployed engineering layers to generate value. Google is hiring hundreds of FDEs. OpenAI bought a 150-person consulting firm. At $300-500K loaded cost per FDE and 5-10 needed per meaningful deployment, the true cost of an AI program runs 3-5x the model fees. The board-deck version of this is a model-spend line item. The complete version is a services line item three to five times larger.


    The Data Foundation Gap Is Organizational, Not Technical

    Of 334 data practitioners asked what they need most, 4.8% said better tools. 95.2% asked for training, clearer requirements, more time, and dedicated ownership. Only 15% of organizations have data foundations adequate for agentic AI at scale. Netflix and Meta independently converged on identity-based, team-owned data governance, replacing brittle ACLs and human-owned identities, as the precondition for letting agents read and write across an organization.

    The eighty-five percent that are not ready are not unready because they bought the wrong platform. They are unready because nobody owns the data the agents are supposed to act on.

    The Fragility Nobody Is Pricing

    AI spend is uniquely reversible. Unlike a cloud migration or an ERP rollout, token consumption can be cut to zero overnight. The enterprise AI revenue base therefore carries a fragility that model-company valuations are not pricing in. For buyers, the same property is an asset, but only for the ones who have not yet built workflows that cannot function without it. The FOMO dynamic of spending past budget because a competitor might 'figure out economics first' is classic bubble psychology meeting a reversible commitment. Those two things do not usually appear in the same sentence.

    Anthropic grew demand 80x against a planned 10x, which means it ran at roughly 12% of required capacity for extended periods. Developers on the platform received degraded service without disclosure. The productivity gains measured during that window are almost certainly understated against what adequate provisioning would have delivered. The next quarter's number will be the more honest one.

    Action items

    • Conduct immediate audit of all AI model consumption vs. budget with per-team and per-use-case attribution before next board meeting
    • Restructure data modeling ownership: establish dedicated modeling architects (only 19.2% of orgs have them) and enforce standards before scaling agentic deployments
    • Renegotiate AI vendor contracts to include SLAs, committed pricing tiers, and usage telemetry requirements — use the subsidy window while both Anthropic and OpenAI are competing
    • Model total AI program cost at 3-5x model fees when presenting investment cases — include FDE requirements, governance build, and data remediation

    Sources:Laura Bratton · TLDR Data · The Pragmatic Engineer · TLDR AI · Brian Ardinger, Inside Outside Innovation

◆ QUICK HITS

  • Update: Anthropic's Mythos overtook GPT-5.5 on offensive cyber — first model to clear both AISI simulated attack ranges, shifting the competitive dynamic between the two on government procurement

    CyberScoop

  • Cerebras IPO closed at $56B (+70% day one) on a $20B OpenAI procurement commitment — AI compute supply is now allocated through relationship-based bilateral contracts, not open markets

    Katie Roof

  • xAI leasing 45% of Colossus (220K GPUs) to Anthropic — Musk's public enemy is now his tenant, confirming GPU financialization has overwhelmed competitive logic

    The Pragmatic Engineer

  • Foxconn lost 8TB of IP from Apple, Google, Intel, and Nvidia to Nitrogen ransomware — supply chain data custody is now a first-class attack surface for AI infrastructure designs

    TLDR InfoSec

  • Fervo Energy IPO at $10B+ (shares +33%) with Google holding option on 3GW (60+ data centers) — power contracts signed this year set AI competitive position in 2028-2030

    StrictlyVC

  • Lovable dissolved its growth management layer in December, replaced with autonomous parallel contributors — 5 months in the model is expanding, attracting VP-level talent who prefer 90% high-value work over coordination

    Lenny's Newsletter

  • Duolingo walked back blanket AI mandate — CEO concedes ~20% of AI-generated output is unusable at scale, validating that forced adoption produces performative compliance, not productivity

    TLDR Marketing

  • Google's Gemini Intelligence rolls out this summer on Galaxy S26 and Pixel 10 — Android becomes an agent platform on 3B+ devices, turning apps into infrastructure agents call on users' behalf

    Simplifying AI

◆ Bottom line

The take.

AI-powered offense achieved full autonomous network takeover this week while your EDR became transparent to adversaries in days — and that's just the security layer. Five major platforms simultaneously moved to own the agent execution layer (the surface where AI actually commits writes), AI liability is being decided in courtrooms before legislation exists, and enterprise AI's true deployment cost is 3-5x what budgets show with 85% of organizations lacking the data foundations to make any of it work. The decisions being deferred to next quarter — on security architecture, platform positioning, liability posture, and governance — are the ones that will be quoted back in the 2027 renewal cycle, the 2028 courtroom, or the next incident report.

— Promit, reading as Leader ·

Frequently asked

Why does it matter that all five major EDR products share the same architecture?
It means a single LLM-assisted reverse engineering effort can compromise the entire category, not just one product. TrustedSec showed YARA rules, Lua scripting engines, and local ML classifiers are common across vendors, so detection logic that took skilled humans weeks to study can now be mapped in days — collapsing the obscurity that priced and justified the whole stack.
Where should detection investment move if endpoint controls are losing their half-life?
Toward identity, network telemetry, and behavioral analytics that sit above the endpoint. Compounding Linux LPEs like Copy Fail modify in-memory file copies without touching disk, defeating file integrity monitoring entirely, so controls that observe authentication patterns, lateral movement, and east-west traffic are the layer that still has signal over the next 18 months.
What does ServiceNow adopting MCP actually signal for platform strategy?
It signals that the agent execution layer is consolidating around an open protocol, and platforms that cannot be consumed headlessly will be routed around. With 59% of AI API tokens now agentic and ServiceNow's workflow gravity pulling the ecosystem toward MCP, any roadmap still assuming a human-in-the-UI has roughly a 12–18 month window before agent orchestrators bypass it.
Why is open-source AI access an unpriced risk on most product roadmaps?
Because if courts impose developer liability for downstream misuse, the economics of releasing open weights collapse and the supply chain restructures around a few proprietary providers. Most enterprise AI strategies quietly assume continued open-weight availability, so a plausible regulatory outcome could eliminate a dependency that never appeared on the P&L.
Why is the true cost of an enterprise AI program 3–5x the model spend?
Because every major provider now requires forward-deployed engineering to generate value, and 85% of organizations lack the data foundations to deploy agents at scale. At $300–500K loaded cost per FDE with 5–10 per meaningful deployment, plus governance and data remediation work, the services layer dwarfs the API line item that boards typically approve.

◆ Same day, different angle

Read this day as…

◆ Recent in leader

Keep reading.