Leader daily

Edition 2026-05-15 · read as Leader

EDR'sSharedArchitectureNowFallstoAIinDays,NotMonths

Sources
36
Words
1,975
Read
10min

Topics Agentic AI LLM Inference AI Regulation

◆ The signal

TrustedSec's writeup this week argues the five major commercial EDR products share one architecture, built on YARA rules, Lua engines, and local ML classifiers, and that AI now reverses them in days rather than months. The same week, Anthropic's Mythos completed full autonomous network takeover in both UK AISI attack ranges. Endpoint budgets were underwritten on the premise that understanding the agent cost more than bypassing it. That premise no longer holds for a fast-growing population of adversaries, and survivability over the next eighteen months sits in the controls above the endpoint: identity, network telemetry, behavioral analytics.

◆ INTELLIGENCE MAP

  1. 01

    Endpoint Security Model Collapses — Detection Must Move Up-Stack

    act now

    AI-assisted reverse engineering renders all tested EDR products transparent in days. Mythos achieved full network takeover (first ever). Exploitation windows collapsed to 4 hours. The defensive model that assumed obscurity bought time is over.

    4hrs
    exploit window
    7
    sources
    • EDR products reversed
    • AISI ranges cleared
    • Exploit dev time
    • Monthly attacks/honeypot
    1. Old exploit dev cycle90
    2. AI-assisted exploit dev1
  2. 02

    Execution Layer War: SAP vs ServiceNow vs Platform Newcomers

    monitor

    SAP's Knowledge Graph and ServiceNow's MCP-based Action Fabric are incompatible bets on who owns the surface AI agents call. With agents at 59% of token volume, the execution layer decision this quarter determines 3 years of licensing leverage.

    59%
    agent token share
    5
    sources
    • Agent traffic share
    • SAP AI fund
    • Bot bypass rate
    • Vercel teams tracked
    1. Agentic workloads59
    2. Human chat/query41
  3. 03

    AI Liability Regime Being Written — Three Jurisdictions at Once

    monitor

    a16z published a comprehensive lobbying blueprint for user-liability defaults and damages caps. Active court cases could impose massive developer penalties before legislation exists. Open-source AI strategy is directly threatened by developer-liability outcomes.

    $115M
    a16z political spend
    4
    sources
    • a16z 2026 donations
    • Window to influence
    • Regulatory scenarios
    • Passage odds (Clarity)
    1. Active litigationNow - setting precedent
    2. Federal framework debateQ3 2026
    3. Regime hardens2027-2028
    4. Insurance markets reprice2028+
  4. 04

    Org Design Becomes Competitive Weapon — Management Layer Under Pressure

    monitor

    Lovable dissolved its growth management layer and found that VPs choosing IC roles outship 15-person teams. Cisco stock rose 15% on AI orders and 4,000 cuts the same day. The market now penalizes companies that haven't restructured.

    90%
    time on high-value work
    4
    sources
    • HI-C high-value time
    • 2025 tech layoffs
    • Cisco AI stock bump
    • LinkedIn cuts
    1. Traditional team (15 people)1
    2. HI-C + AI tools (1 person)3
  5. 05

    AI Infrastructure Financializes — Compute Becomes a Traded Asset

    background

    xAI leasing 220K GPUs (45% of Colossus) to Anthropic signals compute is now a financial instrument. Cerebras IPO at $56B on a single $20B OpenAI anchor. The GPU monopoly is being disaggregated into a lease market.

    $56B
    Cerebras market cap
    6
    sources
    • Cerebras valuation
    • OpenAI commitment
    • xAI GPUs leased
    • Nebius demand ratio
    1. Microsoft→OpenAI100
    2. Cerebras IPO56
    3. OpenAI→Cerebras20
    4. Fervo Energy IPO10

◆ DEEP DIVES

  1. 01

    Your EDR Is Now a Glass Box — The Security Architecture Reset Is This Quarter

    The Finding That Changes the Defensive Model

    TrustedSec pointed LLMs at five commercial EDR products and discovered they are built to the same blueprint: YARA-style rules, behavioral logic, allowlists, prefilters, scripted engines (some readable as Lua after a single decryption pass), and local ML classifiers. Work that used to occupy a skilled reverse engineer for weeks now takes days. The architectural diversity defenders quietly assumed existed — the thing that made each bypass a per-product investment — was never there.

    At the same time, the UK AI Security Institute confirmed that Anthropic's Mythos is the first model to clear both simulated attack ranges, achieving full network takeover rather than persistence alone. OpenAI's GPT-5.5-cyber completed one of two. Congress is taking closed-door demos and routing access through NSA rather than CISA, which tells you which use case the government considers primary.

    The security model of the defensive stack was built on the premise that the cost of understanding the agent exceeded the value of bypassing it. That premise is no longer true for a growing share of the threat population.

    The Timing Problem

    A PraisonAI vulnerability was exploited within four hours of disclosure. An 18-year NGINX RCE sat undetected in rewrite module parsing logic touching nearly every web application. Five KEV entries hit the AI infrastructure stack at once: LiteLLM, Ollama, and OpenClaw all carrying critical authentication bypass and RCE flaws. A single honeypot logged 113,000 attacks per month against AI endpoints, with tooling that evolved mid-experiment to dodge detection.

    These are not separate stories. They describe a world where patch SLAs written for 30-day windows are operating in a 4-hour reality, where the tooling layer adopted for speed was never security-reviewed, and where the endpoint control most organizations treat as load-bearing is transparent to an expanding adversary population.

    Where Detection Actually Lives Now

    A reasonable skeptic would point out that EDR has absorbed many supposed extinction events and remains the central control. The skeptic is correct on the history. The skeptic does not have to explain what happens when the cost of producing a clean bypass falls by an order of magnitude inside a single product cycle. The compensating controls that matter over the next 18 months sit above the endpoint: identity blast radius containment, network telemetry, and behavioral analytics above the endpoint. Microsoft's MDASH is running 100+ coordinated agents against its own vulnerability surface, finding 16 exploitable flaws in a single Patch Tuesday. The same capability, distributed to adversaries within 12-18 months, is the base case.

    The architectural decision this quarter is where detection lives when the endpoint agent becomes transparent. The board-deck answer is to keep EDR and add more of it. The complete answer is that teams investing in identity-layer detection, network-layer telemetry, and kernel-level isolation (Firecracker microVMs, gVisor) will absorb the next disclosure as a Tuesday. Teams that keep treating EDR as the load-bearing control will learn what load-bearing means the quarter after.

    Action items

    • Commission a red team exercise specifically targeting your EDR with AI-assisted reverse engineering to quantify your actual detection gap
    • Rewrite patch SLAs for critical internet-facing assets from 30-day to 72-hour windows, with automated deployment for the top 20 most exploitable services
    • Audit all AI infrastructure tooling (LiteLLM, Ollama, model registries, AI gateways) for production deployment without security review — inventory by end of month
    • Shift 30% of new security investment from endpoint to identity-layer detection and network behavioral analytics over the next two quarters

    Sources:Clint Gibler · The Information AM · CyberScoop · The Hacker News · SANS AtRisk · TLDR InfoSec

  2. 02

    The Execution Layer War — Who Owns the Surface AI Agents Call

    Two Incompatible Architectural Bets

    SAP and ServiceNow both declared this quarter that they are the execution layer, the place where AI agents touch systems of record and commit writes. The bets are not the same bet. SAP is building a vertically integrated Knowledge Graph that makes its own agents contextually superior inside SAP's data universe. ServiceNow adopted MCP (Model Context Protocol) servers as the communication standard for Action Fabric, opening its platform to any agent that speaks the protocol.

    These are competing theories of how the agent economy organizes: open interoperability versus data-moat integration. The decision matters because agents acting across finance, HR, IT, and procurement need one authoritative place to reconcile state. Two authoritative places is zero authoritative places.

    The twelve-to-eighteen-month window is the window. Miss it, and the agent orchestrators route around the platform rather than through it. Being bypassed is not the same as being disrupted; disruption leaves a seat at the table, bypass does not.

    Production Data Confirms the Shift

    Vercel's AI Gateway index, the closest thing to ground truth available, shows agentic workloads at 59% of all token volume across 200K+ teams. More than half of production AI usage is agents taking actions, not humans having conversations. Amazon killed Rufus to embed AI into Alexa's shopping surface. Notion launched a developer platform for agents to sync data and trigger workflows. Apple is wrestling with how to let agents into the App Store without destroying its fee structure.

    The model layer is commoditizing. Anthropic captures 61% of spend on expensive reasoning while Google captures 38% of volume on cheap throughput. That bifurcation is structural, not temporary. The competitive axis has moved from which model is best to "which execution environment do workflows depend on."

    Consumption Pricing Arrives

    SAP is not charging per-seat for autonomous finance agents. ServiceNow's headless architecture implies consumption-based pricing on agent API calls. Jason Lemkin's data point: 80% fewer human seats, 83% higher total spend, 20+ agents running. The per-seat model does not survive contact with agents that replace seat-holders. A reasonable skeptic would point out that pricing regimes take years to shift in enterprise software. The skeptic is correct about the average case. The case in front of us is not the average case, and firms that delay modeling consumption pricing will be explaining to their largest customers why they are paying for seats their agents made redundant.

    The Governance Gap

    The same organizations racing to adopt agents have not staffed the governance function. 81% of AI agents bypass legacy bot detection. Nobody owns the inventory of which agents can call which systems on whose behalf. That question becomes load-bearing the moment anything goes wrong, which is to say before Q3 budgeting closes.

    Action items

    • Conduct an 'agent readiness' audit of your platform architecture — determine if third-party AI agents can discover, invoke, and orchestrate your workflows without a human UI
    • Evaluate MCP as a strategic integration standard and build or integrate MCP server capabilities by Q4
    • Model per-action/per-outcome pricing scenarios against your current seat-based revenue and pilot with 3-5 customers this quarter
    • Stand up AI governance function with authority over agent inventory, tool rationalization, and access control before Q3 budget cycle

    Sources:TLDR IT · TLDR · ben's bites · a16z · Simplifying AI · Lenny's Newsletter

  3. 03

    AI Liability Regime Crystallizing — Your Open-Source Strategy Has an Unpriced Dependency

    Three Frameworks Competing in Real Time

    a16z has published the most comprehensive AI liability lobbying blueprint the industry has produced. The headline proposals are user-liability defaults and damages caps. Underneath the proposals is a more revealing fact: the venture class has decided the legal architecture of the next decade is worth $115.5M in political capital right now, which makes them the largest disclosed political donor of the 2026 midterm cycle.

    The strategic question is narrower than the public debate suggests. Does AI inherit Section 230 treatment, the platform regime with limited liability for downstream use, or product-manufacturer treatment, strict liability for harm from outputs. A reasonable observer would call these adjacent regimes. They are not. They produce different cost structures, different competitive dynamics, and different survival rates for everyone downstream.

    If developer liability for downstream use becomes the standard, the economic logic of releasing an open-source model stops working. No rational actor open-sources a model that generates unbounded liability for every downstream application.

    Courts Are Moving Faster Than Congress

    Active litigation against general-purpose AI tools could impose substantial penalties on developers for user misuse before any legislative framework exists. The likely sequence is that precedent-setting rulings arrive before comprehensive federal law, producing a patchwork of judicial standards that legislation has to work around rather than replace. Inside the White House, the fight between ODNI (pre-release model evaluation, effectively a licensing regime) and Commerce (voluntary frameworks through CAISI) will resolve in quarters, not years.

    CAISI published voluntary testing agreements with Google, Microsoft, and xAI, then retracted them in the same week. The instability is the planning variable. An IC-led regime means release gating and classified compliance that reshapes who ships what and when. A Commerce-led regime means disclosure requirements that are expensive but navigable. Those are not the same world.

    The Liability Cartel Dynamic

    Excessive AI liability functions as a moat for incumbents with thousand-lawyer teams. Deep pockets prefer strict liability for the same reason they prefer any rule that prices out the challenger. Product strategies quietly assuming continued access to open-source weights are carrying an unpriced dependency on regulatory outcomes that the P&L does not show. If the regime turns strict-liability, the supply of open-source models contracts and the competitive position relative to well-capitalized providers deteriorates in the same quarter.

    Action items

    • Commission a legal exposure audit of your AI products against three competing liability frameworks (absolute liability, safe harbor, user-liability presumption) — quantify financial exposure under each by end of Q3
    • Begin building audit-ready AI governance infrastructure — model cards, safety testing documentation, incident reporting — that would satisfy proposed safe harbor requirements
    • Map all open-source AI dependencies and develop contingency plans for a world where open-source model availability contracts due to developer liability outcomes
    • Engage in federal legislative process through industry coalitions — the window to influence is 12-18 months before the regime hardens

    Sources:a16z AI Policy Brief · Risky.Biz · The Download from MIT Technology Review · Morning Brew

  4. 04

    The Management Layer Is Being Priced Out — Org Design Becomes a Competitive Surface

    The Lovable Experiment

    In December 2025, the AI-native startup Lovable dissolved its growth management layer and replaced it with autonomous parallel contributors it calls High-Impact ICs, or HI-Cs. Five months in, the model is expanding rather than retreating. Former VP Elena Verna reports spending 90% of her time on high-value building instead of coordination. A single operator ships in hours what a cross-functional squad used to ship in weeks. The number that matters is not the speed. It is the headcount that used to sit between the VP and the output.

    The pattern is not unique to Lovable. It shows up across AI-native firms with different products and different founders, which is what makes it worth taking seriously. The economic argument for middle management is coordination across teams. That argument weakens considerably when AI compresses coordination costs to near-zero. One operator with the right tools and full organizational context now does the work the squad was assembled to do. The squad was, in part, a workaround for missing tooling.

    The market now penalizes companies that haven't restructured. Cisco's stock rose 15% on AI orders the same day it announced 4,000 cuts. LinkedIn stated explicitly that its cuts are 'tied to reshaping operations around AI.' The board conversation is no longer whether, but why not yet.

    The Talent Problem This Creates

    VPs are taking recruiter calls from fifty-person AI-native companies and saying yes. A VP who watched a peer ship 3x the output with six engineers and a pager asks, reasonably, which career compounds. Title inflation and retention grants do not address the question. The vulnerability is the org model. The old promise was to demonstrate craft, ascend into management, and scale comp with headcount. That promise breaks when AI eliminates the headcount that justified the management layer in the first place.

    Tech layoffs are running at 103,000+ by mid-May, already approaching 2025's full-year total of 124,000. The companies cutting are profitable. This is not a recession response. It is a response to the structural fact that fewer people with AI tools produce more output than more people without them.

    The Design Decision

    The right response is not to fire every manager next quarter. It is to treat org design as a competitive weapon. The companies that win will flatten selectively, where coordination costs exceed coordination value, and preserve management where genuine specialization, mentorship, or regulatory complexity actually requires it. The HI-C model only works when individuals hold full organizational context, which means investing in the knowledge infrastructure that makes ungated information access real rather than aspirational.

    A reasonable skeptic would say the window for experimentation will stay open for years. The skeptic is half right. The window closes when senior operators start making permanent career decisions about which kind of company they want to spend the next decade inside. Some of them are making those decisions now.

    Action items

    • Audit your org for 'coordination-only' management layers — identify every role whose primary value is alignment rather than output, and model which can be replaced by better tooling and information access
    • Pilot an HI-C track for your top 5% of senior ICs and managers who've expressed interest in returning to craft — design compensation that rewards output, not span of control
    • Invest in enterprise knowledge infrastructure that enables ungated information access with governance — the binding constraint on the HI-C model is context, not tools
    • Model your 2027 engineering capacity plan against two scenarios: current ratio of managers to makers, and 50% reduction in coordination roles offset by AI tooling investment

    Sources:Lenny's Newsletter · Techpresso · The Pragmatic Engineer · Laura Bratton

◆ QUICK HITS

  • Update: Anthropic grew from $9B to $30B+ ARR in four months, disclosed an 80x demand spike against 10x planning — service degradation during that window means measured productivity gains were understated

    StrictlyVC

  • xAI is leasing 220,000 GPUs (45% of Colossus 1) to Anthropic — Musk's competitive logic has been overwhelmed by financial logic, signaling compute is now a traded financial instrument

    The Pragmatic Engineer

  • ServiceNow blew its full-year Anthropic budget by May — no SLAs, no usage telemetry, no predictable pricing — and is now selling its 'AI Control Tower' workaround to other enterprises

    Laura Bratton

  • Apple is building agent-level App Store governance ahead of WWDC — specifically addressing agents that 'spin up smaller apps' post-approval, creating a new distribution chokepoint with fee extraction

    Techpresso

  • Only 15% of organizations have data foundations adequate for agentic AI — 95.2% of data modeling problems are organizational (ownership, training, requirements), not tooling

    TLDR Data

  • Fervo Energy IPO at $10B+ (shares up 33% on day one) validates power as AI platform business — Google holds option for 3GW from Fervo against 658MW currently contracted, enough for 60+ data centers

    The Information AM

  • Abridge raised at $5.3B with 80-100M+ medical conversations as irreplicable moat — compressing prior authorization from 45 days to minutes across 250 health systems

    Latent.Space

  • Google's Gemini Intelligence shipping this summer on Android — converting 97% market share into an OS-level agent platform that demotes apps from interfaces to capabilities agents call

    Simplifying AI

◆ Bottom line

The take.

Your endpoint security model just failed its load-bearing test — all five major EDR products are architecturally identical and AI-transparent in days, while Anthropic's Mythos achieved the first full autonomous network takeover. Simultaneously, the execution layer underneath enterprise AI agents is being claimed by SAP and ServiceNow with incompatible architectures, the AI liability regime is being written in courtrooms before Congress acts, and the market is actively penalizing companies that haven't restructured around AI-native org models. The connecting thread: every assumption from 18 months ago — that EDR was opaque, that one AI vendor was the default, that management layers were load-bearing, that liability was a future problem — is being falsified in the same quarter. The decisions being forced are architectural, not incremental.

— Promit, reading as Leader ·

Frequently asked

Why does AI-assisted reverse engineering of EDR products undermine endpoint budgets?
Endpoint security spending was justified by the assumption that producing a clean bypass cost more than the agent itself. TrustedSec showed all five major EDRs share a YARA/Lua/local-ML architecture that LLMs can now reverse in days, collapsing that cost asymmetry and pushing survivability into identity, network telemetry, and behavioral analytics layers above the endpoint.
What concrete steps should leaders take this quarter to reduce EDR dependency?
Commission an AI-assisted red team against your specific EDR to quantify the gap, compress patch SLAs on critical internet-facing assets from 30 days to 72 hours, audit AI infrastructure tooling like LiteLLM and Ollama for ungoverned production deployments, and shift roughly 30% of new security investment toward identity-layer detection and network behavioral analytics over the next two quarters.
How does the SAP versus ServiceNow execution-layer fight affect platform strategy?
It forces a choice between vertically integrated data moats and open MCP-based interoperability as the standard for how agents commit writes to systems of record. Platforms that aren't programmatically reachable by third-party agents within 12–18 months risk being routed around rather than disrupted, and per-seat pricing breaks when 80% fewer seats correlate with 83% higher spend on consumption.
What is the unpriced risk in relying on open-source AI models?
Product strategies assuming continued open-weight availability are exposed to a regulatory regime that hasn't been decided yet. If courts or Congress impose developer liability for downstream misuse before a safe harbor exists, the economics of open-sourcing collapse and supply contracts sharply, advantaging incumbents with large legal teams and leaving downstream builders without the substrate their roadmaps assume.
Why is org design becoming a competitive surface rather than an HR concern?
AI compresses coordination costs toward zero, which removes much of the economic justification for middle management layers that existed to align cross-functional squads. Lovable's HI-C model, Cisco's 15% stock pop alongside 4,000 cuts, and 103,000+ tech layoffs at profitable firms show markets are already pricing operational leverage, and senior operators are making decade-length career bets on which org models compound.

◆ Same day, different angle

Read this day as…

◆ Recent in leader

Keep reading.