Why does enterprise AI actually cost 3-5x the model fees?

Because every major AI vendor — Google, OpenAI, Anthropic, ServiceNow, and Salesforce — now requires forward-deployed engineers to make deployments work. At $300-500K loaded cost per FDE, with five to ten needed per meaningful deployment, the human integration layer dwarfs token spend. Boards approving AI envelopes based on model pricing are funding roughly one-third of the actual program cost.

What should a CFO ask before approving Q3 AI spend?

Ask for per-team and per-use-case attribution of current model consumption, the FDE headcount assumed in the deployment plan, and whether the contract includes SLAs, committed pricing tiers, and usage telemetry. If any of those answers is missing, the budget is not a budget — it is an estimate that will be reconciled after the invoice arrives.

Why is ServiceNow's budget blowout an industry signal rather than a procurement failure?

Because ServiceNow is a $150B enterprise software vendor with mature procurement, and it still exhausted its annual Anthropic budget by May with no usage telemetry to see it coming. Anthropic offers no SLAs, no committed tiers, and no real-time cost visibility to any customer. If ServiceNow cannot govern this spend, smaller enterprises on the same provider are 3-6 months behind the same outcome.

How much leverage do enterprise buyers have right now in AI vendor negotiations?

More than they will have in twelve months. Anthropic grew 80x against a planned 10x and needs enterprise logos ahead of an IPO, which means the subsidy window for committed pricing, SLAs, and telemetry concessions is open now. Renewals negotiated in Q3 will set the terms for the next two years; renewals deferred will be negotiated from a weaker position.

Is there a market opportunity emerging from this cost-control gap?

Yes — ServiceNow is already building an AI Control Tower and selling it externally, and the broader 'Datadog for AI spend' category is forming this quarter. Enterprises face a build-versus-buy decision on AI cost governance tooling, and identifying the category winner early matters because the control point for AI economics will sit wherever attribution, telemetry, and chargeback consolidate.

Edition 2026-05-28 · read as Leader

EnterpriseAI'sRealCostIs3–5xWhatCFOsBudgeted

Sources: 36
Words: 1,553
Read: 8min

Topics Agentic AI AI Regulation AI Capital

◆ The signal

ServiceNow exhausted its annual Anthropic budget by May. In the same quarter, Google, OpenAI, Anthropic, ServiceNow, and Salesforce have all independently converged on Palantir's forward-deployed-engineer model, which puts the true cost of enterprise AI at three to five times the model fees most budgets were built around. The Q3 CFO conversation is not whether the spend is justified. It is whether anyone in the room actually knows what it costs.

◆ INTELLIGENCE MAP

01
Your Defensive Stack Is Now Transparent
act now
TrustedSec proved all 5 major EDR products are reverse-engineered by AI in days, not weeks. Simultaneously, CISA added AI infrastructure tools to KEV. PraisonAI was weaponized within 4 hours of disclosure. The defender's response window and the defender's detection architecture both failed in the same week.
4 hrs
exploit window
7
sources
- EDR reverse time
- AISI ranges cleared
- AI infra KEVs added
- LLMjacking attempts/mo
1. EDR Reverse (Before AI)21
2. EDR Reverse (With AI)3
3. Exploit Weaponization0.17
02
Enterprise AI Cost Governance Has Failed
act now
ServiceNow exhausted its full-year Anthropic budget by May. Every major vendor now requires 5-10 forward-deployed engineers at $300-500K loaded each, making true AI deployment cost 3-5x model fees. Anthropic offers no SLAs, no usage telemetry, and had zero comment on customer budget blowouts.
3-5x
true cost vs. model fees
5
sources
- Budget exhausted
- FDE loaded cost
- FDEs per deployment
- Anthropic demand spike
1. Model Fees (Budget)1
2. FDE Layer2
3. True Total Cost4
03
Execution Layer War: SAP vs ServiceNow
monitor
SAP bet on a vertically integrated Knowledge Graph with a €100M fund. ServiceNow adopted MCP as its agent communication standard. Both are rebuilding to be consumed headlessly by AI agents, not humans. The 12-18 month window to decide which platform owns your agent execution layer is open now.
€100M
SAP agent fund
4
sources
- ServiceNow protocol
- SAP approach
- Bot bypass rate
- Agentic token share
1. SAP: Data-Moat Integration85
2. ServiceNow: Open MCP78
04
AI Liability Regime Being Written Now
monitor
a16z published the industry's lobbying blueprint while courts are actively setting precedent on AI developer liability. ODNI and Commerce are fighting for model assessment authority. If developer-liability wins, open-source AI becomes uninsurable. If user-liability wins, incumbents lose their compliance moat.
$115.5M
a16z political spend
4
sources
- Competing frameworks
- ODNI vs Commerce
- a16z midterm spend
- Window to shape
1. Developer Liability35
2. User Liability30
3. Safe Harbor35
05
Org Design Becomes Competitive Weapon
background
Lovable dissolved its growth management layer, replaced it with autonomous parallel contributors, and found the move attracts elite VPs. One operator ships in hours what a cross-functional squad shipped in weeks. Duolingo quantified the counter-risk: blanket AI mandates produce ~20% unusable output.
90%
time on high-value work
4
sources
- Lovable model age
- Duolingo slop tax
- High-value time (HI-C)
- US workforce designers
1. Traditional VP (coordination)30
2. HI-C Operator (autonomous)90

◆ DEEP DIVES

01
Your Defensive Architecture Is Now Transparent — And Your AI Infrastructure Was Never Secured
Two Failures Arrived in the Same Week
The week produced two security findings that would each define a bad quarter on their own. Together they retire the operating model most security programs still run on. TrustedSec pointed LLMs at five commercial EDR products and found all five architecturally identical: YARA-style rules, behavioral logic, allowlists, prefilters, Lua-based scripted engines readable after a single decryption pass, and local ML classifiers. Reverse engineering work that used to take a skilled human weeks now takes days. The endpoint detection category has been running on obscurity, and the obscurity is gone.
The security model assumed the cost of understanding the agent exceeded the value of bypassing it for most adversaries. That assumption no longer holds for a growing share of the threat population.
In the same window, CISA added five AI infrastructure tools to its Known Exploited Vulnerabilities catalog, including LiteLLM, Ollama, and OpenClaw. These are tools most engineering teams adopted without security review, in the narrow gap between experiment and production that AI tooling closed in roughly two quarters. A Raspberry Pi honeypot configured as an AI stack was indexed by Shodan in 3 hours and absorbed 113,000 attacks per month, with 23% of traffic aimed at AI-specific endpoints.
The Response Window Has Collapsed
PraisonAI was weaponized within 4 hours of disclosure. An 18-year-old RCE in NGINX sat undisturbed across most of the web. Traefik shipped a CVSS 10.0 authentication bypass. Argo CD allows plaintext Kubernetes secret extraction at CVSS 9.6. Stack those disclosures against the same remediation teams, change windows, and testing capacity, and any organization on a quarterly patch cadence is operating with permanent known exposure.
Microsoft's MDASH system found 16 exploitable flaws in a single Patch Tuesday cycle using multi-model AI analysis. That capability, or its functional equivalent, reaches adversaries within 12-18 months. The UK AISI confirms AI cyber task completion is doubling every few months, and Anthropic's Mythos became the first model to clear both simulated attack ranges. Congress is routing Mythos access through NSA rather than CISA, which is the clearest available signal about which use case the government treats as primary.
The Foxconn Proof Point
Nitrogen ransomware exfiltrated 8TB of confidential designs from Apple, Google, Intel, and Nvidia through a single contract manufacturer. The concentration of AI infrastructure work at a small number of assembly partners produces concentration of intellectual property, which produces concentration of target value. Supply chain data custody is now a first-class security surface, not a procurement annex.
What Changed Since Tuesday's Coverage
Tuesday's briefing argued that offensive capability was arriving. The finding this week is that defensive capability simultaneously failed. The endpoint agents are hollow. The AI infrastructure underneath them went into production without controls, and exploit windows are now compressed below most patch cadences. A reasonable skeptic will note that any single finding could be reversed by a vendor patch or a process change. The skeptic is right about any one of them. The point is that all of them landed at once, against the same teams, in the same week, which is what turns this from a patching problem into an architecture decision.
Action items
- Commission red team exercise targeting your EDR specifically with AI-assisted reverse engineering — TrustedSec's methodology is public
- Emergency audit all AI infrastructure tooling (LiteLLM, Ollama, model registries, AI gateways) adopted by engineering teams without security review
- Compress patch SLA for critical internet-facing assets from 30-day to 72-hour maximum
- Evaluate kernel-level isolation (Firecracker microVMs, gVisor) for CI/CD and multi-tenant workloads
Sources:Clint Gibler · The Information AM · CyberScoop · The Hacker News · SANS AtRisk · TLDR InfoSec
02
The AI Budget Just Tripled: What Enterprise Deployment Actually Costs
ServiceNow Is the Canary
ServiceNow's CDIO went public with the fact that the company exhausted its full-year Anthropic budget by May. Anthropic's response was no SLAs, no usage telemetry, no comment. A reasonable skeptic would call this a procurement story, not an industry story. The reasonable skeptic is wrong on the scale. A $150B enterprise software vendor is discovering in real time that its primary AI provider has no enterprise-grade cost controls, no committed pricing tiers, and no way for the buyer to know what is being spent until the invoice lands.
ServiceNow is already building the workaround — an AI Control Tower — and selling it to other enterprises. That is the market routing around a vendor deficiency in real time. The company that becomes the Datadog for AI spend will be worth tens of billions.
We are in the equivalent of cloud computing circa 2014: powerful capabilities, wildly unpredictable economics, and a governance vacuum that creates real financial exposure.
The Forward-Deployed Engineer Tax Nobody Budgeted
Every major AI vendor converged on the same admission this quarter:
- Google: hiring hundreds of forward-deployed engineers
- OpenAI: acquired a 150-person consulting firm and partnered with Bain Capital
- Anthropic: building FDE teams at scale
- ServiceNow & Salesforce: building FDE teams for AI integration
At $300-500K loaded cost per FDE, with five to ten required for a meaningful deployment, the true cost of an AI program runs 3-5x the model fees. Boards approving AI envelopes based on token pricing are approving one-third of the actual spend. The other two-thirds will arrive on a different line item, owned by a different executive, and reconciled later.
The 80x Demand Spike and What It Means for Reliability
Anthropic admitted it grew 80x against a planned 10x. That means it operated at roughly twelve percent of required capacity for extended periods. Developers experienced degraded service, rate limits, and possibly lower-quality responses without disclosure. The productivity gains measured during that window are almost certainly understated against what adequate provisioning would have delivered.
The xAI disclosure compounds the picture. Leasing 45% of its compute (220,000 GPUs) to Anthropic says GPU supply has become a financial instrument, not a strategic moat. Grok never achieved meaningful traction, and the lease revenue exceeds what those GPUs would generate running inference for xAI's own customers. The most valuable thing xAI owns is the asset it is renting out.
The FOMO Fragility
Companies are spending beyond budget because a competitor might figure out the economics first. That is classic bubble psychology, and we have flagged it before. The saving grace, which is also the risk, is that AI spend is uniquely reversible. Token consumption can be cut to zero overnight. The enterprise AI revenue base carries a fragility that is not priced into model-company valuations at $900B.
Action items
- Conduct immediate audit of all AI model consumption spend vs. budget with per-team and per-use-case attribution
- Renegotiate AI vendor contracts to require SLAs, committed pricing tiers, and usage telemetry by Q3 renewal
- Model total AI deployment cost at 3-5x model fees and present revised envelope to board before Q4 budgeting
- Evaluate AI cost governance tooling (ServiceNow AI Control Tower, Kubecost equivalents) for build-vs-buy decision
Sources:Laura Bratton · The Pragmatic Engineer · AINews · StrictlyVC · The Information AM · Martin Peers

The Execution Layer Fork: SAP vs ServiceNow Forces a This-Quarter Architecture Decision

Two Incompatible Theories of the Agent Economy

SAP and ServiceNow have finally said out loud what their architectures have implied for a year. The bets are not variations on a theme. They are different theories of where authoritative state should live when software starts taking actions on its own.

Dimension	SAP	ServiceNow
Architecture	Vertically integrated Knowledge Graph	Headless Action Fabric via MCP
Agent strategy	SAP's agents are contextually superior inside SAP's data universe	Any agent talks to ServiceNow via open protocol
Investment	€100M fund + Autonomous Enterprise	MCP servers as communication standard
Pricing model	Workflow execution value	Consumption-based on agent API calls
Best for	Process IS the transaction (order-to-cash, record-to-report)	Process IS the workflow across systems (connective tissue)

Agents that act across finance, HR, IT, and procurement need one authoritative place to reconcile state. Two authoritative places is zero authoritative places.

Why 'Run Both' No Longer Works

A reasonable skeptic would point out that large customers have run both SAP and ServiceNow side by side for two decades. The skeptic is historically correct. What changes is that agents need to commit writes. A dashboard can tolerate ambiguity between two systems of record. An agent executing a chain of decisions across finance, procurement, and HR cannot. The integration middleware of the last decade was a way of postponing the question. The question is now being asked.

The a16z framing makes the stakes concrete. $150B+ of GTM value is migrating from CRM to the AI orchestration layer. The vendor that owns execution captures the compounding. The vendor relegated to integration watches the margin walk away. Lemkin's number is the early read on what that migration looks like in a contract: 80% fewer seats, 83% higher total spend, 20+ agents running. Consumption pricing is not a future thesis. It is already accretive, in the field, this year.

The 81% Signal

81% of AI agents successfully bypass legacy bot detection. Every WAF, CAPTCHA, and rate-limiting system built to flag automated traffic is now ineffective against LLM-driven agents. The defensive implication is that the current security stack assumes adversaries behave like bots and they no longer do. The strategic implication is more interesting. The vendors that solve agent-era detection will take meaningful share, and the platforms that own agent identity and authentication become the control point the next decade is fought over.

The 12-18 Month Window

Startups are shipping agentic fabric faster than SAP and ServiceNow. That window is probably 18-24 months before the incumbents' API-first AI offerings mature enough to recapture orchestration. The decision being made this quarter, about which platform owns agent execution, is the decision that determines who has leverage in the quarter the incumbents catch up.

Action items

Conduct agent-readiness audit: can third-party AI agents discover, invoke, and orchestrate your workflows without a human UI?
Evaluate MCP adoption as a strategic investment for your platform — build or integrate MCP server capabilities by Q4
Stand up AI governance function with authority over tool/vendor rationalization before Q3 budgeting
Model per-action/per-outcome pricing on your revenue if agents replace human seat consumption within 18 months

Sources:TLDR IT · a16z · TLDR · Simplifying AI · ben's bites

◆ QUICK HITS

Update: Anthropic ARR hit $30B (from $9B four months prior), now raising at $900B+ — surpassing OpenAI's $854B March valuation for the first time
StrictlyVC
Update: Cerebras IPO priced at $56B (16% above range), up 70% day one — anchored by $20B OpenAI procurement commitment that signals compute is being pre-sold in blocks of $10B+
Katie Roof
xAI leasing 220,000 GPUs (45% of Colossus 1) to Anthropic — Musk's lab conceding the frontier race and financializing excess infrastructure
The Pragmatic Engineer
Apple positioning as AI agent gatekeeper pre-WWDC: agents that 'spin up smaller apps' face safety review and 30% fee extraction — your iOS agent roadmap needs repricing
Techpresso
a16z AI liability blueprint proposes user-liability defaults and damages caps while deploying $115.5M into 2026 midterms — the regulatory framework is being purchased, not debated
a16z AI Policy Brief
Fervo Energy IPO at $10B+ with 33% first-day pop; Google holds option for 3GW (~60 data centers) from a single geothermal supplier
StrictlyVC
Duolingo quantified the 'AI mandate' failure: ~20% of AI-generated output is unusable slop, forcing reversal from blanket adoption to role-specific augmentation
TLDR Marketing
85% of organizations spending millions on agentic AI lack adequate data foundations — and 95.2% of the gap is organizational (ownership, training, requirements), not tooling
TLDR Data
Foxconn breach: 8TB exfiltrated including confidential designs from Apple, Google, Intel, Nvidia — contract manufacturing IP custody now a first-class board risk
TLDR InfoSec
ODNI vs Commerce fight over AI model assessment authority will resolve in quarters — IC-led regime means pre-release gating; Commerce-led means voluntary disclosure. Plan for both.
Risky.Biz

◆ Bottom line

The take.

Your security architecture was proven hollow the same week your AI budget was proven uncontrolled. TrustedSec showed all five major EDR products are transparent to AI-assisted reverse engineering in days; ServiceNow showed a $150B enterprise company can blow its entire annual AI budget by May with no telemetry to explain why. Meanwhile, SAP and ServiceNow are making incompatible bets on who owns the agent execution layer — a decision that determines your platform economics for the next three years. The common thread: assumptions written into last year's plans no longer describe the ground, and the organizations that discover this through failure rather than through audit will pay the difference in public.

Frequently asked

Why does enterprise AI actually cost 3-5x the model fees?: Because every major AI vendor — Google, OpenAI, Anthropic, ServiceNow, and Salesforce — now requires forward-deployed engineers to make deployments work. At $300-500K loaded cost per FDE, with five to ten needed per meaningful deployment, the human integration layer dwarfs token spend. Boards approving AI envelopes based on model pricing are funding roughly one-third of the actual program cost.
What should a CFO ask before approving Q3 AI spend?: Ask for per-team and per-use-case attribution of current model consumption, the FDE headcount assumed in the deployment plan, and whether the contract includes SLAs, committed pricing tiers, and usage telemetry. If any of those answers is missing, the budget is not a budget — it is an estimate that will be reconciled after the invoice arrives.
Why is ServiceNow's budget blowout an industry signal rather than a procurement failure?: Because ServiceNow is a $150B enterprise software vendor with mature procurement, and it still exhausted its annual Anthropic budget by May with no usage telemetry to see it coming. Anthropic offers no SLAs, no committed tiers, and no real-time cost visibility to any customer. If ServiceNow cannot govern this spend, smaller enterprises on the same provider are 3-6 months behind the same outcome.
How much leverage do enterprise buyers have right now in AI vendor negotiations?: More than they will have in twelve months. Anthropic grew 80x against a planned 10x and needs enterprise logos ahead of an IPO, which means the subsidy window for committed pricing, SLAs, and telemetry concessions is open now. Renewals negotiated in Q3 will set the terms for the next two years; renewals deferred will be negotiated from a weaker position.
Is there a market opportunity emerging from this cost-control gap?: Yes — ServiceNow is already building an AI Control Tower and selling it externally, and the broader 'Datadog for AI spend' category is forming this quarter. Enterprises face a build-versus-buy decision on AI cost governance tooling, and identifying the category winner early matters because the control point for AI economics will sit wherever attribution, telemetry, and chargeback consolidate.

◆ Same day, different angle

Read this day as…

◆ Recent in leader

EnterpriseAI'sRealCostIs3–5xWhatCFOsBudgeted

◆ INTELLIGENCE MAP

◆ DEEP DIVES

Two Failures Arrived in the Same Week

The Response Window Has Collapsed

The Foxconn Proof Point

What Changed Since Tuesday's Coverage

ServiceNow Is the Canary

The Forward-Deployed Engineer Tax Nobody Budgeted

The 80x Demand Spike and What It Means for Reliability

The FOMO Fragility