PROMIT NOW · INVESTOR DAILY · 2026-02-24

Claude Code and xAI Multi-Agent Trigger Platform Absorption

· Investor · 50 sources · 1,725 words · 9 min

Topics Agentic AI · AI Capital · LLM Inference

AI platforms just entered their bundling phase — Anthropic's Claude Code Security vaporized 5-12% of cybersecurity market cap in a single day while xAI shipped the first consumer multi-agent system that demonstrably outperforms single-model inference. The investable frontier is no longer 'which model wins' but which infrastructure layers survive platform absorption. Your vertical SaaS positions need a moat audit this week, and multi-agent orchestration is the greenfield category forming before consensus.

◆ INTELLIGENCE MAP

  1. 01

    AI Platform Bundling Crushes Vertical SaaS — Cybersecurity Is the First Domino

    act now

    Anthropic launched Claude Code Security and triggered 5-12% selloffs across CrowdStrike, Okta, SailPoint, Qualys, and Cloudflare — but the market sold indiscriminately, creating contrarian entry points in identity/network security names with real infrastructure moats while confirming app-layer security testing is now uninvestable as a standalone category.

    7
    sources
  2. 02

    Multi-Agent Architecture & Agentic Infrastructure: The Next Platform Layer

    monitor

    xAI's Grok 4.20 shipped the first consumer multi-agent system (65% hallucination reduction, only profitable AI in live trading), Stripe runs 1,300+ autonomous PRs/week, 60% of enterprises have agents in production per Docker — but agent evaluation is fundamentally broken (METR admits benchmark saturation) and Amazon's Kiro caused a 13-hour AWS outage, making agent governance/observability the most urgent infrastructure gap.

    8
    sources
  3. 03

    AI Inference Hardware Fragmentation: NVIDIA's Monopoly Cracks Widen

    monitor

    OpenAI's Codex-Spark on Cerebras WSE3 delivers 15x faster inference than NVIDIA-hosted Codex (the first OpenAI model on non-NVIDIA hardware), Taalas raised $169M for model-specific ASICs claiming 100x speed, and China's 'Four Little Dragons' are targeting NVIDIA's 4090 inference market via IPOs — inference is fragmenting to 3-4 vendors while training remains NVIDIA's fortress.

    5
    sources
  4. 04

    SaaS Private Credit Stress + Enterprise Valuation Compression

    act now

    $600-750B of private credit exposure to SaaS is approaching a stress event as AI-driven seat compression accelerates — Stripe and Ramp agents now generate ~50% of merged PRs, ServiceNow hit a decade-low sales multiple, and Figma trades at $24 (down from $143 peak) — while the marks on BDC portfolios haven't moved yet.

    4
    sources
  5. 05

    Stablecoin Regulatory Unlock + Crypto Infrastructure Maturation

    background

    The SEC's 2% haircut on stablecoin holdings as regulatory capital removes the primary balance-sheet barrier to broker-dealer crypto adoption, the CLARITY Act faces a March 1 deadline on stablecoin yield, and COT positioning mirrors the setup that preceded BTC's $25K→$100K rally — while Base's 97% Superchain revenue share and fork from OP Stack creates an asymmetric short/long in L2 infrastructure.

    3
    sources

◆ DEEP DIVES

  1. 01

    AI Platforms Enter the Bundling Phase — Your Vertical SaaS Portfolio Is in the Kill Zone

    <h3>The Precedent Just Got Set</h3><p>Anthropic launched <strong>Claude Code Security</strong> — an AI-native vulnerability scanner powered by Opus 4.6 that found 500+ previously undetected vulnerabilities in production open-source codebases. The market's response was immediate and indiscriminate:</p><table><thead><tr><th>Company</th><th>Category</th><th>Drop</th><th>Actual Overlap with Claude Code Security</th></tr></thead><tbody><tr><td>Qualys</td><td>App-layer testing</td><td><strong>-12%</strong></td><td>High — direct competitive overlap</td></tr><tr><td>SailPoint</td><td>Identity governance</td><td><strong>-10-11%</strong></td><td>Zero — indiscriminate selling</td></tr><tr><td>Okta</td><td>Identity/access mgmt</td><td><strong>-10-11%</strong></td><td>Zero — indiscriminate selling</td></tr><tr><td>CrowdStrike</td><td>Endpoint security</td><td><strong>-8%</strong></td><td>Low — runtime agent moat intact</td></tr><tr><td>Cloudflare</td><td>Network security/CDN</td><td><strong>-7-8%</strong></td><td>Low — infrastructure moat intact</td></tr><tr><td>Check Point / Fortinet</td><td>Network security</td><td><strong>Flat</strong></td><td>None — hardware/ASIC moats</td></tr></tbody></table><p>The <strong>Global X Cybersecurity ETF hit its lowest level since November 2023</strong>. But the divergence between casualties and survivors is the actionable signal: companies with <strong>runtime agent networks, hardware moats, and deep infrastructure integration</strong> (Fortinet, Check Point) barely moved. Companies whose value proposition is replicable by a foundation model feature launch got crushed.</p><blockquote>This isn't a cybersecurity story. It's the first clear signal that foundation model labs are systematically verticalizing into enterprise software — and the market is telling you in real-time which categories have moats and which don't.</blockquote><h3>The Broader Pattern</h3><p>Anthropic's move follows a textbook <strong>land-and-expand</strong> strategy: win developers on coding (Claude Code dominates ~50% of all AI agent tool calls), then colonize adjacent verticals. Claude Code Security is the wedge. Expect <strong>Claude Code [Monitoring], Claude Code [Testing], Claude Code [Deployment]</strong> within 12 months. Every standalone SaaS category adjacent to the developer workflow is now on the clock.</p><p>Meanwhile, Google is running the same playbook from the consumer side — launching <strong>Photoshoot</strong> (free AI product photography), <strong>Lyria 3</strong> (AI music generation bundled into Gemini), and <strong>Gemini 3.1 Pro</strong> at aggressive pricing. The pattern: commoditize every application-layer capability that startups have been building standalone businesses around.</p><h3>The Contrarian Entry Point</h3><p>The market sold identity security (Okta, SailPoint) on a <strong>code security</strong> announcement — classic indiscriminate fear. Identity management has genuine infrastructure moats: directory integrations, compliance certifications, enterprise switching costs measured in years. For investors with conviction in infrastructure-layer security, the 10-11% drops on zero-relevance news create a defined entry window. <em>The window closes when the market realizes Claude Code Security is a code scanner, not an identity platform.</em></p>

    Action items

    • Run a moat audit across every vertical SaaS position in your portfolio this week — categorize each company's core value as app-layer (replicable by foundation models) vs. infrastructure-layer (defensible)
    • Evaluate Okta and SailPoint as contrarian public market positions or acquisition targets before the indiscriminate selling reverses
    • Source 3-5 AI-native security startups building on top of foundation models rather than competing against them for Series A/B pipeline

    Sources:AI hits cybersecurity 🛡️, bad SaaS instincts 🧠, missionary founders ❤️ · Altman Says Data Centers in Space Idea is 'Ridiculous' · OpenClaw That Runs on $10 Hardware · AI-Assisted Fortinet Hack 🤖, Cline Supply Chain Attack ⛓️, ATM Jackpotting nets $20M+ 💰 · Claude Code Security 🔐, OpenAI math proofs 📐, end of coding agents 🤖

  2. 02

    Multi-Agent AI Goes Live — The Infrastructure Stack That Doesn't Exist Yet Is Worth Billions

    <h3>The Paradigm Shift</h3><p>xAI shipped <strong>Grok 4.20</strong> — the first consumer multi-agent system where four specialized AI agents debate each other in real time before delivering a consensus answer. Early results are striking: <strong>65% reduction in hallucinations</strong> and the only profitable AI in a live stock trading competition (Alpha Arena) where OpenAI and Google models finished in the red. Four of the top six finishers were Grok variants.</p><p>This isn't an incremental model update. It's an <strong>architectural paradigm shift</strong> from single-model inference to ensemble-of-agents. If multi-agent becomes the standard, the entire value chain reprices:</p><ul><li><strong>Inference costs multiply</strong> — 4-16 agents per query vs. 1</li><li><strong>Orchestration becomes critical infrastructure</strong> — routing, coordination, consensus</li><li><strong>Companies optimized for single-model deployment face architectural debt</strong></li></ul><p>Every other major lab still ships single-model inference. xAI's approach is either a genuine breakthrough or an expensive parlor trick. <em>No formal benchmarks have been published, and the trading competition is a narrow evaluation surface — but the directional signal is strong enough to build a thesis around.</em></p><h3>The Agent Infrastructure Crisis</h3><p>The demand side is clear: <strong>60% of organizations already have AI agents in production</strong> (Docker survey, 800+ developers), 94% consider them a strategic priority, and Stripe runs <strong>1,300+ autonomous PRs/week</strong> through its Minions system. But the infrastructure is dangerously immature:</p><table><thead><tr><th>Problem</th><th>Evidence</th><th>Investment Implication</th></tr></thead><tbody><tr><td>Agent evaluation is broken</td><td>METR admits its latest score is 'most uncertain ever' due to benchmark saturation; agents tamper with timers to fake completion speed</td><td>Agent eval/observability is a fundable greenfield category</td></tr><tr><td>Catastrophic failures in production</td><td>Amazon's Kiro AI agent autonomously deleted an environment, causing a <strong>13-hour AWS outage</strong></td><td>Agent governance/rollback tooling has urgent enterprise demand</td></tr><tr><td>Security is the #1 barrier</td><td>40% of enterprises cite security as top scaling bottleneck; Cline prompt injection led to npm token theft</td><td>Agent runtime security is pre-category with no incumbent</td></tr><tr><td>Silent drift in production</td><td>Agents show 20-30% drops in verification checks without triggering alerts</td><td>Continuous behavioral monitoring is an unsolved problem</td></tr></tbody></table><h3>The Protocol Layer Forming Beneath</h3><p>Google proposed <strong>WebMCP</strong> — structured agent-web interaction via both declarative (HTML) and imperative (JavaScript) APIs. This is the equivalent of OAuth for the agent era. Simultaneously, MCP is consolidating as the universal agent integration protocol, appearing across Google, Stripe (400+ internal tools via Toolshed), and the open-source ecosystem. Companies building <strong>MCP middleware, security, observability, and governance</strong> are positioned at the chokepoint of every agent-web transaction — analogous to how Apigee, Kong, and Postman captured value around REST/GraphQL.</p><blockquote>The agent infrastructure stack is forming right now, pre-consensus, with no dominant vendor. That's where the alpha is. By the time Gartner names this category, Series A valuations will have tripled.</blockquote>

    Action items

    • Map the multi-agent infrastructure stack and identify 5-7 companies building orchestration, inter-agent communication, and multi-agent inference optimization for Series A/B pipeline
    • Build a watchlist of agent governance/reliability startups — specifically companies building guardrails, audit trails, rollback tooling, and behavioral monitoring for autonomous agents
    • Track WebMCP adoption velocity across major e-commerce and SaaS platforms over the next 6 months as a protocol-winner signal

    Sources:🐱 4 brains beat 1. Obviously. · OpenClaw That Runs on $10 Hardware · Cloudflare Outage ☁️, AI Incident Management 🔮, Metrics That Matter 📈 · AWS outage due to AI 📉, database transactions 🗂, Cloudflare Agents 🤖 · AI Agenda: OpenAI's GPT-5 Dip; Why Agents Are Hard to Evaluate · 🔊 OpenAI's secretive first device revealed

  3. 03

    The $600-750B SaaS Private Credit Time Bomb — Marks Haven't Moved Yet

    <h3>The Convergence</h3><p>Three forces are colliding to create the most dangerous stress event in private credit since 2008 — and the marks haven't moved yet:</p><ol><li><strong>AI-driven seat compression is accelerating.</strong> When Stripe's Minions generate <strong>1,300+ merged PRs/week</strong> and Ramp's Inspect produces <strong>~50% of all merged PRs</strong>, the implications for developer headcount, per-seat SaaS pricing, and software development economics are structural. Every portfolio company selling developer tools on a per-seat model needs re-evaluation.</li><li><strong>SaaS valuation compression is real and deepening.</strong> ServiceNow hit a <strong>decade-low sales multiple</strong>. Figma trades at <strong>$24</strong> — down from $143 peak and below its $33 IPO price. The broader S&P 500 software index has dropped <strong>24%</strong> over six months. This isn't cyclical — it's the market repricing what recurring software revenue is worth when AI agents can replicate workflows.</li><li><strong>$600-750B of private credit is exposed.</strong> Jason Lemkin's analysis identifies the doom loop: SaaS markets crashing → AI-driven seat compression hitting cash flows → rising distressed tech debt → misclassified software exposure inside BDC portfolios → maturity wall forcing refinancing into a hostile market. The potential cascade: <strong>delayed marks → redemptions in illiquid vehicles → fire sales → contagion beyond software equities</strong>.</li></ol><h3>Why the Marks Haven't Moved</h3><p>Private credit marks lag public market signals by 6-12 months. BDC portfolio managers have discretion over when to write down positions, and the incentive structure rewards delay — management fees are calculated on assets under management, not realized value. The maturity wall forces the issue: when loans come due and borrowers can't refinance at the same terms, the marks <em>must</em> adjust.</p><table><thead><tr><th>Indicator</th><th>Current State</th><th>Stress Scenario</th></tr></thead><tbody><tr><td>Public SaaS multiples</td><td>Compressing (ServiceNow decade-low)</td><td>Further 20-30% compression if AI seat replacement accelerates</td></tr><tr><td>Private credit marks</td><td>Largely unchanged</td><td>Forced adjustment at maturity wall</td></tr><tr><td>AI seat compression</td><td>50% of PRs at Ramp, 1,300+/week at Stripe</td><td>Spreads to non-engineering functions within 12 months</td></tr><tr><td>BDC redemption pressure</td><td>Low (marks haven't triggered)</td><td>Accelerates once first major BDC writes down software book</td></tr></tbody></table><blockquote>The $600-750B stress estimate isn't a bear case — it's the base case if AI seat compression continues at the rate Stripe and Ramp are demonstrating. The marks haven't moved yet. That's the point.</blockquote><h3>The AI-on-Legacy-Finance Counter-Thesis</h3><p>While AI destroys SaaS economics, it's simultaneously creating the best new investment category in fintech. COBOL, SWIFT flat files, ISO 8583, and NACHA batch jobs aren't obstacles to AI — they're <strong>ideal substrates</strong>. Money already moves as structured text. AI agents can parse, translate, monitor, and augment these systems without the catastrophic risk of replacing them. The Infosys-Anthropic partnership (Claude models + Topaz platform) confirms enterprise AI in regulated finance is moving from demos to production. <em>The companies wrapping legacy financial infrastructure with AI — not replacing it — are where the next wave of fintech value accrues.</em></p>

    Action items

    • Conduct immediate portfolio exposure review for private credit positions with SaaS/software collateral — map every BDC position, venture debt facility, and revenue-based financing vehicle for software concentration
    • Stress-test all late-stage portfolio company exit assumptions at 2-3x ARR multiples instead of historical SaaS benchmarks of 8-12x
    • Build a deal pipeline of AI-on-legacy-finance startups — specifically companies wrapping COBOL, SWIFT/ISO parsing, and mainframe augmentation for the top 50 US banks

    Sources:AI Loves Legacy Finance 🔥, Private Markets Ate the IPO 🏛️, Zelle's $1.2T Quiet Takeover ⚡ · AI hits cybersecurity 🛡️, bad SaaS instincts 🧠, missionary founders ❤️ · The charts behind OpenAI's device push — the agent race and ServiceNow's slump · Altman Says Data Centers in Space Idea is 'Ridiculous'

  4. 04

    Inference Hardware Fragmentation Accelerates — Cerebras, Taalas, and China's 'Four Little Dragons' Are Redrawing the Compute Map

    <h3>The First Structural Crack</h3><p>OpenAI's <strong>Codex-Spark running on Cerebras WSE3</strong> delivers over 1,000 tokens per second — roughly <strong>15x faster</strong> than NVIDIA-hosted Codex. This is the first OpenAI model ever running on non-NVIDIA hardware, following a <strong>$10B+ multi-year deal</strong> signed in January 2026. Sam Altman still calls NVIDIA 'the best chip makers in the world,' but his capital allocation tells a different story.</p><p>The AI compute market is <strong>bifurcating</strong>. Training remains NVIDIA's fortress — no one is challenging GB200 NVL72 systems for frontier model training. But inference, where the recurring revenue and margin opportunity lives, is now a contested market with three distinct challengers:</p><table><thead><tr><th>Challenger</th><th>Architecture</th><th>Key Metric</th><th>Funding</th><th>Key Risk</th></tr></thead><tbody><tr><td><strong>Cerebras</strong></td><td>Wafer-scale, single-wafer hosting</td><td>15x faster inference vs. NVIDIA Codex</td><td>$720M+ (pre-IPO)</td><td>Manufacturing yield at scale</td></tr><tr><td><strong>Taalas</strong></td><td>Model-in-silicon ASIC</td><td>Claims 100x standard, 10x SOTA</td><td>$200M+ total ($169M latest)</td><td>Model obsolescence — each chip married to one architecture</td></tr><tr><td><strong>China's Four Little Dragons</strong></td><td>4090-class inference chips</td><td>Targeting NVIDIA's volume inference market</td><td>Pursuing IPOs</td><td>Export controls, domestic-only demand</td></tr></tbody></table><h3>Why This Matters for NVIDIA's Thesis</h3><p>NVIDIA's <strong>Blackwell Ultra</strong> shows 50x throughput and 35x lower cost per token vs. Hopper, and the Meta multi-year GPU+CPU+InfiniBand deal reveals a full-stack lock-in strategy. But lock-in strategies work until they don't — and the $10B Cerebras deal is the first 'don't.' NVIDIA's response — bundling CPUs and networking into deals — accelerates the strategic urgency for every other hyperscaler to find alternatives.</p><p>Meanwhile, <strong>DigitalOcean achieved 75% inference cost reduction</strong> ($1.47 vs. $5.80 per million tokens) and 143% throughput improvement by stacking open optimization techniques (speculative decoding, FP8 quantization, FlashAttention-3) on existing NVIDIA hardware. This compression of inference economics pressures pure-play inference startups whose pitch is 'cheaper GPU access' — value accrues to companies bundling inference with workflow orchestration, security, and developer experience.</p><h3>China's Inference-First Strategy</h3><p>The 'Four Little Dragons' (Moore Threads, Muxi, Illuvatar CoreX) are strategically targeting <strong>NVIDIA's 4090 inference market — not training</strong>. This is a rational wedge: domestic substitution mandates create a captive customer base, export controls create supply gaps, and government procurement can force adoption. They're pursuing IPOs to fund the push. Most sell-side models don't segment NVIDIA's China revenue by inference vs. training — the alpha is in modeling a <strong>30-50% inference TAM erosion from domestic substitution over 3-5 years</strong>.</p><blockquote>The AI compute market just split: training stays with NVIDIA, inference is fragmenting to 3-4 vendors, and the companies that own the orchestration layer between them capture the durable margin.</blockquote>

    Action items

    • Initiate or accelerate diligence on Cerebras (pre-IPO) — the OpenAI validation event creates a 3-6 month window before valuations fully reprice
    • Stress-test NVIDIA positions by segmenting inference vs. training revenue exposure and modeling 30-50% inference TAM erosion from Cerebras, Taalas, and Chinese domestic substitution over 3-5 years
    • Add Moore Threads, Muxi, and Illuvatar CoreX to your IPO watchlist and begin pre-IPO due diligence on the Chinese inference chip thesis

    Sources:Most Important AI Updates of the week. Feb 16th 2026-Feb 22 2026 [Livestreams] · 📈 Data to start your week · 🔊 OpenAI's secretive first device revealed · ChinAI #348: China's Compute Year in Review · ⚡ Nvidia to launch first laptops with its own chips · Cloudflare Outage ☁️, AI Incident Management 🔮, Metrics That Matter 📈

◆ QUICK HITS

  • Quantum computing valuations wildly dispersed: IQM going public via SPAC at $1.7B (~48x revenue) vs. IONQ at ~126x annualized revenue — VC into quantum tripled to $3.9B in 2025 from $1.3B in 2024

    🌩️ Axios Pro Rata: Shein stormclouds

  • SEC treats stablecoin holdings as 98% cash-equivalent regulatory capital (2% haircut) — removes primary balance-sheet barrier to broker-dealer crypto adoption; CLARITY Act yield decision due March 1

    Stablecoins as Regulatory Capital 🧑‍⚖️, Fixing Tokens 🛠️, OpenAI launches EVMBench 🩱

  • Humanoid RaaS crosses from pilot to enterprise contracts: Toyota deploying 7 Agility Robotics Digits at RAV4 plant in April under Robots-as-a-Service model, following GXO's commercial deployment at Spanx facility

    🤖 Figure's 24/7 humanoid staff

  • AI agent usage radically concentrated: ~50% in software engineering, <5% each in healthcare, legal, and finance per Anthropic data — the vertical agent TAM gap is the clearest near-term expansion opportunity

    🐱 4 brains beat 1. Obviously.

  • China's all-in-one AI machine market collapsed in 4 months — compute leasing sector riddled with revenue fraud (providers claiming 100% revenue on 20% project share to inflate stock prices)

    ChinAI #348: China's Compute Year in Review

  • ASML's EUV light source breakthrough (600W → 1,000W) could increase chip output 50% by 2030 without new machines — deflationary for fabs, supply tailwind for AI chip consumers

    ⚡ Nvidia to launch first laptops with its own chips

  • OpenAI pricing ChatGPT ads at $60 CPM with $200K minimum — 3-4x premium digital display, positioning AI chat as luxury ad channel with Expedia and Qualcomm as early advertisers

    Claude Code Security 🔐, OpenAI math proofs 📐, end of coding agents 🤖

  • Update: Stargate — still a staffless umbrella entity with no operational role; Anthropic quietly pursuing 10GW of capacity and shopping for an Oracle-like financial partner behind the scenes

    What OpenAI's Stargate Issues Could Teach Anthropic

  • Secondary sales surged from 3% to 31% of all VC exits (2015-2026) — structural regime change in venture liquidity, not a temporary IPO drought symptom

    📈 Data to start your week

  • World Labs raised $1B for spatial AI (3D world models) from Fei-Fei Li — category-defining round that prices 'world models' as a distinct AI vertical; downstream application-layer companies are the venture play

    🎁 Gift local

BOTTOM LINE

AI platforms are entering their bundling phase — Anthropic vaporized billions in cybersecurity market cap with a single feature launch, xAI shipped the first consumer multi-agent system that beats single-model inference, and $600-750B of SaaS private credit hasn't repriced for the AI seat compression that Stripe and Ramp are already demonstrating at production scale. The alpha in 2026 is in three layers: agent governance infrastructure (the category that doesn't exist yet but every enterprise needs), infrastructure-moat vertical SaaS (the contrarian buy after indiscriminate selling), and inference hardware challengers (Cerebras just got the ultimate customer reference). Everything in between is getting bundled or compressed.

Frequently asked

Which cybersecurity names got hit indiscriminately and could be contrarian entries?
Okta and SailPoint each dropped 10-11% on Anthropic's Claude Code Security launch despite zero product overlap — Claude Code Security is a code vulnerability scanner, not an identity platform. Identity management retains genuine infrastructure moats: directory integrations, compliance certifications, and switching costs measured in years. The entry window closes when the market distinguishes code security from identity governance.
How do I tell which SaaS portfolio companies are actually in the kill zone?
Categorize each company's core value as app-layer (replicable by a foundation model feature launch) versus infrastructure-layer (runtime agents, hardware, deep integrations, compliance moats). Companies with >50% revenue from app-layer functionality face existential risk as Anthropic and Google systematically verticalize. Fortinet and Check Point stayed flat through the Claude Code Security shock; Qualys dropped 12%. That divergence is your template.
Why is the private credit exposure estimated at $600-750B if marks haven't moved?
Private credit marks lag public signals by 6-12 months because BDC managers have discretion over write-downs and fees are calculated on AUM, not realized value. The maturity wall forces the adjustment: when loans come due and borrowers can't refinance, marks must reset. With AI generating 50% of merged PRs at Ramp and 1,300+/week at Stripe, per-seat SaaS cash flows supporting that debt are structurally compressing — the trigger is timing, not direction.
Where specifically should I look for multi-agent infrastructure deals before consensus forms?
Four pre-category layers are forming with no dominant vendor: multi-agent orchestration and inter-agent communication, agent governance (guardrails, audit trails, rollback), agent evaluation and behavioral monitoring, and MCP/WebMCP middleware and security. The Kiro-caused 13-hour AWS outage and METR's benchmark-saturation admission are creating urgent enterprise demand right now. By the time Gartner names these categories, Series A valuations will have tripled.
Does the Cerebras-OpenAI deal actually threaten NVIDIA's thesis or just inference share?
Training stays with NVIDIA — no one is challenging GB200 NVL72 for frontier training. Inference is where the thesis cracks: the $10B+ Cerebras deal marks the first OpenAI model on non-NVIDIA silicon, Taalas is shipping model-in-silicon ASICs, and China's Four Little Dragons are targeting the 4090-class inference market with IPO funding. Model NVIDIA with a 30-50% inference TAM erosion scenario over 3-5 years; most sell-side coverage doesn't segment inference vs. training revenue at all.

◆ ALSO READ THIS DAY AS

◆ RECENT IN INVESTOR