How should I sort AI app-layer portfolio companies after the Replit/Cursor divergence?

Demand three data points from every AI app-layer portco at the next board cycle: gross margin trajectory under current and projected token costs, inference cost per query (own stack vs. API), and NRR cohort data showing net expansion rather than logo growth. Companies that can't show improving unit economics under inference deflation are acqui-sale candidates, not independent franchises, and should be priced accordingly.

What gross margin threshold should trigger concern given Grok 4.3's pricing cut?

Flag any AI wrapper portco with gross margin below 60% when stress-tested against Grok 4.3's $1.25/$2.50 per million token pricing. With token deflation running 40-60% per cycle and open-weight models now within six points of frontier on the Artificial Analysis Intelligence Index, today's COGS are tomorrow's floor — wrappers without margin buffer face forced sales rather than independent rounds.

How does the Musk v. OpenAI trial affect secondary market exposure?

The trial converts OpenAI's nonprofit-to-for-profit conversion from assumed paperwork into contested litigation seeking structural relief — asset transfer and removal of Altman and Brockman as nonprofit officers. A 15-25% litigation discount is warranted on OpenAI secondary marks, and even a cosmetic outcome likely creates a 12-month holding period without visibility, an illiquidity cost that current $500B-implied marks don't reflect.

Where is capital actually flowing if the app layer is bifurcating?

Down the stack, into infrastructure. In a single 48-hour window: Coatue's Next Frontier targeting tens of billions in data center land, Founders Fund closing $6B with $1.5B from insiders, Nebius paying $615M for Eigen AI, Meta acqui-hiring Assured Robot Intelligence, and a 12-bank consortium funding Versana. The $725B 2026 hyperscaler capex underpins infrastructure demand even as application-layer monetization slips.

Which infrastructure vectors are most investable right now?

Three stand out: inference optimization (speculative decoding, KV compression, persistent cache, MoE routing) where the Nebius-Eigen $615M deal sets the comp and 2-5 fundable teams remain; physical AI positioning and sensing, where Meta's humanoid 'Android' framing pulls Google, Tesla, and Amazon in behind it; and data center adjacent picks-and-shovels including power, cooling, interconnect, and physical-infra security like counter-drone.

Edition 2026-05-03 · read as Investor

AIAppLayerSplits:Replit's$1BARRvsCursor'sMargins

Sources: 8
Words: 1,814
Read: 9min

Topics AI Capital LLM Inference Agentic AI

◆ The signal

Replit disclosed roughly a billion dollars of ARR with three hundred percent net revenue retention, a 350x jump in eighteen months, while Cursor is reportedly selling to SpaceX at sixty billion on negative twenty-three percent gross margins. Inside the same forty-eight hours, open-weight models closed to within six points of frontier on the Artificial Analysis Intelligence Index and Grok 4.3 cut token pricing by forty to sixty percent. The app layer now sorts into companies that own their economics and companies that will need a patient, very rich buyer. The second cohort is larger than it looks.

◆ INTELLIGENCE MAP

01
AI App Layer Splits: Durable Economics vs. Acqui-Sale Candidates
act now
Replit: ~$1B ARR, 300% NRR, viable standalone. Cursor: -23% gross margins, $60B sale to SpaceX. Grok 4.3 at $1.25/$2.50 per M tokens collapses wrapper margins further. Microsoft embedding AI agents into Word kills CLM startups. Only pricing-power names survive independently.
300%
Replit net revenue retention
4
sources
- Replit ARR
- Replit NRR
- Cursor gross margin
- Grok 4.3 input price
1. Replit NRR300
2. Cursor Gross Margin-23
02
Open-Weight Models Close to 6-Point Gap — Closed-Model Book Needs Repricing
monitor
DeepSeek V4 Pro, Kimi K2.6, MiMo V2.5 Pro now score 52-54 on Intelligence Index vs. GPT-5.5 at 60. DeepSeek's hours-long KV cache vs competitors' 5-minute window rewrites agent TCO. HF projects 99% API → 95% local workload in 24-36 months. Value migrates from weights to harness, cache, and distribution.
6
benchmark-point gap remaining
3
sources
- Open-weight score
- GPT-5.5 score
- Gap (Q ago)
- Gap (now)
1. GPT-5.560
2. Gemini 3.157
3. Claude Opus 4.757
4. DeepSeek V4 Pro53
5. MiMo V2.5 Pro53
6. Kimi K2.653
03
OpenAI Trial: Live Binary Cap-Table Event
monitor
Musk is seeking asset transfer to the nonprofit and removal of Altman + Brockman — structural relief, not damages. Jury selection surfaced ground-truth adoption data: 44% of non-tech jurors are non-users or say AI makes work slower. Trial consumes leadership attention while Anthropic and DeepMind ship undistracted.
~$500B
valuation at risk
2
sources
- Relief sought
- Non-user jurors
- Brockman testifies
1. OpenAI governance risk65
04
Capital Migrates Down the Stack — Physical AI Infra Land Grab
monitor
Coatue's Next Frontier will spend tens of billions on data center land. Founders Fund closed $6B (with $1.5B insider commit). Nebius paid $615M for inference optimization (Eigen AI). Meta acqui-hired a 1-year-old robotics startup. ZaiNar emerged with $5B pipeline for GPS-alternative positioning. Infrastructure conviction hardening as app-layer monetization softens.
$6B
Founders Fund growth close
4
sources
- Founders Fund raise
- FF insider commit
- Nebius/Eigen AI deal
- ZaiNar pipeline
1. Founders Fund6
2. Nebius/Eigen AI0.615
3. Versana (banks)0.043
4. ZaiNar pipeline5
05
Agent Infra Bifurcates: MCP vs Skills Creates Two Fundable Categories
background
Agent extensibility has split into MCP (protocol/integration to live systems) and Skills (filesystem/knowledge, unisolated execution). Skills' security gap — arbitrary bash/python/curl with no sandbox — mirrors pre-Docker containerization. Dual-LLM architectures for prompt injection defense structurally double agent compute costs. Agent users projected to surpass humans on Hugging Face by EOY 2026.
2x
dual-LLM COGS multiplier
2
sources
- Skills isolation
- Secure agent COGS
- HF agent crossover
1. MCP (integration)75
2. Skills (knowledge)25

◆ DEEP DIVES

01
The Replit/Cursor Split Is the AI App Layer's Natural Experiment — Sort Your Portfolio Now
Two Companies, One Category, Opposite Economics
The single most useful data point in AI investing this week is a side-by-side that no one set up deliberately. Replit disclosed ~$1B ARR — up from $2.8M roughly 18 months ago — with 300% net revenue retention, the first hard evidence that an AI-native coding platform can produce SaaS-grade unit economics at scale. Simultaneously, Cursor is reportedly selling to SpaceX at a $60B valuation on negative 23% gross margins, meaning every dollar of revenue costs $1.23 to serve. One company is an independent franchise. The other needs a balance sheet to survive.
The divergence is structural, not cyclical. Replit owns more of its inference stack and has built workflow lock-in that drives expansion revenue. Cursor, for all its adoption, remains a foundation-model-dependent wrapper whose cost of goods scales linearly (or worse) with usage. This is the cleanest natural experiment the AI app layer has produced — same category, opposite outcomes, driven entirely by where in the stack each company chose to compete.
Inference Deflation Widens the Gap
The backdrop makes the Cursor problem harder, not easier. Grok 4.3 launched at $1.25/$2.50 per million tokens — a 40-60% cut from Grok 4.2 — continuing a deflation curve that compresses margins for anyone whose COGS are dominated by token spend. Open-weight models (DeepSeek V4 Pro, Kimi K2.6, MiMo V2.5 Pro) now score 52-54 on the Artificial Analysis Intelligence Index against 57-60 for closed frontier models — a gap that was 15 points one quarter ago and is 6 now. DeepSeek's hours-long disk-based KV cache versus competitors' 5-minute TTL is a structural TCO advantage for agent workloads that rewrites unit economics for anyone building on top.
Hugging Face's Clem Delangue projects workload distribution flipping from 99% proprietary API to 95% local/specialized over 24-36 months. Even at half that magnitude, the valuation math for API-wrapper businesses collapses.
In AI, only two positions are safe: owning the infrastructure the bubble runs on, or owning the rare app-layer companies whose customers expand 3x a year. Everything in between is an acqui-sale waiting to happen.
Microsoft's Embed-and-Extinguish Playbook Compounds the Risk
Microsoft embedding AI contract agents directly into Word is an extinction-level event for pure-play contract lifecycle management startups — Ironclad, LinkSquares, and peers lost their distribution moat in a single product announcement. Google is doing the same with Gemini creating docs, sheets, and slides in-chat. The pattern is clear: platform owners are absorbing the easy AI features and leaving only the hardest, most vertical problems for startups.
Meanwhile, enterprise AI ROI remains unproven in practice: 500 bankers reported finding AI outputs 'consistently unusable,' and 80% of Claude's users sit in $100K+ households, signaling a hard ceiling on consumer AI TAM expansion. The AI app layer is being squeezed from above (platform incumbents bundling) and below (inference deflation destroying margins), with only Replit-class NRR as an escape route.
Portfolio Implications
The Replit/Cursor dichotomy is now your sorting mechanism. Demand three data points from every AI app-layer portfolio company at the next board meeting: gross margin trajectory (under current and projected token costs), inference cost per query (own stack vs. API), and NRR cohort data (net expansion, not just logo growth). Companies that can't show improving unit economics under inference deflation are acqui-sale candidates, not independent franchises — price them accordingly.
Action items
- Demand gross margin trajectory, inference cost per query, and NRR cohort data from every AI app-layer portco at next board cycle
- Stress-test all AI wrapper portcos against Grok 4.3 pricing ($1.25/$2.50 per M tokens); flag any with gross margin below 60%
- Explore Replit pre-IPO secondary access; 300% NRR at ~$1B ARR is the rare AI name where economics justify the story
- Commission Microsoft-Word-killed-my-startup scenario analysis for CLM, legal-tech, and productivity-AI holdings by end of May
Sources:Replit's 350x ARR jump vs Cursor's -23% margins: the AI coding thesis just bifurcated · AI capex hits $725B while OpenAI misses targets — your AI book needs a repricing · Pentagon picks 7 AI winners, Anthropic benched — defense AI TAM just got allocated · Hugging Face's agent-native pivot: where to deploy capital in open-source AI's next leg
02
OpenAI's Musk Trial Is a Live Binary Event — The Secondary Market Hasn't Priced the Tail
Structural Relief, Not Damages
The Musk v. OpenAI trial, now in session, is doing something the private market has not quite absorbed. It is converting OpenAI's nonprofit-to-for-profit conversion from assumed paperwork into contested litigation. Musk is not suing for money. He is asking a federal court to transfer assets from OpenAI's business arm to its charitable arm and remove Altman and Brockman as nonprofit officers. That is structural relief, which is a politer way of saying a jury is being asked to unwind the cap table. Secondaries and SPVs carrying OpenAI exposure, along with the thicket of API-dependent wrapper positions, are all tethered to a corporate form that is now, as of this week, a live question in a courtroom.
The evidentiary spine is Microsoft's $10B investment and Musk's 'bait and switch' claim. The judge has excluded AI extinction testimony as legally irrelevant, which tells you most of what you need to know: courts will not treat 'AI safety' as a cognizable interest in governance fights. The governance question itself is still alive.
Three paths, middle one underpriced
The scenarios map cleanly enough.
1. Adverse ruling: the court orders real asset migration and a leadership change. Private marks on OpenAI exposure are wrong in a direction nobody has bothered to model.
2. Cosmetic remedy: something dramatic on the docket, mild in practice. Secondaries grind back to the last tender print. Holders sit on the position without a clean exit for another 4+ quarters.
3. Prolonged drag: the case extends into next year and the overhang itself becomes the trade.
This is probably wrong, but the middle outcome is the mispriced one, not because it is most likely but because it forces holders into a 12-month lockup without visibility. Meanwhile the trial is eating leadership bandwidth in a way nobody has booked against the schedule. Altman and Brockman have been in the gallery all week, Brockman testified Monday, and Anthropic and DeepMind are operating undistracted. That is the opportunity cost line item nobody wrote down.
The $500B headline valuation prices in a clean conversion. It does not price in an 18-month delay, a consent decree, or a settlement that reshapes the economic rights of the preferred stack.
The Accidental Adoption Survey
Jury selection produced, by accident, the most honest AI adoption data available this year. Of 9 non-tech jurors, which is as close to a random slice of American workers as you get without paying a consulting firm for one, 2 don't use AI at all and 2 say it makes their jobs slower because of error-checking overhead. Call it forty-four percent non-users or net-negative, and then ask whether that number appears in the base case of any horizontal-AI-SaaS model priced on universal knowledge-worker TAM. Separately, OpenAI missed internal revenue and user targets ahead of its IPO ambitions, even as Codex revenue doubled in under 7 days and the GPT-5.5 API is growing 2x faster than any prior launch. Platform strategy is compounding while the broader adoption thesis remains unproven.
What to Do
The defensive work here is cheap. A careful holder is already writing the downside scenario on NAV under an adverse ruling, covering secondaries and SPVs along with the API-dependent names, before the jury returns. The counter-thesis is that the trial settles or fizzles, which is fine and also costs nothing to be wrong about. A written downside case takes a weekend and protects against a binary nobody has modeled. The distraction window, separately, creates an entry point for Anthropic and DeepMind-adjacent positions while the market is watching OpenAI's courtroom drama.
Action items
- Write an OpenAI adverse-ruling NAV scenario covering secondaries, SPVs, and API-dependent portcos — complete before jury returns
- Apply 15-25% litigation discount to any OpenAI secondary marks; flag overhang risk in LP reporting
- Use the distraction window to add Anthropic secondary and DeepMind talent spinout exposure before the verdict recalibrates the tape
Sources:OpenAI's nonprofit-to-for-profit conversion is now a live legal question rather than a paperwork exercise, and the distinction matters if any part of the AI cap table sits in a portfolio. · AI capex hits $725B while OpenAI misses targets — your AI book needs a repricing
03
Money Is Moving Down the Stack — The Physical AI Infra Land Grab Has Started
The Capital Flow Map Says One Thing
Pull back from the application layer and the flows tell one story, which is that money is migrating to infrastructure. Not models, not wrappers. Physical assets, inference optimization, and the picks-and-shovels tier where capital intensity is the moat rather than the liability.
The data points inside a single forty-eight hour window:
- Coatue's Next Frontier intends to spend tens of billions on physical land for AI data centers, which is a venture firm going vertical into real estate or, more honestly, conceding that the interesting scarcity is dirt.
- Founders Fund closed a six billion dollar growth vehicle with $1.5B from insiders, which is the strongest conviction signal a top-tier firm can send without buying a billboard.
- Nebius paid six hundred fifteen million dollars for Eigen AI to own inference optimization, setting the comp for the category whether anyone likes it or not.
- Meta acqui-hired Assured Robot Intelligence, a one-year-old startup, into Superintelligence Labs. Humanoid robotics is now in land-grab pricing.
- ZaiNar emerged from stealth targeting five billion dollars in pipeline for GPS-alternative positioning.
- Versana pulled twelve-plus global banks (BNP, JPM, MS, BofA, Citi, Barclays, Deutsche, Wells, USB, Apollo) into a forty-three million dollar round. That is a consortium wearing a cap table, and it says private credit data is becoming regulated plumbing.
Infrastructure Conviction Hardening While App-Layer Softens
The structural driver is $725B in hyperscaler capex planned for 2026, which underpins infrastructure demand even if the app layer fails to monetize. The split is visible: OpenAI missed internal revenue targets, 500 bankers find AI outputs 'consistently unusable', and every hyperscaler is still accelerating spend. The enterprise ROI gap is real. The pipes get laid anyway.
The through-line is consistent: infrastructure conviction is accelerating exactly as application-layer monetization is slipping. That is the picks-and-shovels trade, playing out on tape.
Last week the argument was that capital intensity caps returns at the infrastructure layer. This week the argument is that open weights cap returns at the model layer. Both can be true. Usually are.
Three Investable Vectors
Inference optimization is the highest-conviction move, and the Nebius-Eigen six hundred fifteen million dollar comp is now the reference price. Two to five fundable teams exist with proprietary inference IP; the window closes as hyperscalers finish roll-ups. Targets: speculative decoding (PFlash-style 10x prefill speedup), KV compression, persistent cache, MoE routing. This is probably wrong in one direction, which is that one of the hyperscalers buys the category before the fund closes. Price accordingly.
Physical AI positioning and sensing is pre-consensus. ZaiNar's five billion dollar pipeline will clear or it will not, and either outcome is informative, which is the rare case where the binary is worth paying for. Meta's 'Android of humanoids' framing means Google, Tesla, and Amazon follow. Back research-pedigree teams in behavior modeling and sim-to-real with eighteen to twenty-four month strategic exit paths.
Data center physical layer remains under-owned by traditional VC, which is less a thesis than a scheduling error. Adjacent picks-and-shovels — power, cooling, interconnect, site development, physical-infra security (Amazon's drone-strike repair timeline of months is what makes counter-drone fundable, not the press release) — all benefit from Coatue going vertical into land.
Action items
- Open diligence pipeline on 3-5 inference optimization startups (speculative decoding, KV compression, persistent cache, MoE routing) by end of May
- Build a robotics platform-alignment map: identify 2-3 humanoid OEMs that benefit from Meta's 'Android' play before formal partner program announcement
- Source counter-drone and physical-infra-security startups; schedule intro calls this quarter
Sources:Replit's 350x ARR jump vs Cursor's -23% margins: the AI coding thesis just bifurcated · AI capex hits $725B while OpenAI misses targets — your AI book needs a repricing · Pentagon picks 7 AI winners, Anthropic benched — defense AI TAM just got allocated · OpenAI's nonprofit-to-for-profit conversion is now a live legal question rather than a paperwork exercise, and the distinction matters if any part of the AI cap table sits in a portfolio.

◆ QUICK HITS

Update: Pentagon defense AI oligopoly formalized — 7 vendors (SpaceX, OpenAI, Google, NVIDIA, Reflection, Microsoft, AWS) at IL6/IL7. Reflection is the only non-hyperscaler and the pre-consensus intro to get done this week.
Pentagon picks 7 AI winners, Anthropic benched — defense AI TAM just got allocated
Update: Anthropic now formally carries 'supply chain risk' DoD label — secondary marks should compress 15-25% vs. OpenAI parity; model the probability this cascades to Fortune 500 enterprise procurement
Pentagon picks 7 AI winners, Anthropic benched — defense AI TAM just got allocated
Versana's $43M round brought in 12+ global banks (BNP, JPM, MS, BofA, Citi, Barclays, Deutsche, Wells, USB, Apollo) — private credit/syndicated loan data is becoming regulated market plumbing; analogous plays in secondaries and private wealth data follow within 12 months
Replit's 350x ARR jump vs Cursor's -23% margins: the AI coding thesis just bifurcated
OpenAI Codex revenue doubled in under 7 days; GPT-5.5 API growing 2x faster than any prior launch — platform distribution compounding even as model moat compresses
The headline is that open-weight trillion-parameter mixture-of-experts models are now landing at roughly ninety percent of frontier performance, and the implied conclusion is that anyone long closed AI should mark the book down. The implied conclusion is probably directionally right. It is also the third time in eighteen months someone has announced that the moat has collapsed, so a little care is warranted before the spreadsheet comes out. Here is the puzzle. Ninety percent of frontier on published benchmarks is not ninety percent of frontier on the revenue-generating workloads, or rather, the more interesting version of the question is which ten percent is missing and who is paying for it. If the missing ten percent is long-context reasoning on regulated workloads, the closed labs keep the enterprise contracts and the pricing that comes with them. If the missing ten percent is cosmetic polish on consumer chat, the repricing is already overdue and the market has not caught up. There are three ways this plays out. One, the open-weight tier commoditizes inference, margins compress at the model layer, and the value migrates to whoever owns distribution and data — which is a thesis that has been right for ten quarters and wrong for two of them. Two, the frontier labs widen the gap again on the next training run and the ninety-percent number ages like most ninety-percent numbers do. Three, the gap stays at ninety percent indefinitely, which is the scenario nobody is modeling and probably the one worth modeling. This is probably wrong, but the interesting consequence is not the repricing of the closed-AI book. It is what the closed labs are now not doing with the capital they raised on the previous thesis. Every dollar defending a shrinking quality premium is a dollar not spent on the next differentiator, whatever that turns out to be. The investors who funded the last round were buying optionality on frontier capability. They are now holding optionality on distribution, which is a different asset priced the same way. Last week the argument was that capital intensity capped returns at the infrastructure layer. This week the argument is that open weights cap returns at the model layer. Both can be true. Usually are.
MiniMax flipped M2.7 from MIT to Non-Commercial license — canary for broader open-weight license tightening; audit any portco using MiniMax for commercial continuity risk
The headline is that open-weight trillion-parameter mixture-of-experts models are now landing at roughly ninety percent of frontier performance, and the implied conclusion is that anyone long closed AI should mark the book down. The implied conclusion is probably directionally right. It is also the third time in eighteen months someone has announced that the moat has collapsed, so a little care is warranted before the spreadsheet comes out. Here is the puzzle. Ninety percent of frontier on published benchmarks is not ninety percent of frontier on the revenue-generating workloads, or rather, the more interesting version of the question is which ten percent is missing and who is paying for it. If the missing ten percent is long-context reasoning on regulated workloads, the closed labs keep the enterprise contracts and the pricing that comes with them. If the missing ten percent is cosmetic polish on consumer chat, the repricing is already overdue and the market has not caught up. There are three ways this plays out. One, the open-weight tier commoditizes inference, margins compress at the model layer, and the value migrates to whoever owns distribution and data — which is a thesis that has been right for ten quarters and wrong for two of them. Two, the frontier labs widen the gap again on the next training run and the ninety-percent number ages like most ninety-percent numbers do. Three, the gap stays at ninety percent indefinitely, which is the scenario nobody is modeling and probably the one worth modeling. This is probably wrong, but the interesting consequence is not the repricing of the closed-AI book. It is what the closed labs are now not doing with the capital they raised on the previous thesis. Every dollar defending a shrinking quality premium is a dollar not spent on the next differentiator, whatever that turns out to be. The investors who funded the last round were buying optionality on frontier capability. They are now holding optionality on distribution, which is a different asset priced the same way. Last week the argument was that capital intensity capped returns at the infrastructure layer. This week the argument is that open weights cap returns at the model layer. Both can be true. Usually are.
Hugging Face runs 15M users on 200 employees and declined Nvidia capital — new capital-efficiency benchmark for AI infra diligence; update sector thesis before portco board meetings
Hugging Face's agent-native pivot: where to deploy capital in open-source AI's next leg
Reddit beat consensus by $53M ($663M revenue, 30% YoY search user growth) — first clean public-market proof that vertical AI search monetizes; pattern-match to Perplexity, Glean, and private vertical search comps
Pentagon picks 7 AI winners, Anthropic benched — defense AI TAM just got allocated
GitHub Copilot pricing increase in June will test whether usage-based AI monetization holds at enterprise scale — watch churn data for the first real signal on AI price sensitivity
AI capex hits $725B while OpenAI misses targets — your AI book needs a repricing
Berkshire reports Q1 Saturday (May 3) with $373B cash and Abel's first solo shareholder meeting — three things to watch: deployment framework language, Kraft Heinz disclosure, appetite for $50B+ acquisitions
Berkshire is sitting on three hundred seventy-three billion dollars of dry powder, which is either the most disciplined act of patience in the history of public markets or, to use a more honest description, the largest admission in writing that Buffett cannot find anything worth buying at current prices. Both things can be true. They usually are. Anthropic landing on a DoD blacklist is the other puzzle on the desk this week, and it is the more interesting one, because the Berkshire story has been the Berkshire story for eleven quarters and counting. The blacklist question cuts at whether the frontier labs can hold onto the enterprise customer that was supposed to anchor their revenue narrative — specifically the customer that writes checks denominated in decades rather than seat licenses. That is a different kind of risk than the one the venture round priced. The two trades connect in a way that the headlines will not quite say out loud. Berkshire's cash pile is an opportunity cost argument dressed up as prudence; every dollar not deployed is a dollar implicitly short the things being bid up, which right now means roughly everything adjacent to compute and frontier AI. Anthropic getting cut out of a procurement channel is one of the few data points that would vindicate the patience. One data point is not a trend. There is a reasonable case that the DoD story is procedural and gets resolved in a quarter, and a less reasonable case that it signals a broader screening of frontier labs on national security grounds. This is probably wrong, but the more interesting version is the second one, because it reprices the government-adjacent portion of the AI revenue stack without touching the commercial story at all. LPs asking for a view deserve that distinction rather than a single number. The view, with the usual willingness to be wrong in public: Berkshire's cash is not a signal about AI, it is a signal about everything else, and conflating the two is the mistake that will show up in letters next year. The Anthropic item is worth watching. The three hundred seventy-three billion is worth watching less than people think.
FDA AI pilot targeting 40% drug approval time reduction — if validated, biotech AI theses re-rate materially; add to watchlist for Q3 signal
AI capex hits $725B while OpenAI misses targets — your AI book needs a repricing
K-Scale declared 'RIP' — humanoid robotics shakeout has begun; winners consolidating around Physical Intelligence, Waymo, Tesla, and Nvidia
The headline is that open-weight trillion-parameter mixture-of-experts models are now landing at roughly ninety percent of frontier performance, and the implied conclusion is that anyone long closed AI should mark the book down. The implied conclusion is probably directionally right. It is also the third time in eighteen months someone has announced that the moat has collapsed, so a little care is warranted before the spreadsheet comes out. Here is the puzzle. Ninety percent of frontier on published benchmarks is not ninety percent of frontier on the revenue-generating workloads, or rather, the more interesting version of the question is which ten percent is missing and who is paying for it. If the missing ten percent is long-context reasoning on regulated workloads, the closed labs keep the enterprise contracts and the pricing that comes with them. If the missing ten percent is cosmetic polish on consumer chat, the repricing is already overdue and the market has not caught up. There are three ways this plays out. One, the open-weight tier commoditizes inference, margins compress at the model layer, and the value migrates to whoever owns distribution and data — which is a thesis that has been right for ten quarters and wrong for two of them. Two, the frontier labs widen the gap again on the next training run and the ninety-percent number ages like most ninety-percent numbers do. Three, the gap stays at ninety percent indefinitely, which is the scenario nobody is modeling and probably the one worth modeling. This is probably wrong, but the interesting consequence is not the repricing of the closed-AI book. It is what the closed labs are now not doing with the capital they raised on the previous thesis. Every dollar defending a shrinking quality premium is a dollar not spent on the next differentiator, whatever that turns out to be. The investors who funded the last round were buying optionality on frontier capability. They are now holding optionality on distribution, which is a different asset priced the same way. Last week the argument was that capital intensity capped returns at the infrastructure layer. This week the argument is that open weights cap returns at the model layer. Both can be true. Usually are.

◆ Bottom line

The take.

The AI app layer just ran its first clean natural experiment: Replit hit $1B ARR with 300% NRR while Cursor sells at $60B on negative gross margins — and with open-weight models closing to within 6 points of frontier, Grok slashing prices 40-60%, and OpenAI's cap table under live litigation, the only two investable positions in AI are owning the infrastructure or owning the rare company whose customers triple their spend annually. Everything in between is a forced sale waiting to happen, and the next 90 days is when the market figures that out.

Frequently asked

How should I sort AI app-layer portfolio companies after the Replit/Cursor divergence?: Demand three data points from every AI app-layer portco at the next board cycle: gross margin trajectory under current and projected token costs, inference cost per query (own stack vs. API), and NRR cohort data showing net expansion rather than logo growth. Companies that can't show improving unit economics under inference deflation are acqui-sale candidates, not independent franchises, and should be priced accordingly.
What gross margin threshold should trigger concern given Grok 4.3's pricing cut?: Flag any AI wrapper portco with gross margin below 60% when stress-tested against Grok 4.3's $1.25/$2.50 per million token pricing. With token deflation running 40-60% per cycle and open-weight models now within six points of frontier on the Artificial Analysis Intelligence Index, today's COGS are tomorrow's floor — wrappers without margin buffer face forced sales rather than independent rounds.
How does the Musk v. OpenAI trial affect secondary market exposure?: The trial converts OpenAI's nonprofit-to-for-profit conversion from assumed paperwork into contested litigation seeking structural relief — asset transfer and removal of Altman and Brockman as nonprofit officers. A 15-25% litigation discount is warranted on OpenAI secondary marks, and even a cosmetic outcome likely creates a 12-month holding period without visibility, an illiquidity cost that current $500B-implied marks don't reflect.
Where is capital actually flowing if the app layer is bifurcating?: Down the stack, into infrastructure. In a single 48-hour window: Coatue's Next Frontier targeting tens of billions in data center land, Founders Fund closing $6B with $1.5B from insiders, Nebius paying $615M for Eigen AI, Meta acqui-hiring Assured Robot Intelligence, and a 12-bank consortium funding Versana. The $725B 2026 hyperscaler capex underpins infrastructure demand even as application-layer monetization slips.
Which infrastructure vectors are most investable right now?: Three stand out: inference optimization (speculative decoding, KV compression, persistent cache, MoE routing) where the Nebius-Eigen $615M deal sets the comp and 2-5 fundable teams remain; physical AI positioning and sensing, where Meta's humanoid 'Android' framing pulls Google, Tesla, and Amazon in behind it; and data center adjacent picks-and-shovels including power, cooling, interconnect, and physical-infra security like counter-drone.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

AIAppLayerSplits:Replit's$1BARRvsCursor'sMargins

◆ INTELLIGENCE MAP

◆ DEEP DIVES

Two Companies, One Category, Opposite Economics

Inference Deflation Widens the Gap

Microsoft's Embed-and-Extinguish Playbook Compounds the Risk

Portfolio Implications

Structural Relief, Not Damages

Three paths, middle one underpriced

The Accidental Adoption Survey

What to Do

The Capital Flow Map Says One Thing

Infrastructure Conviction Hardening While App-Layer Softens

Three Investable Vectors

◆ QUICK HITS

The take.

Frequently asked

◆ RELATED THREADS