Product daily

Edition 2026-05-23 · read as Product

AnthropicEndsCursorDiscount,OpenAIPouncesWithCodex

Sources
36
Words
1,831
Read
9min

Topics Agentic AI LLM Inference AI Capital

◆ The signal

A team lead checked her Cursor bill this morning and saw the number she had been quietly building her hiring plan around. That number expires on June 15, when Anthropic eliminates the 70-90% implicit discount third-party tools (Cursor, Cline, Zed) have been passing through to developers. OpenAI countered within hours with two months of free Codex for enterprise switchers inside a 30-day window. The real question is not which model writes better code. It is whether the switching cost to Codex is smaller than the per-developer cost of staying on Claude at full API rates.

◆ INTELLIGENCE MAP

  1. 01

    Anthropic's June 15 Pricing Reset Forces a 30-Day Vendor Decision

    act now

    Anthropic is collapsing third-party tool arbitrage on June 15 while OpenAI offers 2 months free Codex to switchers. ServiceNow already burned its full-year Anthropic budget by May. The per-developer cost assumption in most budget decks is wrong by roughly an order of magnitude.

    70-90%
    discount being eliminated
    8
    sources
    • Anthropic biz share
    • OpenAI biz share
    • Pricing deadline
    • Codex offer window
    1. Anthropic34.4
    2. OpenAI32.3
  2. 02

    Enterprise AI Costs Are Structurally Ungovernable

    act now

    ServiceNow exhausted its entire annual Anthropic budget by May with zero per-user visibility. Duolingo's AI mandate produced 20% unusable output and performative adoption. Anthropic lacks enterprise-standard telemetry, SLAs, or usage monitoring — forcing customers to build observability tools themselves.

    20%
    AI output 'slop' rate
    5
    sources
    • Budget blown by
    • AI slop rate
    • Anthropic rev growth
    • GPU demand ratio
    1. Anthropic ARR30
    2. OpenAI spending100
    3. Nebius rev growth684
  3. 03

    MCP Becomes the Enterprise Agent Standard — €100M Committed

    monitor

    SAP launched a €100M partner fund, ServiceNow shipped Action Fabric decoupling workflows from UI, and Salesforce added agent-native WhatsApp. All three converged on MCP as the headless execution layer. Enterprise procurement now asks 'can our agents call this?' — not 'show me the dashboard.'

    €100M
    SAP agent partner fund
    4
    sources
    • Token vol agentic
    • SAP partner fund
    • Bot detection bypass
    • RFP window
    1. Agentic workloads59
    2. Traditional AI41
  4. 04

    The PM Role Is Being Unbundled by AI-Native Orgs

    background

    Lovable has zero PMs — engineers talk to users, write specs, and ship. Elena Verna shipped an enterprise pricing page solo in hours (previously: PM + designer + engineer + a week). Abridge embeds clinician-scientists who are both MDs and full-stack engineers. The coordination layer PMs provide is the part being eliminated.

    90%
    time spent building
    3
    sources
    • Verna build time
    • Traditional coord.
    • Abridge release accel
    • Alert fatigue rate
    1. HI-C model (building)90
    2. Traditional PM (coordinating)70
  5. 05

    AI Cyber Capability Jumps to Full Network Takeover

    monitor

    Anthropic's Mythos is the first model to clear both UK AISI simulated attack ranges — achieving autonomous full network takeover. PraisonAI auth bypass was weaponized in 4 hours. The patch window assumption (days-to-weeks of safety) is structurally broken at AI speed.

    4 hours
    disclosure to exploit
    5
    sources
    • Mythos capability
    • Exploit speed
    • Identity fraud TAM
    • Mozilla bugs found
    1. Previous gen60
    2. Current gen100

◆ DEEP DIVES

  1. 01

    Anthropic's June 15 Pricing Reset: The 30-Day Decision Your Budget Depends On

    What Actually Changed

    A developer on the $200 Claude plan opened Cursor this morning and shipped a feature in about ninety minutes of model time. She did not check her usage. She has not checked it for six months, because the math always worked out: she paid Anthropic $200 and got something that, at API rates, would have cost roughly ten times that. Anthropic just announced that every Claude subscription now includes API credits equal to the plan's dollar amount ($200 plan = $200 API credits). The pitch is generous. What it actually does, for the cohort using Claude through Cursor, Cline, Zed, OpenCode, and T3 Code at effective 70-90% discounts to API pricing, is raise the per-developer cost by roughly an order of magnitude. Starting June 15, third-party tool usage gets a separate credit pool. When those credits burn, the meter runs at full API rates.

    The era of subsidized AI inference through integrations is ending. The per-developer cost assumption in most budget decks is now wrong by an order of magnitude.

    Why Now: The IPO Clock

    Anthropic hired a CFO and is likely targeting an October 2026 IPO. The model where power users got large implicit subsidies does not produce the revenue-per-user numbers public market investors want to see. Ramp data has Anthropic at 34.4% business adoption vs. OpenAI's 32.3%, the first time Anthropic has led. They are consolidating pricing power at the moment they have the leverage to do it.

    OpenAI's Displacement Play

    Sam Altman offered 2 months of free Codex to enterprise customers who switch within 30 days. The timing tracks Anthropic's developer frustration to the hour — Theo, Jeremy Howard, Matt Pocock, and Omar Sanseviero posted criticism within hours of the announcement. OpenAI lost the business adoption lead for the first time and is buying it back with displacement pricing.

    The Decision Framework

    Harness ReplaceableHarness Not Replaceable
    Load-bearing workflowRenegotiate with Anthropic in the 30-day window while leverage is realPilot Codex on the 2-month free offer this week
    Exploratory usageMove to whichever vendor is currently subsidizingMove to whichever vendor is currently subsidizing

    The Deeper Signal

    Multiple sources confirm this is an IPO-driven margin play. PMs building on Claude should model forward at least one more pricing adjustment before October as the S-1 narrative tightens. The 50% rate limit increase for two months is a grace period, not a concession. Teams that have a switching-cost memo ready will move in 72 hours. Teams that do not will spend the quarter in a Slack thread.

    Action items

    • Model the per-developer cost impact of Anthropic's new pricing by Monday — compare current third-party harness spend vs. full API rates and vs. OpenAI's Codex offer
    • Draft a one-page memo documenting what price change would make the team reverse course on AI vendor choice, and circulate it to engineering + finance this week
    • Evaluate OpenAI's 2-month free Codex offer against your top 3 Claude-dependent workflows by end of next week

    Sources:A product manager opened three vendor pricing pages this week · Your AI cost model breaks June 15 · Apple's agent App Store changes your distribution strategy · A engineer on a small team pushed a deploy on Tuesday · Anthropic just flipped OpenAI in enterprise

  2. 02

    Enterprise AI's Cost Governance Crisis — The ServiceNow Warning

    The Budget Blowout Pattern

    ServiceNow's CDIO Kellie Romack watched her team's full-year Anthropic budget get consumed before May 2026. She cannot tell you which users drove it, or which workloads, because Anthropic does not ship the telemetry that would answer those questions. PagerDuty and National Life Group describe the same problem. Nimesh Mehta at National Life Group calls Anthropic 'great for consumer usage but not great for companies.'

    The signal is not that AI is expensive. The signal is that AI costs are structurally unpredictable and the model providers have not built the instrumentation customers need to govern them.

    The Performative Adoption Trap

    Duolingo's CEO publicly acknowledged that the blanket 'evaluate all employees on AI usage' policy failed. Two findings worth keeping: AI content at scale produces roughly 20% unusable 'slop' that needs human QC, and mandating AI usage across all roles produced performative adoption with no productivity gain. They reversed the policy. Separately, 'tokenmaxxing' — employees gaming AI consumption metrics to look productive — is showing up at enterprises including Amazon. What teams tell themselves users do is automate work. What users actually do, when scored on AI usage, is consume tokens.

    Why This Is Structural

    The cost model breaks on a mismatch most teams have not named. A user finds the workflow that saves her two hours, and she runs it 11 times a day instead of the 3 the pricing model assumed. Retention improves. Usage depth climbs. Gross margin goes sideways, then down. Inference cost is variable. Revenue per seat is fixed. That gap widens with success.

    Two Product Categories Being Pulled Into Existence

    1. AI cost governance: ServiceNow built AI Control Tower internally and now sells it to customers. Per-customer, per-feature inference telemetry has moved from nice-to-have to procurement blocker.
    2. Multi-model abstraction: Strategic the moment a provider raises prices without SLAs or usage transparency. The low switching costs that worry Anthropic are the buyer's leverage.

    The Forward-Deployed Engineer Tax

    Google Cloud is hiring hundreds of FDEs for Gemini. OpenAI acquired Tomoro to stand up DeployCo with 150 FDEs. ServiceNow and Salesforce are building the same capability in-house. The 'AI replaces services' story does not survive contact with this data. Self-serve AI launches into enterprise without a services component should plan against roughly 20% activation.


    The Stable Cell

    One axis: inference cost fixed per seat or variable per call. Other axis: customer pays per seat or per outcome. The only stable cell is variable cost matched to variable pricing. Every other cell is a bet that usage will not grow, which is a strange bet to place on a feature the same deck is telling the board is working. The decision a product team can make Monday is which cell they are actually in, and whether the pricing page reflects it.

    Action items

    • Audit per-customer, per-feature inference cost attribution this sprint — if you cannot break down LLM API costs by customer and use case, build the metering layer before your next AI feature launch
    • Replace AI adoption metrics (tokens consumed, sessions per week) with outcome metrics (task completion rate, time-to-first-useful-output, week-4 retention) by next sprint planning
    • Model your AI feature unit economics assuming usage grows 3-4x over 12 months with no price decrease — stress-test against Anthropic's demonstrated willingness to raise prices

    Sources:A finance lead at ServiceNow opened the Anthropic invoice in May · Duolingo's 20% AI slop rate is your quality bar · A product manager at a mid-market SaaS company opened her analytics dashboard · Anthropic's $900B raise + AI cybercrime confirmation · A head of sales loaded the target account list on Monday

  3. 03

    MCP as the Headless Enterprise Standard: Your Q4 Deadline

    Three Vendors, One Architecture, One Week

    SAP, ServiceNow, and Salesforce all shipped autonomous agent architectures this week, and they converged on the same execution layer: headless workflows callable over MCP. ServiceNow's Action Fabric decoupled workflow logic from the UI and exposed it for any third-party agent to call. SAP shipped a Knowledge Graph for agent context plus a €100M partner fund. Hundred-million-euro funds are not how companies announce features. They are how companies announce platform bets they plan to defend for years.

    The enterprise buying committee has moved from 'show me the dashboard' to 'can our agents orchestrate your workflows.' The window before this shows up in RFPs is 2-3 quarters.

    The Production Data Confirms It

    Vercel's AI Gateway data across 200,000+ production teams shows 59% of all token volume now flows through agentic workloads. Anthropic captures 61% of AI spend, driven by Opus on heavy reasoning. Google captures 38% of volume, driven by Flash on cheap, fast tasks. Most large teams route across multiple providers rather than committing to one. That is not a request-response interaction model anymore. It is autonomous agents picking the model for the task.

    What 'Agent-Consumable' Actually Means

    A procurement manager at a Fortune 500 opened three enterprise demos this week and asked the same question in each: 'Can our agents call this directly, or do my people have to click through your UI?' Two vendors had no answer. The third moved to the next stage.

    The Build Scope Is Smaller Than Expected

    For most teams, shipping an MCP server against the existing API is a week of scoping, 2-4 weeks of build, assuming the underlying API is not already a mess. The harder question is whether the product's core UI should be restructured for agent-first interaction. That is a roadmap question, not a sprint question.

    Glean's Cautionary Data

    Glean benchmarked off-the-shelf MCP against an enterprise knowledge graph. Raw MCP used 30% more tokens and was preferred 2.5x less on agentic tasks. The protocol gets you on the field. The intelligence layer above it (contextual enrichment, domain knowledge) is where the product actually wins. MCP is necessary, not sufficient.


    The Forcing Function

    Pull the last twenty support tickets from top-decile accounts. Count how many assume a human in the seat versus an agent doing the work. If that ratio shifted even ten points toward agents in the last two quarters, headless is a retention bet on the next renewal cycle, not a platform bet for the next board meeting. That is the decision this sprint, not next year.

    Action items

    • Audit your product's API surface for agent-consumability this quarter — document whether a third-party AI agent can discover, authenticate, and execute your core workflows without a UI
    • Scope an MCP-compatible headless layer for your top 3 workflows — estimate should be 2-4 weeks of build if the underlying API is sound
    • Evaluate SAP's €100M Autonomous Enterprise partner fund for fit with your product — application likely within next quarter

    Sources:A customer success lead at a mid-market SaaS company opened her own product's API documentation · 59% of AI traffic is now agentic · Google's Universal Commerce Protocol is your next integration decision · A platform PM opened her integrations dashboard on Monday

  4. 04

    The PM Unbundling: What Survives When One Person Ships the Whole Feature

    The Lovable Evidence

    Lovable has no product managers. The company is growing fast enough that the absence is deliberate. It is the operating model. Engineers talk to users, write specs, ship code, and read feedback. Elena Verna, who ran growth at Amplitude, Miro, and Dropbox, now spends 90% of her time building with almost no meetings. She personally shipped Lovable's enterprise pricing page to production. That work previously needed a PM, a designer, an engineer, and roughly a week of calendar time.

    The PM role decomposes into three pillars: coordination, judgment, and prioritization. AI-enabled flat orgs are eliminating the first. The question is whether the remaining two justify the title.

    What's Actually Being Eliminated

    The PM value proposition has four components:

    1. User research: understanding what to build
    2. Prioritization: deciding what to build next
    3. Spec-writing: defining what to build
    4. Cross-team coordination: aligning who builds it

    Lovable's bet is that components 1-3 collapse into the engineer when the engineer talks to users directly, and component 4 disappears when the org is small enough that coordination happens in a shared channel. Abridge runs the counter-model: clinician-scientists (MDs who are also full-stack engineers) embedded in product teams, with PRDs and written specs still load-bearing. Their release cadence accelerated 4-6x despite keeping the specs.

    The HI-C Pattern

    Ravi Mehta's framing is the useful one. AI does not make a PM world-class at design or engineering. It makes them average-to-good at everything at once. The PMs who survive look less like project managers and more like mini-GMs who prototype and iterate directly. Companies that ungate information access attract disproportionate talent density. The ones that protect management layers end up with coordinators and no builders.

    The Talent Retention Risk

    Senior builders who can get autonomy at a Lovable-style flat org will leave to get it. Some will say they need a VP title. The best builders will not. They will go where they can spend 90% of their time creating instead of 90% aligning.


    The Diagnostic

    Two questions for Monday. First, of the work shipped last quarter, how much of the PM contribution was judgment about what to build vs. coordination of people building it? Second, if the coordination half went to zero tomorrow, does the judgment half justify the role? If yes, the job gets better. If no, the job gets done by someone else.

    Action items

    • Calculate your personal build-vs-coordinate ratio this week — track hours spent on judgment/building vs. alignment/meetings and benchmark against the 90% building that Verna reports
    • Ship one small customer-facing artifact end-to-end using AI tools (landing page, experiment, pricing change) without engaging your cross-functional team — before end of quarter
    • Identify which of your PM activities are judgment (what to build, why, for whom) vs. coordination (aligning, unblocking, reporting) — protect and deepen the judgment side

    Sources:A product manager at a Series B company opened Lovable's careers page three times last month · A clinician finishes a patient visit and, instead of typing notes for forty minutes after the encounter, reviews a draft that is already there

◆ QUICK HITS

  • Update: AI cyber capability — Anthropic's Mythos is now the first model to clear both UK AISI attack simulations, achieving autonomous full network takeover; PraisonAI auth bypass was weaponized within 4 hours of disclosure. Compress patch SLAs to <24 hours for critical vulns.

    A security engineer watched an automated tool chain three low-severity findings

  • AI persona drift quantified: research confirms significant persona degradation within 8 dialogue rounds due to attention decay — embed 'canary phrases' in system prompts and monitor for disappearance as a lightweight drift detection mechanism.

    AI persona drift quantified at 8 rounds

  • Notion launched a full developer platform with markdown API, external data sync, agent tool building, and plans to host Claude/Codex as 'teammates' — evaluate whether to build ON it, AGAINST it, or AROUND it.

    Your AI cost model breaks June 15

  • Only 15% of enterprise buyers have the data foundation for agentic AI, yet they're spending millions anyway — add a data-readiness qualification gate before the contract, not after, to prevent the 85% churn cohort.

    A head of sales loaded the target account list on Monday

  • Claude Code shipped /goal command: engineer types a measurable objective, walks away, and a separate Haiku model judges completion — the evaluator-as-judge pattern is the reference architecture for any autonomous AI workflow.

    A staff engineer kicked off Anthropic's autonomous coding mode on a Tuesday afternoon

  • Google changed AI Overviews handling of 'best' queries — self-ranking listicles that previously won placement may now surface your competitors instead. Run an immediate SEO audit on 'best [category]' content.

    Duolingo's 20% AI slop rate is your quality bar

  • Gemini leaking private phone numbers from training data directly to users — any product using LLMs trained on web-scraped data needs output-layer PII detection. This is architectural, not a Google-specific bug.

    The Download from MIT Technology Review

  • Microsoft's agent memory architecture validates 400-500 memory ceiling with 97.2% retention precision using consolidation + forgetting mechanisms — use as PRD benchmark for any persistent agent features.

    A head of sales loaded the target account list on Monday

◆ Bottom line

The take.

Your AI vendor costs reset June 15 whether you're ready or not — Anthropic is eliminating 70-90% third-party discounts while OpenAI runs a 30-day displacement campaign with free Codex. ServiceNow already blew through its full-year budget by May because nobody built the telemetry to see it coming. Meanwhile, SAP just committed €100M to making 'can agents call your product?' the default enterprise procurement question. The PM who instruments cost attribution, ships an MCP endpoint, and has a switching-cost memo ready by month-end owns the next vendor negotiation. The PM still in a Slack thread about it in July gets the bill.

— Promit, reading as Product ·

Frequently asked

What exactly changes on June 15 for teams using Claude through Cursor, Cline, or Zed?
Anthropic is moving third-party tool usage into a separate API credit pool tied to your subscription's dollar value, ending the 70-90% implicit discount those harnesses passed through. Once those credits burn, usage meters at full API rates, raising per-developer cost by roughly an order of magnitude for heavy users.
Is OpenAI's two-month free Codex offer worth piloting if our workflows are deeply tied to Claude?
Running the evaluation costs nothing and produces leverage either way. Even if you stay on Claude, a documented Codex pilot against your top three workflows gives you switching-cost data for renegotiation and a fallback if Anthropic adjusts pricing again before its expected October IPO.
Why should PMs assume more pricing changes are coming, not just this one?
Multiple sources frame the June 15 reset as an IPO-driven margin play, with Anthropic targeting public markets around October 2026 and consolidating pricing power now that it leads OpenAI in business adoption (34.4% vs. 32.3% per Ramp). Expect at least one more adjustment as the S-1 narrative tightens.
How do I avoid the ServiceNow-style budget blowout on my own AI features?
Build per-customer, per-feature inference telemetry before your next AI launch and stress-test unit economics assuming 3-4x usage growth with no price decrease. ServiceNow consumed its full-year Anthropic budget before May because it could not attribute spend to users or workloads — that instrumentation is now a procurement blocker, not a nice-to-have.
What should be in the one-page switching-cost memo, and who needs to see it?
Document the price change that would reverse your AI vendor choice, the workflows most exposed, the harness replaceability of each, and the rough switching cost in engineering weeks. Circulate to engineering and finance this week — teams with this memo ready move in 72 hours when pricing shifts; teams without it lose a quarter to Slack debates.

◆ Same day, different angle

Read this day as…

◆ Recent in product

Keep reading.