Product daily

Edition 2026-05-15 · read as Product

AnthropicEndsThird-PartyClaudeDiscount,OpenAIPounces

Sources
36
Words
1,420
Read
7min

Topics Agentic AI LLM Inference AI Capital

◆ The signal

Anthropic kills the 70-90% implicit discount on third-party Claude usage on June 15 — every developer tool routing through Cursor, Cline, or OpenCode just became an order of magnitude more expensive. OpenAI responded within hours with 2 months of free Codex for enterprise switchers. You have 30 days to model the cost impact, decide which provider your engineering team standardizes on, and renegotiate before leverage disappears. Meanwhile, ServiceNow burned its entire full-year Anthropic budget by May — proving the cost model most teams are running is already broken.

◆ INTELLIGENCE MAP

  1. 01

    AI Cost Model Breaks June 15 — Arbitrage Era Ends

    act now

    Anthropic splits credit pools for third-party tool usage starting June 15. Teams using Claude via Cursor/Cline/OpenCode lose 70-90% implicit discounts. OpenAI offers 2 free months of Codex as displacement pricing. ServiceNow burned its full-year Anthropic budget by May with no per-user telemetry to explain why.

    70-90%
    discount being eliminated
    7
    sources
    • Discount eliminated
    • Deadline
    • OpenAI free offer
    • ServiceNow budget burn
    1. TodayModel cost impact
    2. June 15Third-party credits split
    3. 30-day windowOpenAI free Codex expires
    4. Oct 2026Anthropic likely IPO
  2. 02

    Enterprise Procurement Now Asks: Can Agents Call Your Product?

    monitor

    SAP committed €100M to agent-native architecture. ServiceNow decoupled workflows from UI via Action Fabric exposed through MCP. Three of the five largest enterprise vendors shipped agent-callable frameworks in the same week. 'Can our agents orchestrate your workflows?' is replacing 'show me the dashboard' in enterprise demos.

    €100M
    SAP agent partner fund
    5
    sources
    • SAP partner fund
    • Agentic token volume
    • Enterprise vendors shipped
    • RFP timeline
    1. SAP Fund100
    2. Agentic Traffic59
    3. Glean MCP Token Overhead30
  3. 03

    PM Role Splits: Builder vs. Coordinator

    monitor

    Elena Verna shipped Lovable's enterprise pricing page alone — work that previously needed PM + designer + engineer + a week. She spends 90% of time building, near-zero meetings. Lovable has no PMs. Duolingo's blanket AI mandate produced performative adoption and 20% unusable output before reversal. The role survives where judgment compounds; it dies where it's pure coordination.

    90%
    time building (vs. aligning)
    3
    sources
    • Verna build time
    • AI slop rate
    • Designers in workforce
    • Lovable PMs
    1. HI-C Model (Verna)90
    2. Traditional PM20
  4. 04

    AI Cyber Capability: Full Network Takeover in One Generation

    monitor

    Anthropic's Mythos and OpenAI's GPT-5.5 jumped from 'advanced persistence' to 'full network takeover' — clearing both UK AISI attack ranges. Mozilla's AI harness found 271 Firefox bugs including sandbox escapes. Exploit weaponization now happens in 4 hours (PraisonAI). Your 30-60 day patch SLA was designed for a world that no longer exists.

    4 hours
    disclosure to weaponization
    6
    sources
    • Mozilla AI bugs found
    • PraisonAI time-to-exploit
    • Palo Alto vulns found
    • Identity fraud TAM
    1. Previous gen50
    2. Current gen100
  5. 05

    Multi-Model Routing Is Production Default — Vendor Lock-In Collapsed

    background

    Vercel's production data across 200K+ teams confirms heavy multi-model routing as norm. Anthropic captures 61% of spend (Opus), Google captures 38% of volume (Flash). Anthropic overtook OpenAI in business adoption (34.4% vs 32.3% per Ramp). Engineers route by task, not by contract — procurement doesn't match reality.

    34.4%
    Anthropic biz adoption
    10
    sources
    • Anthropic biz share
    • OpenAI biz share
    • Anthropic spend share
    • Google volume share
    1. Anthropic61
    2. Google38
    3. Other1

◆ DEEP DIVES

  1. 01

    Your AI Developer Tool Costs Change Sign on June 15 — Here's the Decision Tree

    What Actually Changed

    A developer on Cursor today is paying for Claude through what amounts to a wholesale relationship Anthropic no longer wants to honor. Starting June 15, Claude subscriptions split credit pools between first-party usage (Claude app, Claude Code) and third-party tool usage (Cursor, Cline, OpenCode, Zed, T3 Code). Third-party usage gets a separate credit pool equal to the plan's dollar value. Once depleted, the user lands on API rates. Teams that built their developer workflow on Claude through third-party harnesses, at an effective 70-90% implicit discount, just absorbed a price increase of roughly an order of magnitude.

    This is not a random pricing adjustment. Anthropic hired a CFO and is likely targeting an October 2026 IPO. The previous shape, where power users got enormous implicit subsidies, does not produce the revenue-per-user numbers public markets pay for. Expect at least one more adjustment before the S-1 narrative tightens.

    OpenAI's Counter-Move Has a Clock

    Sam Altman offered 2 months of free Codex to enterprise customers who switch within 30 days. Displacement pricing, timed to developer frustration. Theo, Jeremy Howard, Matt Pocock, and Omar Sanseviero criticized the change within hours. Ramp data showing Anthropic at 34.4% versus OpenAI's 32.3% explains the urgency: OpenAI lost the business adoption lead for the first time and is fighting to reclaim it.

    The era of subsidized AI inference through third-party integrations is ending. Model the cost impact now — or discover it on your invoice in July.

    ServiceNow Is the Warning Shot

    What product teams tell themselves: enterprise AI cost is a budgeting problem. What ServiceNow's CDIO Kellie Romack actually did: burned through the entire full-year Anthropic budget before May ended, and could not tell which users drove it because Anthropic does not ship the telemetry. PagerDuty and National Life Group report the same problem. National Life's Nimesh Mehta calls Anthropic "great for consumer usage but not great for companies." The signal underneath: AI costs are structurally unpredictable and model providers have not built the instrumentation enterprises need to govern them.

    The Decision Tree for This Sprint

    Axis 1: Is the Claude usage load-bearing or exploratory?

    Axis 2: Is the harness replaceable with Anthropic-native tooling at similar quality?

    • Load-bearing + Replaceable: Renegotiate with Anthropic inside the 30-day window while the leverage is real
    • Load-bearing + Not replaceable: Pilot Codex on the 2-month free offer this week, not next month
    • Exploratory in either cell: Stop paying metered rates for exploration. Move to whichever vendor is currently subsidizing it

    Action items

    • Model the per-developer cost impact of Anthropic's new third-party credit structure against your current Claude usage by May 23
    • Initiate OpenAI Codex pilot on 2-3 engineering workflows this sprint using the 2-month free offer
    • Ship per-customer, per-feature inference cost telemetry for all AI features before your next AI feature launch
    • Draft a cost-cap policy for AI developer tools — maximum monthly spend per engineer with alerting at 70%

    Sources:A product manager opened three vendor pricing pages this week · Your AI cost model breaks June 15 · A finance lead at ServiceNow opened the Anthropic invoice · A developer opened the Claude console on a Tuesday · A platform PM opened her integrations dashboard

  2. 02

    "Can an Agent Call Your Product?" Is Now an Enterprise Sales Question — Not a Strategy Offsite Topic

    Three Enterprise Giants Shipped Headless This Week

    SAP, ServiceNow, and Salesforce all launched autonomous agent architectures in the same week, converging on the same execution pattern: headless workflows callable over MCP. SAP put up €100M in partner funding and a Knowledge Graph that hands business context to agents. ServiceNow's Action Fabric decoupled workflow logic from the UI and exposed it for any third-party agent to call. Companies do not stand up hundred-million-euro funds for features. They stand them up for platform bets they plan to defend for years.

    Vercel's production data across 200,000+ teams shows 59% of all token volume is now agentic workloads. Notion launched a Developer Platform aimed explicitly at agent tooling, with pre-built agents from Ramp, Clay, and Vercel. The convergence thesis everyone has been writing decks about — every productivity app becomes an agent-hosting platform — is what the usage data is now showing.

    A procurement manager spent forty minutes clicking through a vendor onboarding flow. Next quarter, an agent will. She will review the result. If your product can't be called headlessly, the agent routes around it to the system of record directly.

    Where This Shows Up First: Renewals, Not Win Rates

    Here is what users are actually doing. Two of a customer success lead's top ten accounts asked the same question on the same Tuesday: "Can our agents call your product directly, or do my people have to click through your UI?" The docs did not answer it. That conversation hits renewals before it hits pipeline. A Fortune 500 procurement lead opened three demos this week asking the same question. Two vendors had no answer. The third moved to the next stage.

    The Build Decision

    Separate the thing being pitched from the thing being done. Shipping an MCP server against an existing API is smaller than the deck suggests: a week of scoping, two to four weeks of build, assuming the underlying API is not already a mess. That is the easy part. The harder call is whether the product's core UI should be restructured around the assumption that an agent is the primary first-touch user for a non-trivial share of sessions. That is a roadmap question, not a sprint question.

    Glean's Warning

    Glean benchmarked off-the-shelf MCP against an enterprise knowledge graph. Raw MCP used 30% more tokens and was preferred 2.5x less on agentic tasks. The protocol is the table stake. The intelligence layer above it is where retention will come from.


    The 2x2 for Your Backlog

    Has agent-callable APIUI-only
    Revenue-influencing workflowDefend this cell — add context layerBuild MCP server this quarter
    Reporting/admin workflowMaintain, monitor adoptionDeprioritize — agents won't bother

    Action items

    • Audit your top 10 enterprise accounts' support tickets and feature requests — count how many assume an agent is doing the work vs. a human in the seat
    • Scope an MCP-compatible headless layer for your top 3 revenue-generating workflows by end of Q2
    • Evaluate SAP's €100M Autonomous Enterprise partner fund for fit with your product — application deadlines likely within next quarter
    • Add semantic metadata to your design system and API documentation in machine-readable format

    Sources:A customer success lead at a mid-market SaaS company opened her own product's API documentation · 59% of AI traffic is now agentic · A designer on a mid-sized SaaS team spent six weeks this spring · Google's Universal Commerce Protocol is your next integration decision · Your AI cost model breaks June 15

  3. 03

    The PM Role Splits Into Builder and Coordinator — The Coordinator Half Is the One Under Threat

    The Proof Point: One Person Shipped What Used to Take a Team

    Elena Verna opened a Lovable project last week and shipped Lovable's enterprise pricing page to production alone. No PM scoping the requirements. No designer on mocks. No engineer on build. The same work, six months ago, would have pulled three roles and a week of calendar Tetris. Verna ran growth at Amplitude, Miro, and Dropbox before this. She now spends 90% of her time building and almost none of it in meetings. Lovable has zero product managers.

    What teams tell themselves is happening: a hobbyist learned Figma. What is actually happening: a senior operator who ran growth at four unicorns chose to drop the coordination layer entirely. Lovable is hiring Growth PMs alongside her, not under her. That is a collection of autonomous operators, not a growth team.

    The PM value proposition decomposes into three pillars: cross-functional coordination, customer/market judgment, and strategic prioritization. AI-enabled flat orgs are eliminating pillar one. The question is whether pillars two and three justify the role on their own.

    Duolingo's Counter-Evidence: AI Mandates Fail at Scale

    Duolingo acknowledged publicly that their "evaluate every employee on AI usage" policy did not work. Two findings drove the reversal: AI content at scale produced ~20% unusable output that required human QC, and the mandate produced performative adoption with no productivity gain. Amazon staff are now gaming "MeshClaw token leaderboards." Same Goodhart pattern, different wrapper.

    The lesson is not that AI doesn't work. It is that measuring AI inputs (tokens, sessions, logins) instead of outputs (tasks completed, quality, cycle time) produces theater. The PM who survives this transition is the one who defines what "done" looks like. Not the one who mandates which tool the team opens in the morning.

    Autonomous Coding Changes the Bottleneck

    Claude Code's new /goal command lets an engineer type a measurable end state and walk away. A separate Haiku model checks completion against explicit criteria. The unit of engineering output stops being story points per sprint and becomes review throughput per senior engineer per day. If half the backlog is pattern-execution work — migrations, test generation, refactors — the sprint is mis-sized by a factor nobody will name until the retro.

    The 2x2 for Your Career This Quarter

    Org enables single-operator shippingOrg requires handoff chains
    PM time is mostly judgment/strategyBecome an HI-C — ship directlyRestructure the team for autonomy
    PM time is mostly coordinationRole gets absorbed by buildersRole persists but hollows out

    Action items

    • Calculate your personal build-vs-coordinate ratio this week — log every 30-minute block as 'creating artifact' vs 'aligning people'
    • Ship one small project end-to-end using AI tools (pricing page, experiment setup, internal tool) without engaging your cross-functional team
    • Replace any team AI adoption metrics measuring inputs (tokens consumed, sessions) with output metrics (tasks completed, time-to-ship, revision rate) before next performance review cycle
    • Identify 2-3 senior ICs or managers who might be more productive with full autonomy and fewer reports — prototype the HI-C model on one team

    Sources:A product manager at a Series B company opened Lovable's careers page · Duolingo's 20% AI slop rate is your quality bar · A staff engineer kicked off Anthropic's autonomous coding mode

◆ QUICK HITS

  • Update: Anthropic overtook OpenAI in business adoption (34.4% vs 32.3% per Ramp) — now confirmed across 10+ independent sources with Anthropic valued at $900B vs OpenAI's $852B

    Anthropic just flipped OpenAI in enterprise

  • Microsoft's agent memory architecture benchmarks at 97.2% retention precision with a 400-500 memory cap using consolidation and deliberate forgetting — use as your agent memory PRD spec ceiling

    A head of sales loaded the target account list on Monday

  • Only 15% of organizations have the data foundation for agentic AI, yet they're spending millions anyway — nearly half cite data quality and lineage as the primary blocker, not tooling

    A head of sales loaded the target account list on Monday

  • AI persona drift is quantified: significant degradation within 8 dialogue rounds due to attention decay — add drift detection to acceptance criteria for any multi-turn conversational feature

    AI persona drift quantified at 8 rounds

  • Google Gemini leaking private phone numbers from training data — output-layer PII scanning is now a mandatory non-functional requirement, not a nice-to-have

    A user asked Gemini a routine question and got back someone else's phone number

  • LLM endpoints get indexed by Shodan within 3 hours and attract 175 active hijacking attempts per week — if shipping any inference endpoint, auth + spend caps are day-one requirements

    A backend engineer shipped a new inference endpoint on a Tuesday afternoon

  • Abridge's wedge-to-platform playbook: 80M+ medical conversations nobody else has, release cadence compressed from semi-annual to monthly in a regulated vertical, now at $5.3B valuation

    A clinician finishes a patient visit and reviews a draft that is already there

  • Update: Apple's AI agent App Store governance likely at WWDC June 2026 — will require pre-declaring agent capability boundaries and sandboxing runtime generation on iOS

    Apple's agent App Store changes your distribution strategy

  • Intercom rebranding entire company to 'Fin' (their AI agent) — signals the transition from AI-as-feature to AI-as-identity in SaaS positioning

    Your AI cost model breaks June 15

  • CRM usage actually ROSE since AI tool adoption — the system of record isn't dying, it's being consumed as infrastructure at the API layer while decisions move to separate intelligence tools

    A sales operations lead opened her CRM three times on Tuesday

◆ Bottom line

The take.

Your AI costs break on June 15 when Anthropic kills the third-party discount, your enterprise buyers are already asking 'can agents call your product directly' (SAP just put €100M behind that question), and the PM role is splitting into people who build things and people who coordinate builders — with AI compressing the first group's cycle time so fast that the second group's calendar becomes the bottleneck. The three decisions this sprint: model the June 15 cost impact before it hits your invoice, scope an MCP-compatible headless layer for your top workflows before it shows up in an RFP, and decide whether your job is creating artifacts or scheduling meetings about artifacts other people create.

— Promit, reading as Product ·

Frequently asked

What exactly changes for Claude users on June 15, 2026?
Anthropic is splitting Claude subscription credits into two pools: one for first-party usage (Claude app, Claude Code) and a separate, smaller pool for third-party tools like Cursor, Cline, OpenCode, and Zed. Once the third-party pool is depleted, usage falls back to API rates, eliminating the 70-90% implicit discount developers have been relying on and effectively raising third-party tool costs by roughly an order of magnitude.
How should I decide between staying on Claude or piloting OpenAI's Codex offer?
Map each workflow on two axes: whether the Claude usage is load-bearing or exploratory, and whether the harness is replaceable with Anthropic-native tooling. Load-bearing and replaceable workloads should renegotiate with Anthropic inside the 30-day window. Load-bearing and non-replaceable workloads should pilot Codex on the 2-month free offer this sprint. Exploratory work should follow whichever vendor is currently subsidizing it.
Why did ServiceNow burn its full-year Anthropic budget by May, and how do I avoid the same fate?
ServiceNow could not attribute spend to specific users or workloads because Anthropic does not ship the telemetry enterprises need to govern usage. The fix is to instrument per-customer, per-feature inference cost telemetry before your next AI feature launch, and to set per-engineer monthly spend caps with alerting at 70% so adoption growth doesn't translate directly into uncapped invoice growth.
Should I build an MCP server for my product, and how big is that effort?
If you have revenue-influencing workflows that are currently UI-only, yes — scope it this quarter. Building an MCP server against an existing API typically takes a week of scoping plus two to four weeks of build, assuming the API isn't a mess. The harder question is whether your core UI should be restructured around agents as primary users, which is a roadmap-level decision rather than a sprint task.
What does the PM role split into, and which half is at risk?
The role decomposes into a builder half (shipping artifacts directly with AI tools) and a coordinator half (running handoffs across design, eng, and research). The coordinator half is the one being absorbed by AI-enabled flat orgs like Lovable, which has zero PMs. To stay on the durable side, log your time this week as 'creating artifact' versus 'aligning people' — if coordination exceeds 70%, invest in shipping end-to-end yourself.

◆ Same day, different angle

Read this day as…

◆ Recent in product

Keep reading.