Product daily

Edition 2026-05-16 · read as Product

AnthropicPricingResetGivesPMs29DaystoRewireCosts

Sources
36
Words
1,564
Read
8min

Topics Agentic AI LLM Inference AI Capital

◆ The signal

Anthropic kills the 70-90% implicit discount that developers using Claude through third-party harnesses (Cursor, Cline, Zed) have been living inside — effective June 15. OpenAI responded within hours offering two months of free Codex for enterprise switchers who commit within 30 days. Meanwhile, ServiceNow burned its entire full-year Anthropic budget by May because nobody built per-user cost telemetry. Your AI vendor economics have a hard deadline in 29 days: the team with a one-page 'what we do when pricing moves' memo ships the decision in 72 hours. The team without one spends Q3 in a Slack thread.

◆ INTELLIGENCE MAP

  1. 01

    AI Vendor Pricing War Has a 30-Day Clock

    act now

    Anthropic restructures third-party Claude pricing June 15 (70-90% effective increase for harness users). OpenAI counters with 2 months free Codex for 30-day switchers. Ramp confirms Anthropic overtook OpenAI in B2B (34.4% vs 32.3%). This isn't a market shift to watch — it's a negotiation window with a specific expiration date.

    June 15
    pricing deadline
    12
    sources
    • Anthropic B2B share
    • OpenAI B2B share
    • Anthropic ARR
    • Discount eliminated
    • Codex free offer
    1. Anthropic34.4
    2. OpenAI32.3
    3. Google (volume)38
  2. 02

    AI Cost Governance Is the Production Risk Nobody Instrumented

    act now

    ServiceNow burned its entire full-year Anthropic budget by May. Anthropic lacks per-user telemetry to explain why. Duolingo reversed its blanket AI mandate after 20% of output was unusable. Only 15% of orgs have the data foundation for agentic AI. The pattern: AI features that work create unpredictable cost curves, and most products lack the metering to forecast them.

    20%
    AI output slop rate
    7
    sources
    • ServiceNow budget
    • Orgs data-ready
    • AI slop rate
    • Anthropic grow vs plan
    1. Orgs with AI data foundation15
  3. 03

    Enterprise Agent Infrastructure Converges on MCP Standard

    monitor

    SAP committed €100M partner fund + Knowledge Graph for agent context. ServiceNow decoupled workflow logic from UI via Action Fabric exposed over MCP. Notion launched a full developer platform for agent tooling. Three of the largest enterprise vendors picking the same week to standardize on headless, MCP-callable workflows is a procurement signal: RFPs asking 'can our agents call this' arrive in 2-3 quarters.

    €100M
    SAP agent fund
    6
    sources
    • SAP partner fund
    • Agentic token share
    • RFP window
    • Bot detection bypass
    1. Agentic workloads59
    2. Traditional API41
  4. 04

    PM Role Compresses: One Operator Ships What Three Did

    background

    Elena Verna (ex-Amplitude, Miro, Dropbox growth) shipped Lovable's enterprise pricing page to production alone — no PM, no designer, no engineer handoff. She reports spending 90% of time building vs coordinating. Lovable has zero product managers. The unbundling is real: judgment and strategy survive; coordination becomes overhead AI eliminates.

    90%
    time building vs meetings
    3
    sources
    • Verna build time
    • Traditional PM build
    • Lovable PMs
    • Designers in US
    1. HI-C (Verna)90
    2. Traditional PM20
  5. 05

    AI Cyber Capability Crossed Full Network Takeover

    monitor

    Anthropic's Mythos is the first model to clear both UK AISI simulated attack ranges — jumping from 'advanced persistence' to 'full network takeover' in one generation. Mozilla found 271 Firefox bugs using a custom Mythos harness; curl found 1 CVE with same model. The delta is the harness, not the model. PraisonAI auth bypass was weaponized in 4 hours post-disclosure.

    271
    bugs found via AI harness
    6
    sources
    • Mozilla bugs found
    • curl bugs found
    • PraisonAI exploit time
    • Identity fraud TAM
    1. Mozilla (custom harness)271
    2. curl (raw model)1

◆ DEEP DIVES

  1. 01

    Your AI Vendor Bill Changes June 15 — Here's the Decision Framework

    The Pricing Move

    A developer opened Cursor on Tuesday morning and ran the same Claude-powered loop she runs every day. It cost her nothing extra, because her $200 Claude subscription was effectively subsidizing her API usage through the third-party harness. Anthropic announced this week that every Claude subscription now includes API credits equal to the plan's dollar amount ($200 plan = $200 API credits). Pitched as generous. For the cohort running Claude through Cursor, Cline, Zed, and OpenCode at effective 70-90% discounts to API pricing, it is a price increase of roughly an order of magnitude. Starting June 15, third-party tool usage moves to a separate credit pool. Once burned, API rates apply.

    The era of subsidized AI inference through integrations is ending. The 50% rate limit increase for two months is a grace period, not a permanent concession.

    The Competitive Response

    OpenAI replied inside hours. Sam Altman offered two months of free Codex to enterprise customers who switch within 30 days. That is displacement pricing, timed to the exact moment of developer frustration. The Ramp data explains the urgency: Anthropic at 34.4% of business customers versus OpenAI at 32.3%, the first time Anthropic has led. OpenAI lost the business adoption lead and is fighting to reclaim it with a shot clock.

    Why This Happened Now

    Anthropic hired a CFO and is likely targeting an October 2026 IPO. The prior model, where power users captured large implicit subsidies, does not produce the revenue-per-user metrics public markets reward. Revenue moved from $9B to $30B+ ARR in roughly four months. At a $900B valuation, they need $40-50B annually. Product teams should model at least one more pricing adjustment before October as the S-1 narrative tightens.

    The 2x2 for This Sprint

    Harness replaceableHarness NOT replaceable
    Load-bearing workflowRenegotiate with Anthropic in 30-day windowPilot Codex on free offer THIS week
    Exploratory usageMove to whichever vendor is subsidizingMove to whichever vendor is subsidizing

    The teams that wrote a one-page memo defining "what price change would make us reverse course" will move in 72 hours. Teams without that memo will spend the quarter in a Slack thread while the window closes.

    Action items

    • Model the per-developer cost impact of Anthropic's new pricing on all third-party Claude usage by end of next week
    • Open an evaluation of OpenAI Codex on the 2-month free offer for your heaviest Claude workflow within 7 days
    • Draft the one-page 'vendor reversal' memo: at what price point, what capability gap, or what reliability threshold does the team switch providers — circulate before next sprint planning
    • Negotiate enterprise SLA guarantees (availability, advance notice of feature changes, capacity reservation) with Anthropic this quarter while their IPO prep creates leverage

    Sources:AINews · TLDR AI · ben's bites · The Pragmatic Engineer · Techpresso · Laura Bratton

  2. 02

    AI Cost Governance: The Feature You Ship Before the Next AI Feature

    The ServiceNow Canary

    ServiceNow's CDIO Kellie Romack watched her team's full-year Anthropic budget get consumed before mid-2026. She cannot say which users drove it, or which workloads, because Anthropic does not ship the telemetry. PagerDuty and National Life Group describe the same posture. The signal is not that AI is expensive. The signal is that AI costs are structurally unpredictable and the model providers have not built the instrumentation a buyer actually needs to run the feature in production.

    A feature whose success creates unpredictable costs is not a shipped feature. It is a finance conversation waiting to happen.

    Three Failure Patterns Converging

    1. Tokenmaxxing: Amazon mandated AI usage and staff gamed the consumption leaderboards. Duolingo mandated AI across all roles, got 20% unusable output, and reversed the policy. Tokens consumed and sessions opened measure whether a user pokes the feature. They do not measure whether the feature produced value.
    2. Power-user concentration: A team ships an AI feature without per-user cost visibility. Three months later they discover a handful of power users are consuming most of the budget. Then finance calls.
    3. Data readiness gap: Only 15% of organizations have the data foundation for agentic AI. Companies are spending millions anyway. Nearly half cite data quality as the primary blocker. What customers actually do is buy, fail to activate, churn, and blame the vendor.

    What Instrumentation Looks Like

    ServiceNow built AI Control Tower internally and staffed a dedicated person to watch consumption, then turned around and sold the tool to its own customers. The category gap is the product opportunity: per-customer, per-feature inference cost telemetry does not exist in most production stacks. The metrics that show whether the surface is working are time-to-first-useful-output and week-four retention on the AI surface. Token volume tells you nothing about either.

    The Pricing Model Test

    The only stable cell on the pricing 2x2 is variable cost matched to variable pricing. Fixed cost per seat against variable inference is a bet that usage will not grow, which is a strange bet to place on a feature the roadmap deck tells the board is working. Palantir's value-based pricing, charging against cost savings delivered, is becoming the template for AI deployment companies. OpenAI's DeployCo is hiring Palantir alumni to execute it.

    Action items

    • Audit your product's AI cost attribution: can you break down LLM API costs per customer, per feature, per use case? If not, spec the metering layer this sprint
    • Replace AI engagement metrics (tokens, sessions, prompts) with outcome metrics (task completion rate, time saved, revision rate) in your next board deck
    • Add per-endpoint spend caps wired to automatic alerts as a P1 requirement for any AI feature currently in production or shipping this quarter
    • Model your AI feature pricing against Palantir's outcome-based framework: what measurable result does the customer get, and would they pay per result instead of per seat?

    Sources:Laura Bratton · TLDR Marketing · TLDR Dev · TLDR Data · The Pragmatic Engineer · Martin Peers

  3. 03

    Enterprise Buyers Now Ask 'Can Our Agents Call This?' — Your API Surface Has 2-3 Quarters

    The Convergence Signal

    Three of the five largest enterprise software companies shipped agent-callable workflow architectures in the same week. They converged on the same execution layer: headless workflows exposed over MCP. SAP put up a €100M partner fund and a Knowledge Graph that hands business context to agents. ServiceNow separated workflow logic from UI via Action Fabric and exposed it for any third-party agent to call. Notion shipped a full developer platform with agent tool building and plans to host Claude and Codex as "teammates."

    Companies do not stand up hundred-million-euro funds for features. They stand them up for platform bets they intend to defend for years.

    What Changed for Your Product

    A procurement manager at a Fortune 500 opened three enterprise software demos this week and asked each vendor the same question: "Can our agents call this directly, or do my people have to click through your UI?" Two vendors did not have an answer. The third did, and moved to the next stage. The window before that question lands in RFPs is 2-3 quarters.

    The Work Is Smaller Than the Deck Suggests

    For most teams the build is a week of scoping and 2-4 weeks of implementation, assuming the underlying API is not already a mess. The sprint decision is whether to ship an MCP server against the existing API this quarter. The second decision, whether the product's core UI should be restructured around the assumption that an agent is the primary first-touch user, is a roadmap question and does not need to be answered this month.

    The Diagnostic

    Has agent-callable APIUI-only access
    Agent can complete taskYou're the execution target agents pickAgents route to your competitor
    Requires human judgmentSurvive with hybrid flowsSafe for now, monitor

    Separately, Glean benchmarked raw MCP against enterprise knowledge graphs and reported raw MCP used 30% more tokens and was preferred 2.5x less on agentic tasks. The intelligence layer above MCP, not MCP alone, is where differentiation lives. Exposing endpoints is table stakes. Curating the context those endpoints return is the actual product.

    Action items

    • Audit your product's API surface for agent-consumability this sprint: can a third-party AI agent discover, authenticate, and execute your core workflows without a UI?
    • Evaluate the SAP Autonomous Enterprise partner fund for fit with your product — application deadline likely within next quarter
    • Build a knowledge graph or contextual enrichment layer on top of your MCP endpoints rather than exposing raw API responses — reference Glean's 30% token overhead as justification
    • Look at your last 20 enterprise support tickets and feature requests — count how many assume a human vs. an agent doing the work. If the ratio shifted 10+ points toward agents in two quarters, the headless layer is a retention bet due before next renewal cycle

    Sources:TLDR IT · TLDR · Simplifying AI · TLDR Design · ben's bites · a16z

  4. 04

    The PM Role Unbundled: What Survives When AI Collapses the Coordination Layer

    The Case Study

    Elena Verna — former head of growth at Amplitude, Miro, Dropbox, and SurveyMonkey — pushed Lovable's enterprise pricing page to production alone last week. No PM writing requirements. No designer on mocks. No engineer on the build. In a traditional org, that page needed all three people and a week of calendar coordination to land. Verna says she spent 90% of her time building, with almost no meetings.

    Lovable employs zero product managers. The company is growing fast enough that the absence reads as the operating model, not an oversight.

    What's Actually Being Unbundled

    The PM role decomposes into four jobs: user research, prioritization, spec-writing, and cross-functional coordination. Lovable's bet is that the first three collapse into the engineer or operator the moment they talk to users directly, and the fourth disappears when the org is small enough to coordinate in one shared channel. Ravi Mehta's framing is the cleanest version of this: AI makes one person average-to-good at everything at once, which is enough when the alternative was coordinating three specialists.

    The PMs who survive this shift look less like project managers and more like mini-GMs who happen to prototype and iterate directly.

    The Parallel in Regulated Verticals

    Abridge (clinical AI, $5.3B valuation, 250+ health systems) staffs 'clinician scientists' — MDs who are also full-stack engineers. Their release cadence moved from quarterly to monthly, a 4-6x acceleration, not by stripping coordination but by putting domain judgment in the same seat as the code. The pattern ports cleanly. The highest-velocity teams put the person who understands the problem in the chair where the artifact gets built.

    The Diagnostic for Monday

    Two questions, in order:

    1. Of the work shipped last quarter, how much of the PM contribution was judgment about what to build versus coordination of the people building it?
    2. If the coordination half went to zero tomorrow, does the judgment half still justify the role?

    If yes to the second, the job gets better. If no, the job gets done by someone shaped like Verna. The companies that ungate information access — the blocker Verna names — pull disproportionate talent density. The ones protecting management layers end up with coordinators and no builders.

    Action items

    • Calculate your personal build-vs-coordinate ratio this week and benchmark against 90% building — identify the top 3 coordination rituals that could be eliminated or automated
    • Experiment with shipping one small project end-to-end using AI tools (landing page, experiment setup, internal tool) without engaging your cross-functional team this quarter
    • Identify your domain's 'clinician scientist' equivalent and create a hiring spec for technical domain experts who can understand the problem AND build solutions
    • Rewrite your PM career narrative around judgment and strategy rather than coordination — update positioning for the HI-C era

    Sources:Lenny's Newsletter · Latent.Space · TLDR Design · TLDR Dev

◆ QUICK HITS

  • Update: Anthropic Mythos is the first model to clear both UK AISI simulated attack ranges (full network takeover) — jump from 'advanced persistence' to autonomous kill chain in one generation. Assume sophisticated automated attacks reach broader threat actors within 12 months.

    CyberScoop

  • Microsoft's agent memory architecture stabilizes at 400-500 memories with 97.2% retention precision using consolidation, forgetting, and delayed maturation — first validated benchmark for PMs speccing persistent agent features.

    TLDR Data

  • AI persona drift quantified: significant degradation within 8 dialogue rounds due to attention decay as context grows — add drift detection to acceptance criteria for any multi-turn AI feature.

    Brian Ardinger, Inside Outside Innovation

  • Claude Code ships /goal mode: user writes measurable condition, walks away, separate Haiku evaluator judges completion. The evaluator-as-judge pattern (worker proposes done, separate model verifies) is reusable for any autonomous AI workflow.

    Daily Dose of DS

  • Google's Universal Commerce Protocol embeds BNPL (Affirm + Klarna) directly into AI-assisted shopping via Gemini — Affirm targeting $100B GMV with transformer-based underwriting that outperforms its own legacy systems.

    TLDR Fintech

  • Intercom is rebranding the entire company to Fin (their AI agent) — the most aggressive signal yet that product identity is collapsing into the AI layer rather than the workspace around it.

    ben's bites

  • Gemini leaking private phone numbers from training data — PII in foundation model training corpora reaching the output layer. Add output-layer PII scanning if your AI features surface model-generated text to users.

    The Download from MIT Technology Review

  • CRM moat migrating from data to intelligence: Jason Lemkin cut Salesforce from 10+ seats to 2 humans + 1 API seat, spent 83% more ($12K → $22K), with 20+ agents running on top. Revenue held; DAU collapsed.

    a16z

  • GPU compute remains 4:1 demand-to-supply ratio (Nebius, 684% revenue growth). OpenAI committed $20B to Cerebras. Inference cost 'bends down' thesis is structurally challenged for 4-6 more quarters.

    Martin Peers

  • US AI regulation in 'knife fight' — ODNI, Commerce (CAISI), and national security aides in open jurisdictional conflict. CAISI pulled its own webpage announcing Google/Microsoft/xAI voluntary agreements within days of publishing.

    Risky.Biz

◆ Bottom line

The take.

Your AI vendor costs have a 29-day deadline (Anthropic's June 15 third-party repricing), your cost governance has a gap (ServiceNow burned a full year's budget by May because nobody built per-user telemetry), and your API surface has a 2-3 quarter window before enterprise buyers start requiring agent-callable workflows in RFPs. The common thread: the teams that instrumented — costs, outcomes, agent compatibility — before this week are the ones moving in 72 hours. Everyone else is writing the memo they should have written last quarter.

— Promit, reading as Product ·

Frequently asked

What exactly changes for Claude usage through Cursor, Cline, and Zed on June 15?
Third-party harness usage moves to a separate, capped credit pool tied to your subscription's dollar value, after which standard API rates apply. For developers who were effectively getting 70-90% off API pricing through these tools, that's roughly a 10x price increase. Anthropic is offering a 50% rate limit bump for two months as a grace period, not a permanent concession.
What's OpenAI's counter-offer and how long is the window?
OpenAI is giving enterprise switchers two months of free Codex if they commit within 30 days of the announcement. It's displacement pricing aimed precisely at developer frustration with the Anthropic change, and the shot clock is the point — running a parallel Codex evaluation during the free period costs nothing but forces a decision before the window closes.
Why can't ServiceNow tell which users blew through their Anthropic budget?
Anthropic doesn't ship per-user, per-workload cost telemetry, and ServiceNow didn't build its own metering layer before deploying. The result is a full-year budget consumed by May with no attribution. The fix is treating per-customer and per-feature inference cost tracking as a P1 requirement for any AI feature in production, not a finance afterthought.
What should the one-page 'vendor reversal' memo actually contain?
Three thresholds that trigger a switch: a price point (what cost-per-task makes the current vendor uneconomical), a capability gap (what model behavior would force migration), and a reliability threshold (what SLA breach ends the relationship). Teams with this memo ship a decision in 72 hours when pricing moves; teams without one spend the quarter debating in Slack.
Should we expect more Anthropic pricing changes before October 2026?
Plan for at least one more adjustment. Anthropic hired a CFO and is reportedly targeting an October 2026 IPO at a valuation that requires $40-50B in annual revenue, and the prior model of large implicit subsidies to power users doesn't produce the revenue-per-user metrics public markets reward. Negotiate enterprise SLAs and capacity reservations now while their IPO prep creates leverage on your side too.

◆ Same day, different angle

Read this day as…

◆ Recent in product

Keep reading.