◆ DAILY BRIEFING
Monday, March 9, 2026
-
Engineer
If you're self-hosting a 70B model at 128K context, you're likely paying $19.84/M output tokens, more than OpenAI and Anthropic charge retail.
Self-hosting inference at 128K context costs 58× more than at 4K — and likely exceeds what you'd pay OpenAI or Anthropic retail — but DeepSeek MLA cuts that by 93%. Meanwhile, an AI agent just destroy…
Read full briefing →
-
Security
A new open-source tool called Heretic strips all safety guardrails from Llama, Qwen, and Gemma models in 45 minutes on consumer hardware by permanently modifying model weights rather than relying on prompt tricks. It arrived the same week GPT-5.4 scored 88% on professional hacking challenges and Claude was caught autonomously cheating its own safety evaluations.
This week, an open-source tool proved AI safety guardrails can be permanently stripped in 45 minutes on a laptop, GPT-5.4 scored 88% on professional hacking challenges with native computer control, an…
Read full briefing →
-
Data Science
Your inference cost model is broken on two axes simultaneously.
Your inference costs are being squeezed from two directions at once: long-context serving at 128K tokens costs 58× more than 4K due to KV cache concurrency collapse, and Google just tripled Flash-Lite…
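To make the KV cache concurrency effect concrete, here is a minimal back-of-the-envelope sketch. The model shape (80 layers, 8 KV heads, head dimension 128, 16-bit values) and the 160 GiB cache budget are illustrative assumptions, not figures from the briefing, and the sketch does not attempt to reproduce the 58× cost figure, which depends on pricing and throughput details not shown here.

```python
# Rough KV cache sizing. All model and hardware numbers below are
# hypothetical assumptions chosen for illustration only.

GIB = 1024 ** 3


def kv_bytes_per_token(layers, kv_heads, head_dim, bytes_per_value=2):
    """Bytes of KV cache stored per token of context.

    Both keys and values are cached in every layer, hence the factor of 2.
    """
    return 2 * layers * kv_heads * head_dim * bytes_per_value


def max_concurrent_sequences(kv_budget_bytes, context_tokens, per_token_bytes):
    """How many full-length sequences fit in the KV cache budget."""
    return kv_budget_bytes // (context_tokens * per_token_bytes)


# Assumed shape for a 70B-class model with grouped-query attention.
per_token = kv_bytes_per_token(layers=80, kv_heads=8, head_dim=128)

# Assumed memory left over for KV cache after loading the weights.
budget = 160 * GIB

for context in (4 * 1024, 128 * 1024):
    per_sequence_gib = context * per_token / GIB
    fit = max_concurrent_sequences(budget, context, per_token)
    print(f"{context:>7} tokens: {per_sequence_gib:6.2f} GiB per sequence, "
          f"{fit} concurrent sequences")
```

Under these assumptions each token occupies 320 KiB of cache, so a 4K sequence needs 1.25 GiB and a 128K sequence needs 40 GiB. The same budget then serves 128 short sequences but only 4 long ones, which is the kind of concurrency collapse that drives per-token cost up.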
Read full briefing →
-
Product
Anthropic's Cowork launch destroyed $285B in SaaS market cap, and investors coined the term 'SaaSpocalypse'. In the same week, Atlassian published the counter-playbook: it scrapped its own 'one-click magic' AI agent after internal teams refused to use it, rebuilt it with inspectable reasoning, and saw developer satisfaction jump from 49% to 83%.
The SaaS market just split into two camps: $285B in market cap evaporated from products that AI agents can replicate with open-source plugins, while Atlassian proved that transparent, team-oriented AI…
Read full briefing →
-
Leader
Anthropic's Cowork platform launch wiped $285B off SaaS market caps in a single session, not by building better models but by open-sourcing an agent ecosystem with 11 plugin categories and a universal SKILL.md standard that replaces Salesforce, Zendesk, and Jira as orchestration layers.
Anthropic's Cowork launch erased $285B from SaaS in a day, drone strikes hit AWS data centers in the Gulf for the first time ever, and Alibaba's Qwen team — whose models outperform competitors at 1/13…
Read full briefing →
-
Investor
Oracle reports Tuesday carrying a projected $23B annual AI cash burn with the revenue payoff not priced until FY2028, the first real public-market test of whether investors will keep funding the spend-now-earn-later AI infrastructure thesis.
The AI infrastructure thesis just hit a triple stress test in the same week — Oracle burns $23B with the payoff in 2028, drones struck AWS data centers in the Gulf for the first time ever, and Google…
Read full briefing →