Security daily

Edition 2026-03-22 · read as Security

ClaudeCodeChannelsTurnsTelegramIntoaShellBackdoor

Sources
10
Words
1,806
Read
9min

Topics Agentic AI AI Regulation AI Safety

◆ The signal

Claude Code Channels now bridges Telegram and Discord directly to live code execution sessions — protected only by a sender allowlist and pairing code. A compromised messaging account gives an attacker interactive shell access to your developer's environment, bypassing your VPN, EDR, and network segmentation entirely. This drops alongside METR data showing 50% of AI-generated PRs that pass automated tests would fail human review, and Cursor silently swapping its foundation model to Chinese open-source Kimi K2.5 without notifying users. Audit your developers' AI tooling inventory today — you're defending an SDLC you no longer control.

◆ INTELLIGENCE MAP

  1. 01

    AI Coding Agents Open New C2 Vectors with 50% Defect Rate

    act now

    Claude Code Channels bridges messaging apps to code execution via MCP. Open SWE ships shell access with live message injection — a built-in prompt injection vector. METR confirms ~50% of AI-generated PRs fail human review. Cursor silently swapped to Chinese Kimi K2.5 model. Five major AI coding agents launched simultaneously.

    ~50%
    AI PRs failing human review
    4
    sources
    • AI PRs that fail review
    • Concurrent agent launches
    • Open SWE tools shipped
    • Cursor model cost
    1. Pass automated + human review50
    2. Pass automated, fail human review50
  2. 02

    Autonomous AI Agents Hit Enterprise Scale Before Governance

    monitor

    Single-user token consumption surged 6,000x from 150K to 870M tokens/day using multi-agent architectures. AI agents now mimic human traffic to evade bot detection and can discover+use unfamiliar APIs zero-shot. New x402 and mpp protocols enable autonomous agent payments. Companies gamifying AI adoption are incentivizing data leakage.

    870M
    tokens/day per user
    4
    sources
    • Token growth
    • Sub-agents per user
    • Vera Rubin efficiency
    • Groq valuation
    1. Summer 20240.15
    2. March 2026870
  3. 03

    Enterprise AI Vendor Instability and Semiconductor Concentration

    monitor

    Anthropic surged from 40% to 73% enterprise share in 90 days while OpenAI collapsed to 26% — your vendor risk assumptions are outdated. NVIDIA locked 70% of TSMC 3nm capacity, creating extreme supply concentration. A 44GW data center power shortfall through 2028 threatens cloud SaaS availability. Anthropic faces March 24 DoD hearing that could alter service terms.

    73%
    Anthropic enterprise share
    3
    sources
    • Anthropic share
    • OpenAI share
    • TSMC 3nm locked
    • Power shortfall
    1. Anthropic73
    2. OpenAI26
  4. 04

    Infrastructure Fragility: IoT Vendor Attacks and Energy Disruption

    background

    Intoxalock breathalyzer vendor attack bricked vehicles nationwide — drivers stranded with zero local fallback. Iran war's Strait of Hormuz closure entering week four has spiked jet fuel from $85 to $200/barrel. SE Asian nations imposing 4-day workweeks and shutting schools as energy reserves deplete. Data center power reliability degrading across the region.

    $200
    jet fuel per barrel
    2
    sources
    • Strait closure
    • Jet fuel baseline
    • Jet fuel now
    • Laos gas stations shut
    1. Pre-conflict87
    2. Current200

◆ DEEP DIVES

  1. 01

    Messaging Apps as Developer C2, a 50% AI Defect Rate, and a Silent Chinese Model Swap — Your SDLC's New Threat Model

    Three Simultaneous Shifts Your SDLC Wasn't Designed For

    Four independent intelligence sources this cycle converge on a single conclusion: AI coding agents have crossed from developer convenience to active threat surface — and the transition happened faster than security governance could follow. Three specific developments demand immediate response.


    1. Claude Code Channels: Messaging-to-Shell C2

    Anthropic's Claude Code v2.1.80+ now includes Channels, a research preview that bridges Telegram and Discord bots directly to active code execution sessions via MCP. The access control model consists of a sender allowlist with pairing-code verification — strangers get silently dropped, but anyone on the list has interactive shell access.

    The security implication is stark: this is remote code execution capability gated by messaging platform account security. Telegram has a documented history of session hijacking (SIM swaps, SS7 attacks, token theft). Discord accounts are routinely compromised via phishing and token grabbers. A compromised messaging account with an entry on the allowlist gives an attacker interactive control over a developer's environment — effectively a C2 channel that bypasses your VPN, EDR, and network segmentation.

    Claude Code Channels turns a compromised Discord token into a live shell session on your developer's workstation — no malware delivery needed.

    2. METR Research: Half of AI-Generated PRs Are Defective

    METR's analysis of ~300 AI-generated PRs that passed SWE-bench automated grading found roughly 50% would not be merged by human reviewers. Failures clustered around code quality issues, broken surrounding code, and core functionality failures that test suites missed entirely. This matters because companies like Stripe, Ramp, and Coinbase have deployed internal coding agents that pick up tickets, write code in sandboxes, and open PRs without human intervention — and the agents run at 3 AM.

    Combined with LangChain's Open SWE shipping ~15 tools including shell execution, PR creation, and a live message injection feature where middleware intercepts follow-up messages from Slack or Linear and feeds them to the executing agent, the attack chain becomes concrete: adversary posts crafted message in monitored Slack channel → message injected into executing agent → agent with shell access executes attacker's intent.

    3. Cursor's Silent Foundation Model Swap

    Cursor, one of the most widely adopted AI coding assistants, built its new model on Kimi K2.5, a Chinese open-source model from Moonshot AI — without conspicuous disclosure to users. Every developer using Cursor for code completion, refactoring, or generation is processing source code context, variable names, internal API structures, and business logic through an inference pipeline built on a foreign foundation model with opaque training data provenance. If your organization approved Cursor six months ago, you approved a different model stack than what's running today.

    Cross-Source Pattern: MCP as Systemic Risk

    MCP (Model Context Protocol) now appears in Claude Code Channels, Google Stitch, Colab notebooks, Browserbase, and scheduled tasks. It is becoming the TCP/IP of agent-to-tool communication. This standardization creates systemic risk: a vulnerability in MCP's specification or widely-used implementations would have Log4Shell-scale blast radius across the entire AI agent ecosystem. The open plugin architecture compounds this — community-built connectors mediate between messaging platforms and code execution environments with no security audit requirement.

    CapabilityToolAttack VectorBlast Radius
    Messaging → code executionClaude Code ChannelsCompromised Telegram/DiscordDeveloper workstation, repos, credentials
    Shell execution + message injectionOpen SWEPrompt injection via Slack/LinearSandbox, GitHub repos, CI/CD
    Scheduled unattended executionClaude Code TasksCompromised MCP credentialsAll connected repos, CI/CD, MCP tools
    Opaque model swapCursor / Kimi K2.5Supply chain provenance shiftAll code processed through Cursor

    Compliance Gap

    If your organization maintains SOC 2 Type II, your change management controls likely don't account for autonomous agents creating PRs at 3 AM. Auditors will ask: who approved this change? If the answer is an AI agent, you have a control gap. GDPR and HIPAA implications compound if AI-generated code touches sensitive data paths — defects in AI-authored code handling PII create compliance violations no test caught but every auditor will find.

    Action items

    • Block or restrict Claude Code Channels on corporate development machines until security team validates access control model; if permitted, mandate hardware MFA on all linked Telegram/Discord accounts
    • Inventory all AI coding tools (Claude Code, Cursor, Open SWE, Copilot) across engineering by end of this week, with specific attention to Cursor installations processing proprietary code through Kimi K2.5
    • Implement mandatory human review gates for all AI-generated PRs this sprint — add 'generated-by-agent' labels to CI/CD pipeline and block autonomous merges without human approval
    • Audit all MCP server configurations and apply least-privilege to all agent credentials this sprint — scope GitHub tokens to specific repos, implement automatic secret rotation on agent service accounts
    • Update AI tool vendor contracts this quarter to require notification of foundation model changes and data processing transparency

    Sources:Your devs can now control code execution from Telegram — and autonomous agents are writing PRs at 3 AM · AI coding agents flooding your dev pipeline — and your supply chain risk model isn't ready · Your developers using Cursor? Its new model quietly runs on a Chinese open-source foundation — check your AI supply chain · Low-Signal Week: No CVEs, But Facial-Recognition Search Tools and AI Automation Blind Spots Deserve Your Watch List

  2. 02

    870 Million Tokens Per Day, Autonomous Payments, and Gamified Data Leakage — The AI Agent Governance Vacuum

    Enterprise AI Agent Adoption Is Outrunning Every Governance Framework

    Four independent sources this cycle document an AI agent scale-up that has no governance infrastructure behind it. The numbers are staggering, the capabilities are expanding, and the controls are absent. This isn't a future-state warning — it's a current-state gap assessment.


    The Scale Problem: 6,000x Token Growth in 18 Months

    A documented power user went from ~150,000 tokens/day in summer 2024 to 870 million tokens/day in March 2026 — a 6,000x increase — using a multi-agent architecture with specialized sub-agents handling portfolio management, research, editorial analysis, and economic modeling. NVIDIA's Jensen Huang is pushing every company toward OpenClaw, the orchestration framework enabling this pattern, calling it "the web browser of 1992."

    From a security lens, each of those 870 million tokens flows through AI inference pipelines touching sensitive business data. The orchestrator delegates to sub-agents, creating implicit trust chains between AI components. A research sub-agent ingesting external data is an ingress point — if prompt-injected via a poisoned document, it passes tainted context to agents with financial system access. This is lateral movement via context propagation.

    The next major enterprise security crisis won't come from a vulnerability in your perimeter — it'll come from an autonomous AI agent with overprivileged access processing a billion tokens of your sensitive data while everyone's asleep.

    The Capability Problem: Agents That Evade Detection and Spend Money

    Computer-use agents have reached fidelity sufficient to perfectly mimic human browsing behavior, degrading bot-detection and anti-scraping defenses. AI models starting with Claude 4.5+ and Codex 5.2+ can discover and use unfamiliar APIs zero-shot — without prior training or human guidance. This means any exposed API with a readable schema is now navigable by autonomous agents, including undocumented endpoints and legacy services relying on obscurity.

    Two new open payment protocols — x402 from Coinbase and mpp from Stripe/Tempo — enable agents to autonomously discover merchants, execute payments, and verify transactions using sub-cent stablecoin fees. Public registries at x402scan.com and mppscan.com catalog payment-accepting API endpoints — effectively a reconnaissance directory for any agent or adversary. Note: these claims come from a promotional a16z guest post by the CEO of Merit Systems, which operates several of the products cited — calibrate confidence accordingly.

    The Incentive Problem: Gamified Data Leakage

    Multiple sources confirm companies are gamifying AI tool usage with internal leaderboards tied to performance reviews, driving massive token consumption. This creates a perverse incentive to maximize data input to AI tools regardless of classification — proprietary code, customer records, strategic plans, security configurations all flowing to third-party AI providers. This is the insider threat your DLP wasn't designed for: well-intentioned employees, incentivized by management, systematically exfiltrating data to AI platforms.

    Compounding this, enterprise copilot deployments are broadly disappointing — most delivering underwhelming 30% speedups when evaluated only on "time saved." When sanctioned tools disappoint, users seek alternatives. Shadow AI proliferates precisely in the teams where sanctioned adoption is lowest.

    The Infrastructure Catalyst

    NVIDIA's combined Vera Rubin + Groq architecture arriving later in 2026 will deliver 35x throughput per megawatt over current Blackwell chips. NVIDIA valued inference-specialist Groq at $20 billion. When inference costs plummet by an order of magnitude, the economic governor on agent proliferation disappears. The 12-18 month window before this hardware ships is your governance preparation window.

    MetricSummer 2024March 2026Security Implication
    Tokens/day (single user)~150K100M–870M6,000x more data through AI pipelines
    Agent architectureSingle chatbot1 orchestrator + 4 sub-agents5x more trust boundary crossings
    Human oversightEvery interactionAutonomous overnightZero real-time anomaly detection
    Payment capabilityNoneAutonomous via x402/mppFinancial fraud via prompt injection

    Action items

    • Conduct a threat model assessment for any current or planned multi-agent AI deployments this sprint — map agent-to-agent trust boundaries, data access patterns, and authorization inheritance chains
    • Implement token consumption monitoring as security telemetry — set baselines, alert on deviations, treat token budgets as both financial and security controls
    • Deploy or extend DLP to monitor AI tool traffic, and establish approved AI tool lists that block unsanctioned alternatives — prioritize teams where sanctioned copilot adoption is lowest
    • Check x402scan.com and mppscan.com for any of your domains or vendor domains this week — add to external attack surface monitoring
    • Evaluate OpenClaw's security architecture before it enters your environment this quarter — assess authentication, sandboxing, output validation, and supply chain integrity

    Sources:870M tokens/day through autonomous agents — your data governance model wasn't built for this · AI Agents Now Bypass Your Bot Defenses and Spend Money Autonomously — Here's Your New Threat Surface · Intoxalock breach bricked safety-critical IoT — is your connected fleet next? · Your AI Governance Blind Spot: Enterprise 'Org Legibility' Pushes Are Expanding Your Data Attack Surface

  3. 03

    Enterprise AI Vendor Landscape Inverted — Your Third-Party Risk Posture Needs Recalibration

    The Market Shifted Faster Than Your Vendor Risk Assessments

    The enterprise AI vendor landscape experienced a seismic inversion in Q1 2026: Anthropic surged from 40% to 73% enterprise market share while OpenAI collapsed from 60% to 26%. Anthropic's Claude Code alone reportedly generated $2.5 billion in February 2026 revenue. OpenAI is in strategic retreat — throttling its $1.6 trillion Stargate data center project and pivoting from building to renting infrastructure. For security leaders, this isn't a market analysis — it's a third-party risk event with immediate implications.


    Vendor Stability as Security Risk

    If your organization has enterprise OpenAI integrations, their infrastructure retreat raises concrete questions: Where does your data reside? As OpenAI moves from owned to rented infrastructure, your data residency assumptions may no longer hold. What are your SLA guarantees worth? A company pivoting its infrastructure model is, by definition, introducing transition risk. What's your vendor lock-in exposure? Organizations that didn't build API abstraction layers now face operational risk tied to a destabilizing provider.

    Simultaneously, Anthropic faces its own risk vector: active federal litigation with the Department of Defense over AI technology use restrictions, with a hearing scheduled for March 24 before Judge Rita Lin. Anthropic's Head of Policy Sarah Heck and Head of Public Sector Thiyagu Ramasamy filed sworn declarations claiming the Pentagon's security concerns were never raised during negotiations. The outcome could trigger terms of service changes, service restrictions, or operational disruptions for commercial customers.

    Both leading AI vendors are destabilized — one by market collapse, the other by federal litigation. If either is in your critical path, your continuity plan needs a backup.

    Semiconductor Supply Chain Compounds the Risk

    NVIDIA has locked 70% of TSMC's 3nm capacity, while ASML's EUV production ceiling of ~700 units/year physically constrains global fab expansion. Morgan Stanley projects a 44 gigawatt data center power shortfall through 2028. These aren't theoretical constraints — they mean chronic cloud capacity pressure affecting SaaS availability, extended lead times for security hardware, and geopolitical concentration risk on Taiwan that remains unmitigated.

    Sources disagree on urgency: one source frames the Anthropic market share figures with high confidence, while others note the numbers lack sourced methodology. The directional shift is corroborated across procurement channels, but the exact percentages should be treated as estimates, not confirmed intelligence. Regardless, the volatility itself is the risk signal.

    Workforce Pipeline Erosion

    CS graduate placement has collapsed from 89% to 19% in 2.5 years. While this may seem tangential, the security talent pipeline shares the same academic feeder. Combined with Gemini cutting ~30% of staff while citing AI productivity gains, the signal is that AI-driven workforce contraction will eventually constrain security hiring — making AI-assisted security tooling and internal upskilling more critical.

    Action items

    • Reassess enterprise AI vendor risk scores for both OpenAI and Anthropic this week — specifically review data residency, infrastructure commitments, and SLA guarantees in light of OpenAI's infrastructure pivot and Anthropic's March 24 DoD hearing
    • Build or validate an API abstraction layer that allows switching between AI providers without rewriting integrations this quarter
    • Extend hardware procurement timelines for 2026–2027 security appliance refresh cycles — factor in semiconductor supply concentration when budgeting

    Sources:Your AI vendor stack is shifting under you — Anthropic just ate OpenAI's enterprise share while your supply chain bets on TSMC face new concentration risk · Intoxalock breach bricked safety-critical IoT — is your connected fleet next? · Your Super Micro servers just became a supply chain risk: co-founder caught smuggling AI chips to China via front companies

◆ QUICK HITS

  • Intoxalock breathalyzer vendor hit by cyberattack — vehicles bricked nationwide as ignition interlock devices couldn't complete routine checks, stranding drivers with zero local fallback

    Intoxalock breach bricked safety-critical IoT — is your connected fleet next?

  • White House federal AI framework drops, explicitly aimed at preempting state-level AI enforcement — three separate sources flagged it; assign GRC lead to map implications against your current compliance controls (SOC 2, state AI acts)

    AI coding agents flooding your dev pipeline — and your supply chain risk model isn't ready

  • WordPress.com now lets AI agents autonomously write and publish posts without human review — collapses cost of phishing infrastructure, SEO poisoning, and brand impersonation at scale

    Your developers using Cursor? Its new model quietly runs on a Chinese open-source foundation — check your AI supply chain

  • Facial-recognition search engines now match uploaded photos to OnlyFans profiles — trivially weaponizable for social engineering, doxxing, and executive reconnaissance via LinkedIn headshots

    Low-Signal Week: No CVEs, But Facial-Recognition Search Tools and AI Automation Blind Spots Deserve Your Watch List

  • Update: Iran conflict — SE Asian energy crisis intensifies with jet fuel at $200/barrel (up from $85–90), 40% of Laos gas stations closed, Philippines on 4-day workweeks; if you have data center or BPO dependencies in the region, activate continuity monitoring

    Your Super Micro servers just became a supply chain risk: co-founder caught smuggling AI chips to China via front companies

  • SEC approved Nasdaq pilot to trade tokenized stocks and ETFs via blockchain settlement — introduces smart contract and custody risks into regulated financial infrastructure; flag for finserv security architecture teams

    Intoxalock breach bricked safety-critical IoT — is your connected fleet next?

  • Enterprise 'org legibility' push is centralizing tacit institutional knowledge into RAG corpora and knowledge bases — creating concentrated, high-value breach targets most security programs haven't classified or access-controlled

    Your AI Governance Blind Spot: Enterprise 'Org Legibility' Pushes Are Expanding Your Data Attack Surface

  • SOAR automation complacency paradox: playbooks that auto-close 95% of alerts correctly train analysts to ignore the 5% where adversaries live — audit auto-close rates by category and implement mandatory human spot-checks

    Low-Signal Week: No CVEs, But Facial-Recognition Search Tools and AI Automation Blind Spots Deserve Your Watch List

◆ Bottom line

The take.

AI coding agents now bridge messaging platforms directly to code execution, run scheduled tasks overnight without human oversight, and process proprietary source code through silently-swapped Chinese foundation models — while METR research confirms half their output that passes automated testing wouldn't survive human review. Your SDLC was designed for human developers with accountability and audit trails; block Claude Code Channels on corporate machines today, inventory every AI coding tool by Friday, and mandate human approval on all agent-generated PRs before the defects they introduce become the vulnerabilities attackers exploit.

— Promit, reading as Security ·

Frequently asked

How does Claude Code Channels actually give attackers shell access?
Claude Code v2.1.80+ Channels bridges Telegram and Discord bots to active code execution sessions via MCP, gated only by a sender allowlist and pairing code. If an attacker compromises a messaging account on that allowlist — through SIM swap, SS7 attack, phishing, or token theft — they gain interactive shell access to the developer's environment, bypassing VPN, EDR, and network segmentation entirely.
What should I do this week to respond to these risks?
Block or restrict Claude Code Channels on corporate dev machines until the access control model is validated, and mandate hardware MFA on any linked Telegram or Discord accounts. In parallel, inventory all AI coding tools across engineering — paying particular attention to Cursor installs now running on Kimi K2.5 — and add a mandatory human review gate for AI-generated PRs in CI/CD.
Why does Cursor's model swap to Kimi K2.5 matter for security?
Cursor quietly rebuilt its model on Kimi K2.5, a Chinese open-source foundation from Moonshot AI, without clearly notifying users. Every developer using Cursor is now sending source code, variable names, internal API structures, and business logic through an inference pipeline with opaque training data provenance — meaning your six-month-old vendor approval no longer reflects the model stack processing your proprietary code.
How credible is the claim that 50% of AI-generated PRs would fail human review?
The figure comes from METR's analysis of roughly 300 AI-generated PRs that had already passed SWE-bench automated grading, with about half judged unmergeable by human reviewers due to code quality issues, broken surrounding code, and missed core functionality. It's a single study with a specific benchmark scope, but the direction is corroborated by enterprise reports of autonomous agents at companies like Stripe, Ramp, and Coinbase opening PRs overnight without human oversight.
Why is MCP described as a systemic risk rather than just another protocol?
MCP now mediates agent-to-tool communication across Claude Code Channels, Google Stitch, Colab, Browserbase, and scheduled tasks, making it the de facto standard for agent integrations. A vulnerability in the MCP specification or a widely-used implementation would have Log4Shell-scale blast radius, and the open plugin architecture means community-built connectors bridge messaging platforms and code execution with no mandatory security audit.

◆ Same day, different angle

Read this day as…

◆ Recent in security

Keep reading.