AI Forecast Tracker
Week 17 · April 13–26, 2026
high signal

The Acceleration

Six events in two weeks broke the same way. Anthropic crossed $30 billion ARR and overtook OpenAI for the first time, with Google formalizing up to $40 billion of cash and TPU compute. Claude Opus 4.7 took SWE-bench Verified SOTA at 87.6 percent without raising its price. DeepSeek V4 Pro shipped frontier-comparable coding for $3.48 per million output tokens — running on Huawei Ascend, not Nvidia. The iShares software ETF dropped 24 percent in Q1, putting software at a discount to the S&P 500 for the first time in the cloud era. Snap cut 16 percent of its workforce explicitly citing AI and the stock rallied. Indian IT's top five lost 7,389 net heads in FY26 — first negative year since the pandemic — as HCLTech named "AI-led deflation" in management remarks. The frontier accelerates while the floor falls out.

5 predictions updated7 milestones30 companies refreshed

Key Developments

1

Anthropic crossed $30 billion ARR — overtaking OpenAI — and Google formalized up to $40 billion in cash plus 5 gigawatts of TPU compute

Anthropic's run rate ramped from $9 billion at end-2025 to $19 billion in early March to $30 billion in early April — the fastest enterprise software ramp on record, and the first time Anthropic exceeded OpenAI's roughly $25 billion ARR. Enterprise clients paying $1 million or more per year doubled from 500-plus to 1,000-plus in two months. On April 24 Google formalized up to $40 billion of strategic backing: $10 billion cash at a $350 billion valuation, up to $30 billion contingent on performance, and 5 gigawatts of Google Cloud TPU capacity over five years. Combined with Amazon's separately announced $25 billion commitment, the two strategic backers have pledged up to $65 billion behind a single company. Anthropic separately declined primary offers above $800 billion; secondary trades reportedly marked the company above $1 trillion mid-month. Claude Code is the cited primary growth driver, approaching $1 billion annualized within six months of public launch.

Challenges P-012 AI investment boom is a bubble likely to burst by ...Confirms P-030 AI-native companies can achieve billion-dollar rev...Confirms P-038 Approximately 1% of total US economic growth in 20...
Counterpoint

Three triples in one quarter is not three triples in three quarters. Anthropic's $9B → $19B → $30B trajectory could reflect pulled-forward Q2 enterprise budgets closing into Q1 rather than a sustainable new baseline; the Q2 ARR growth rate is the single number that matters. The $40 billion Google commitment is structured as $10 billion cash plus up to $30 billion contingent — meaning $30 billion is a target, not a transfer, and 5 gigawatts of TPU capacity delivered "over five years" is back-end weighted. And Anthropic still spends roughly $7 billion annually on compute against undisclosed gross margins; "fastest ramp on record" can coexist with "deepest losses on record." The tell will be Q2 — if the rate drops below 2x quarter over quarter, the trajectory flattens fast.

Source →
2

Claude Opus 4.7 took SWE-bench Verified SOTA at 87.6% — capability moved while price held flat for the first time at frontier

Anthropic released Claude Opus 4.7 on April 16 across Claude.ai, the API, Bedrock, Vertex, Foundry, Snowflake Cortex, and GitHub Copilot — at the same $5/$25 per million input/output tokens as 4.6. The model scored 87.6% on SWE-bench Verified, three times more production task resolution than 4.6 and 2.6 percentage points ahead of GPT-5.3-Codex at 85.0%. CursorBench moved from 58% to 70%. The release ships with built-in self-verification of outputs and a new xhigh effort tier; image resolution tripled to 2,576 pixels. Two days earlier, the Claude Code desktop app relaunched around parallel agent sessions — orchestrating multiple agents simultaneously is now the default UX, not an edge case. Inference economics finally moved with capability at the frontier instead of trailing it.

Confirms P-001 AI models will handle most aspects of software eng...Confirms P-025 AI can already write 100% of production code; top ...Confirms P-018 AI inference costs have fallen 99%+ in a single ye...
Counterpoint

One caveat: the 87.6% number runs on Anthropic's partner BenchLM.ai harness, not an independent SWE-bench Pro evaluation — classic Benchmark Theater territory where the harness operator and the model vendor share incentives. SWE-bench Verified is the easier sibling of SWE-bench Pro; the gap between them on Opus 4.5 was 36 percentage points (80% vs 44%), and 4.7's Pro number has not landed. METR's task-horizon measurement — the closest thing to a vendor-independent capability gauge — has not yet re-tested 4.7 either. A 10-14% release-to-release gain is within the normal Claude variance (4.4 to 4.5 was about 12%); the structural question is whether the 89-day doubling cadence has held. If the Pro and METR numbers, when they arrive, show smaller deltas than the Verified jump suggests, the headline overshoots the reality.

Source →
3

DeepSeek V4 ships frontier-comparable coding for $3.48 per million tokens — running on Huawei Ascend, not Nvidia

DeepSeek released V4-Pro and V4-Flash previews on April 24: V4-Pro is a 1.6 trillion parameter mixture-of-experts model (49 billion active per token, 1 million token context) — the largest open-weight model ever released. On benchmarks it hits 80.6% SWE-bench Verified (matches Claude), 93.5% LiveCodeBench (ahead of Gemini 91.7% and Claude 88.8%), 96.4% AIME 2026 (vs Claude 4.6 95.0%), and Codeforces 3206 (ahead of GPT-5.4's 3168). Pricing is $1.74/$3.48 per million input/output tokens — about 5x cheaper than Claude Opus 4.6 and 8-10x cheaper than GPT-5.5. Critically, V4 was confirmed running on domestic Huawei Ascend and Cambricon chips; V3 had required Nvidia. Combined with Alibaba's 10,000-chip Zhenwu cluster opened in Guangdong this month, and Shenzhen's 10,000-card Huawei Ascend cluster activated the same week, the export-control workaround is operational rather than theoretical. Stanford HAI's 2026 AI Index put the US lead on frontier-model performance at 2.7%.

Confirms P-018 AI inference costs have fallen 99%+ in a single ye...Confirms
Counterpoint

Self-reported benchmarks from Chinese model labs have historically replicated at about 85% of the claimed numbers when independent third parties retest — Benchmark Theater territory until DeepSeek's V4 numbers land on a third-party harness. Assume the headline scores will sit around 75-80% Verified rather than 80.6%. Benchmark parity is not deployment parity: US hyperscalers still hold roughly 5-10x the inference infrastructure China does, and 5 gigawatts of new TPU capacity routed exclusively to Anthropic in a single deal exceeds the total capacity of Alibaba's flagship Zhenwu cluster. The Huawei Ascend performance per watt remains 30-40% below Hopper-class silicon, which means V4 economics improve only when measured per token; per training run, the gap remains large. And the export-control bypass story is undermined by the DOJ's $2.5 billion Wally Liaw indictment April 16 alleging Supermicro hardware routed to China through Malaysia — domestic alternatives are growing because the smuggling channel is closing.

Source →

What the Evidence Moved

P-001AI models will handle most aspects of software engineering t...

Three concurrent April signals. Claude Opus 4.7 took SWE-bench Verified SOTA at 87.6% (April 16) at unchanged $5/$25 pricing — three times more production task resolution than 4.6 — with the Claude Code desktop app rebuilt April 14 around parallel agent sessions. BNY Mellon Q1 2026 disclosed >40% of code AI-authored across 218 production AI solutions. Snap (April 15) cited 65% AI-generated codebase when announcing 1,000 layoffs and $500M annualized cost reduction. Multi-source confirmation that 'AI handles most software engineering tasks' is operationally real at frontier firms.

70%73% +3pp
P-002AI will disrupt 50% of entry-level white-collar jobs over 1-...

Stanford HAI 2026 AI Index (April 13) sourced entry-level dev employment to a tier-A institution: ages 22-25 down nearly 20% since 2024. Goldman Sachs decomposition: 25K substituted minus 9K augmented = 16K net AI-attributed jobs lost monthly, with the substitution-to-augmentation ratio widening from 1.5:1 to 2.8:1. Challenger Gray's March report named AI in 25% of layoffs (up from 10% in February) — first time AI led all stated reasons in a single month. Snap and Meta added clean named-cause cases.

58%62% +4pp
P-02140% of enterprise applications will feature task-specific AI...

Google Cloud Next '26 committed $750M to a five-year partner agentic AI fund covering its 120K-member partner network — largest hyperscaler partner investment ever. Anthropic's MCP protocol crossed 10,000 enterprise server implementations and 97M+ SDK downloads. Salesforce Agentforce hit $800M ARR (+169% YoY) and announced cross-platform integration with Gemini Enterprise Agent Platform — agents from different vendors now sharing context across enterprise workflows. Platform layer for Gartner's 40% threshold is materializing on schedule.

80%83% +3pp
P-022Over 40% of agentic AI projects will fail by 2027 due to uni...

Gartner Hype Cycle for Agentic AI landed alongside Writer's 2026 enterprise survey: 79% face significant adoption challenges despite 88% running GenAI; 47% of agent-deploying firms still lack security safeguards; 29% of employees use unsanctioned agents. HCLTech CEO formally named 'AI-led deflation' April 21. Governance lag is broader and deeper than Gartner's original number assumed; the 40% failure-by-2027 projection looks conservative now.

75%78% +3pp
P-025AI can already write 100% of production code; top engineers ...

SpaceX option to acquire Cursor at $60B (April 21) confirms Cursor at $2B ARR with ~40 employees as the canonical AI-native productivity case. Claude Code parallel sessions (April 14) make multi-agent orchestration the default workflow. BNY Mellon (40% AI code) and Snap (65% AI code) are the cleanest enterprise-scale validations. Power-law productivity claim is confirmed at frontier-firm level, not aspirational.

70%75% +5pp

Company Impact

Anthropic

Data refresh

ARR $30B (April 2026) — overtook OpenAI; 1,000+ enterprise clients at $1M+/year (2x in two months); Opus 4.7 takes SWE-bench Verified SOTA at 87.6%; Google formalized up to $40B commitment April 24 (cash + 5GW TPU). Score holds at 9.1 (ceiling); summary refreshed.

TechCrunch · Bloomberg · CNBC · Anthropic

9.1
See assessment →

Alphabet / Google

Data refresh

Cloud Next '26 — $750M agentic AI partner fund, Gemini Enterprise Agent Platform, 8th-gen TPUs (April 22). Up to $40B Anthropic commitment April 24 (cash + 5GW TPU compute over 5 years). Q1 2026 earnings April 29 — re-check after report. Score holds at 7.8.

Google Cloud Press · TechCrunch · Bloomberg

7.8
See assessment →

BNY Mellon

Score change

Q1 2026 — revenue $5.4B (+13% YoY), EPS $2.24 (+42%), 218 production AI solutions (4x YoY), >40% of new code AI-authored, 50% of annual account plans AI-drafted, 70% of restricted-party payment screening reviewed by AI. NVIDIA SuperPOD on-prem added. aam 8→9, score 6.9→7.0.

Investing.com · Motley Fool transcript

7.0
See assessment →

BlackRock

Data refresh

Q1 2026 — revenue $6.7B (+27% YoY), AUM $13.9T, Aladdin tech services revenue +22% YoY to $530M (annualizing ~$2.1B). Project Prometheus anchor investor (April 23) — $10B raise at $38B valuation. Score holds at 8.0; flagged for next-pass review on Aladdin run rate.

BlackRock IR · Yahoo Finance · Bloomberg

8.0
See assessment →

Bank of America

Data refresh

Q1 2026 — NII $15.9B (+9%, beat consensus), EPS $1.11, net income $8.6B (+17%). Efficiency ratio improved 63%→61% (290bps), with AI cited as direct contributor. Erica at 21.3M users, 169M Q1 interactions. Full-year NII guide raised to +6-8%. Score holds at 6.9.

Bank of America IR · CNBC

6.9
See assessment →

Adobe

Data refresh

Factual fix: Firefly ending ARR exceeded $250M Q1 FY2026 — corrected from prior $400M figure that was not corroborated. Generative credit consumption +45% QoQ, MAU 80M+ (+50% YoY), Express in 99% of Fortune 500. $25B buyback through April 2030. Score holds at 5.8.

Yahoo Finance · Futurum Group

5.8
See assessment →

Cursor (Anysphere)

Data refresh

April 21 — SpaceX struck deal granting option to acquire Anysphere for $60B later in 2026 (or $10B breakup fee). Cursor at $2B ARR with ~40 employees. Separate $2B fundraise at $50B valuation talks ongoing (a16z, Thrive). Score holds at 8.1.

TechCrunch · Bloomberg

8.1
See assessment →

Sources

Next issue drops Monday

Subscribe to get the briefing before the market opens.