Newsletter Archive

Thursday, April 30, 2026

AI Pulse Daily
AI evals are becoming the new compute bottleneck

Evaluating frontier AI models has become prohibitively expensive: a single GAIA benchmark run costs $2,829, agent leaderboards spend $40K per evaluation round, and scientific ML architecture sweeps require thousands of H100-hours. Cost spreads of 33× on identical tasks reveal that evaluation methodology—not just model capability—now drives budget decisions.

Hugging Face · 17 min read · Research

[AINews] The Inference Inflection

OpenAI, DeepMind, and Intel execs are signaling that inference compute, not training, is becoming the critical bottleneck in AI infrastructure. CPU demand is surging as enterprises refresh aging hardware and shift budgets from GPUs to inference optimization, an inflection point in a five-to-six-year cycle that could reshape how companies architect AI systems.

Latent Space · 10 min read · Market

OpenAI Meets Key AI Computing Capacity Goal Ahead of Schedule

OpenAI has secured its target AI computing capacity in the US years earlier than planned, clearing the path for accelerated data center expansion and larger model training runs.

Bloomberg Tech · 1 min read · Industry

Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital

A 21-day live deployment of 3,505 language-model trading agents on Ethereum with real capital showed that reliability comes not from the base model but from the operating layer around it: prompt compilation, typed controls, policy validation, and execution guards. The system processed 7.5M agent invocations and $20M in trading volume with 99.9% settlement success, but only after targeted fixes (e.g., reducing fabricated sell rules from 57% to 3%) for failures that text-only benchmarks never caught.

arXiv AI · 3 min (abstract) · Research
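
For readers who want a concrete picture of an "operating layer," here is a minimal Python sketch of three of the pieces named above: a typed control, a policy validation step, and an execution guard. It is an illustration only, not the paper's implementation. Every name in it (Order, Policy, parse_order, validate, guarded_execute) and every limit is hypothetical, and the submit callback stands in for whatever actually sends a transaction.

```python
import math
from dataclasses import dataclass
from enum import Enum


class Side(Enum):
    BUY = "buy"
    SELL = "sell"


@dataclass(frozen=True)
class Order:
    """Typed control: the agent must emit this structure, not free text."""
    side: Side
    token: str
    amount_usd: float


@dataclass(frozen=True)
class Policy:
    """Hypothetical user-approved limits for one agent."""
    allowed_tokens: frozenset
    max_order_usd: float
    allow_sell: bool


def parse_order(raw: dict) -> Order:
    """Raise if the model output does not fit the typed schema."""
    return Order(Side(raw["side"]), str(raw["token"]), float(raw["amount_usd"]))


def validate(order: Order, policy: Policy) -> list:
    """Policy validation: return a list of violations (empty means OK)."""
    problems = []
    if order.token not in policy.allowed_tokens:
        problems.append("token not allowed")
    amount = order.amount_usd
    if not math.isfinite(amount) or amount <= 0 or amount > policy.max_order_usd:
        problems.append("amount outside limits")
    if order.side is Side.SELL and not policy.allow_sell:
        problems.append("sell not authorized by policy")
    return problems


def guarded_execute(raw: dict, policy: Policy, submit) -> str:
    """Execution guard: call submit() only when parsing and validation pass."""
    try:
        order = parse_order(raw)
    except (KeyError, ValueError, TypeError) as exc:
        return f"rejected: malformed order ({type(exc).__name__})"
    problems = validate(order, policy)
    if problems:
        return "rejected: " + "; ".join(problems)
    submit(order)
    return "submitted"
```

In this sketch, a sell order the policy never authorized is stopped before anything is submitted, whatever the model's text said. That is the general idea behind catching failures a text-only benchmark would miss.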

Cursor Introduces a TypeScript SDK for Building Programmatic Coding Agents With Sandboxed Cloud VMs, Subagents, Hooks, and Token-Based Pricing

Cursor released a public beta TypeScript SDK that gives developers programmatic access to its coding agents, enabling deployment via CI/CD pipelines, backend services, and embedded workflows rather than just interactive use in the IDE.

MarkTechPost · 6 min read · Tools

SoftBank reportedly weighs $100 billion valuation for new AI and robotics spinout in potential U.S. IPO

SoftBank is reportedly weighing a $100 billion valuation for Roze, a new AI and robotics spinout, in a potential U.S. IPO. The move would consolidate its bets in autonomous systems and artificial intelligence into a standalone public entity.

CNBC Tech · 1 min read · Industry

US Big Tech Ratchets Up AI Spending Past $700 Billion This Year

US tech giants plan to spend $725 billion on capital expenditures this year, with the majority going to AI data center infrastructure. This massive spending surge reflects the industry's bet-the-company shift toward AI capabilities and has downstream implications for GPU scarcity, cloud pricing, and compute availability.

Bloomberg Tech · 1 min read · Market

AI agent governance takes focus as regulators flag control gaps

Australia's financial regulator (APRA) warned banks and superannuation trustees that AI governance is dangerously immature, finding boards lack understanding of model risks and over-rely on vendor claims. The warning follows a review of major institutions deploying AI in loan processing, fraud detection, and customer service—forcing financial firms to overhaul oversight practices.

AI News · 3 min read · Policy

Quick Hits
Big Tech just proved AI infrastructure spending works. Then it raised the bill anyway · AI News · Market
'Our enterprise AI solutions have become our primary growth driver for cloud for the first time,' CEO Sundar Pichai tells analysts, noting that sales on those products grew eightfold from a year ago · Facebook · Industry
Exploding number of AI data center build-outs delay Texas housing projects: data centers' high demand for electricians prices out contractors, homes now take two months longer to complete · Tom's Hardware · Hardware
OpenAI’s new security model is for ‘critical cyber defenders’ only · The Verge AI · Industry
Anthropic Weighs Funding Round at Valuation Above $900 Billion · PYMNTS · Market
