Newsletter Archive

Thursday, May 21, 2026

AI Pulse Daily Thursday, May 21, 2026

Visit AI Pulse · Share

Build real-time voice applications with Amazon SageMaker AI and vLLM

AWS SageMaker now supports bidirectional streaming for real-time inference, enabling sub-100ms speech-to-text by eliminating request-response latency. This post shows how to deploy Mistral's Voxtral-Mini-4B speech model using vLLM containers for live captioning, voice agents, and contact center analytics.

Amazon AWS ML · 13 min read Tools

Anthropic will pay xAI $1.25 billion per month for compute

Anthropic will pay xAI $1.25 billion per month for compute access under a new deal, cementing xAI as a major compute provider and giving Anthropic a critical alternative to Nvidia-dependent infrastructure amid ongoing GPU scarcity.

TechCrunch · 3 min read Industry

Anthropic to Pay SpaceX Nearly $45 Billion for Computing Deal

Anthropic commits nearly $45 billion to SpaceX over three years for computing resources to power Claude, marking a major infrastructure partnership in the race for AI compute capacity.

Bloomberg Tech · 1 min read Industry

Quoting SpaceX S-1

SpaceX's S-1 filing reveals a $1.25 billion-per-month cloud services agreement with Anthropic through May 2029, giving the AI lab access to COLOSSUS and COLOSSUS II compute clusters while SpaceX trains Grok 5 on the latter. The deal underscores the scale of modern foundation model training infrastructure and Anthropic's compute hunger.

Simon Willison · 2 min read Industry

Nvidia’s revenue hits record $81B — easily topping Wall Street estimates — on massive demand for AI chips - New York Post

Nvidia posted record $81B revenue, crushing analyst expectations on explosive demand for its AI chips. The result signals sustained market dominance and capital availability for scaling AI deployments.

NY Post · 1 min read Market

Anthropic is paying SpaceX $15 billion per year

Anthropic is paying SpaceX $1.25 billion monthly through May 2029—$15 billion annually—for GPU compute capacity, according to SpaceX's IPO filing. The deal underscores AI labs' acute compute scarcity and represents 83% of SpaceX's current annual revenue, signaling how critical hardware access has become to frontier model development.

Axios · 2 min read Industry

Nvidia Posts Record $82B Quarter as Agentic AI Arrives - PYMNTS.com

Nvidia hit an $82B quarterly revenue record as agentic AI systems move from labs into production. The timing reflects booming enterprise demand for inference infrastructure and the shift from single-model deployments to multi-agent workflows requiring heavier GPU utilization.

PYMNTS · 5 min read Market

AMD Commits More Than $10 Billion to Taiwan AI Investments

AMD pledges over $10 billion in Taiwan investments to expand AI chip partnerships and packaging capacity, signaling an aggressive push against Nvidia's market dominance.

Bloomberg Tech · 1 min read Hardware

Quick Hits

[AINews] OpenAI GPT-next disproves 80 year old Erdős planar unit distance problem for under $1000

Latent Space