Newsletter Archive

Friday, March 27, 2026

How we build evals for Deep Agents · Quantization from the ground up · Speaking of Voxtral - Mistral AI
AI Pulse
How we build evals for Deep Agents

LangChain shares its framework for building targeted evaluations that shape agent behavior: source production-relevant data, design metrics that measure the specific behaviors you care about (not just benchmark scores), and run focused experiments over time. The key insight, that more evals do not automatically make better agents, challenges common testing practices.

LangChain · 9 min read · Tools

Quantization from the ground up

Sam Rose's interactive essay breaks down LLM quantization from first principles and shows why removing outlier "super weights" can tank model quality. Using perplexity and GPQA benchmarks on Qwen 3.5 9B, it also demonstrates empirically that quantizing from 16-bit to 8-bit incurs almost no accuracy penalty.

Simon Willison · 2 min read · Tools

Speaking of Voxtral - Mistral AI

Mistral AI launched Voxtral TTS, a 4-billion-parameter text-to-speech model supporting 9 languages with emotional expression, low latency, and voice adaptation. The lightweight model is designed for production voice agent workflows and is now available in Mistral Studio.

Mistral AI · 6 min read · Industry

Inside Meta’s chaotic AI boomtown in rural Louisiana

Meta is constructing a colossal Hyperion AI data center on 2,250 acres of rural Louisiana farmland, reshaping a community with thousands of workers, round-the-clock construction, and truck convoys. The project offers a rare on-the-ground look at the physical, economic, and human toll of the infrastructure race underpinning the AI boom.

Fortune · 7 min read · Hardware

MIT engineers design proteins by their motion, not just their shape

MIT's VibeGen AI model generates novel proteins by specifying desired motion patterns rather than just 3D shapes, enabling design of proteins with custom mechanical dynamics for therapeutics and materials.

MIT AI News · 8 min read · Research

Scoop: Altman told staff he tried to "save" Anthropic in Pentagon clash

Sam Altman told OpenAI staff he was trying to "save" competitor Anthropic during Pentagon contract negotiations, while privately criticizing Anthropic's CEO and ultimately securing the defense deal Anthropic lost. Internal Slack messages expose a calculated play that positioned OpenAI as a peacemaker while it capitalized on Anthropic's collapse.

Axios · 3 min read · Industry

Judge temporarily blocks Pentagon's ban on Anthropic

A federal judge blocked the Trump administration's Pentagon ban on Anthropic, under which the company was designated a supply chain risk, granting a preliminary injunction after the company argued the designation caused irreparable business harm. The ruling provides temporary relief while separate cases proceed on First Amendment and procurement law grounds.

Axios · 2 min read · Policy

Judge temporarily blocks Trump administration's Anthropic ban - NPR

A federal judge issued a temporary restraining order blocking the Trump administration's ban on Anthropic, one of the largest AI safety companies, signaling legal challenges to the administration's approach to AI governance.

NPR · 4 min read · Policy

Quick Hits
Claude AI Maker Anthropic Considers IPO as Soon as October · Bloomberg Tech · Market
U.S. judge blocks Pentagon’s ‘Orwellian notion’ to label Anthropic a supply chain risk and ban Claude from the government · Fortune · Policy
Google Releases Gemini 3.1 Flash Live: A Real-Time Multimodal Voice Model for Low-Latency Audio, Video, and Tool Use for AI Agents · MarkTechPost · Industry
Exclusive: Anthropic acknowledges testing new AI model representing ‘step change’ in capabilities, after accidental data leak reveals its existence · Fortune · Industry
Hegseth warns Anthropic to let the military use the company’s AI tech as it sees fit, AP sources say - CSET | Center for Security and Emerging Technology · Georgetown · Policy
