Sunday, May 24, 2026

AI Pulse Daily

AI cost crisis hits tech giants as employee 'tokenmaxxing' backfires — agentic AI eats up to 1000x more tokens than standard AI, sparks corporate pullback at Microsoft, Meta, and Amazon

Agentic AI systems can consume up to 1000x more tokens than standard models, triggering budget crises and pullbacks at Microsoft, Meta, and Amazon. The cost has become a practical barrier that is reshaping enterprise AI spending.

Tom's Hardware · 1 min read · Industry

How big tech got its way on Trump’s AI executive order

Hours before signing his AI executive order, Trump abandoned plans to require government safety reviews of new AI models, citing competition with China. The reversal signals a deregulatory stance that removes barriers for tech companies to release systems faster.

The Guardian Tech · 1 min read · Policy

Nvidia’s Hidden $60 Billion Business Is About to Overtake Broadcom

Nvidia's data center networking revenue (the hidden $60B business) is projected to surpass Broadcom's total revenue, driven by demand for AI infrastructure and connectivity between GPUs.

24/7 Wall St · 4 min read · Hardware

Build a SuperClaude Framework Workflow with Commands, Agents, Modes, and Session Memory

This tutorial walks through building a SuperClaude Framework wrapper on top of Anthropic's API to create stateful multi-agent workflows with role-specific system prompts, session memory, and chained development tasks. It includes working Python code for brainstorming, security analysis, and token-efficient reasoning chains.

MarkTechPost · 7 min read · Tools

Tencent Open-Sources TencentDB Agent Memory: A 4-Tier Local Memory Pipeline for AI Agents

Tencent open-sourced TencentDB Agent Memory, a fully local 4-tier memory system for AI agents that replaces flat vector stores with a semantic pyramid (Conversation → Atom → Scenario → Persona). It ships as an OpenClaw plugin, runs on SQLite + sqlite-vec, and uses hybrid BM25 + vector retrieval merged with reciprocal rank fusion (RRF). No external APIs are required.

MarkTechPost · 9 min read · Repos
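
For readers unfamiliar with the fusion step named in the item above, here is a minimal sketch of reciprocal rank fusion in general. It is not taken from Tencent's code: the function name `rrf_fuse`, the memory ids, and the constant `k=60` (a commonly used default) are all assumptions for illustration. Each document scores the sum of `1 / (k + rank)` over every ranked list it appears in.

```python
from collections import defaultdict


def rrf_fuse(rankings, k=60):
    """Merge several ranked lists of ids with reciprocal rank fusion.

    `rankings` is a list of lists, each ordered best-first.
    `k` dampens the influence of top ranks; 60 is an assumed default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by id so the order is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical result lists from a keyword (BM25) search and a vector search.
bm25_hits = ["m3", "m1", "m7"]
vector_hits = ["m1", "m9", "m3"]

fused = rrf_fuse([bm25_hits, vector_hits])
print(fused)
```

Because RRF uses only rank positions, it avoids having to normalize BM25 scores and vector distances onto a common scale, which is one reason it is a popular choice for hybrid retrieval.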

NVIDIA AI Releases Gated DeltaNet-2: A Linear Attention Layer That Decouples Erase and Write in the Delta Rule

NVIDIA's Gated DeltaNet-2 decouples erase and write operations in linear attention by using separate channel-wise gates, enabling more flexible memory editing without losing information. At 1.3B parameters, it outperforms Mamba-2, Gated DeltaNet, and KDA on standard benchmarks while maintaining constant-memory decoding.

MarkTechPost · 15 min read · Research

Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

Microsoft Research released Webwright, an open-source web agent framework that replaces click-trace automation with Playwright code generation. The agent writes and refines scripts in a terminal environment rather than interacting with a stateful browser. It scores 60.1% on the Odysseys benchmark, about 1.8x the base GPT-5.4 score of 33.5%, and 86.7% on Online-Mind2Web.

MarkTechPost · 8 min read · Tools

How to Tame AI’s Voracious Appetite for Energy - Nautilus | Science

As AI models grow larger and training demands skyrocket, energy consumption has become a bottleneck for scaling. The piece examines why AI is so power-hungry and what solutions—from hardware optimization to algorithmic efficiency—might reduce the burden.

Nautil · 16 min read · Industry

Quick Hits

Ubisoft reportedly testing generative AI in Far Cry 7, insider says it 'looks like sh*t' — company recently posted a record €1.3 billion loss

Tom's Hardware

Industry
