Tools AI News

Developer tools, frameworks, APIs, and tutorials — 2359 articles

Build a Complete Langfuse Observability and Evaluation Pipeline for Tracing, Prompt Management, Scoring, and Experiments

MarkTechPost · May 24

Complete walkthrough for setting up Langfuse tracing, prompt management, scoring, and experimentation with mock or real LLMs. Covers RAG pipeline instrumentation, evaluation scoring, and dataset-drive

Build a Claude Cowork-Like Browser Agent Using Playwright MCP and Claude Desktop

Analytics Vidhya · May 24

Guide to building a Claude-based browser agent using Playwright MCP in Claude Desktop that automates form filling, data extraction, and UI interaction without relying on screenshots. Shifts from chat-

Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

MarkTechPost · May 24

Microsoft Research released Webwright, an open-source web agent framework that replaces click-trace automation with Playwright code generation. The agent writes and refines scripts in a terminal envir

Stop Engineering Prompts: How an Eval-First Harness Let Us Ship 25 Algorithm Versions Autonomously

The Runtime Was Dead Long Before the Dashboard Noticed

Major Updates in Claude Code v2.1.149 to v2.1.150 | DevelopersIO

How I Use Cursor + Claude to Ship React Code 3x Faster

We're A Bit Behind In Agentic Coding With Tool Use: Google CEO Sundar Pichai

The Ultimate Beginners’ Guide to Building an AI Agent in Python

OpenAI Codex Becomes Desktop Agent: Controls Mac Apps, Watches Screen, Runs on Mobile

Build a SuperClaude Framework Workflow with Commands, Agents, Modes, and Session Memory