AI

495 posts

2.03x Token Throughput, EAGLE 3.1 Fixes Speculation Drift

vLLM EAGLE 3.1 targets attention drift in speculative decoding, with early gains for long-context and coding-agent serving workloads.

May 26, 2026

AI Broke an Erdős Conjecture, and the Real Story Is the Verification Loop

OpenAI’s unit-distance counterexample shows that AI research automation depends less on answer generation than on proofs experts can inspect.

May 25, 2026

The 90-day review stalled, and AI model launches changed

Trump’s delayed AI executive order shows frontier model launches being reshaped around speed, security evaluation, and critical infrastructure readiness.

May 25, 2026

Datasette Agent shows why narrow AI agents matter

Datasette Agent connects SQLite exploration with LLMs, plugin tools, permissions, and sandbox execution in a narrow but practical agent experiment.

May 25, 2026

The 1,000-session wall, and why agent products need analytics

Voker’s Launch HN shows how agent operations are moving beyond trace debugging toward product analytics for intents, corrections, and resolutions.

May 25, 2026

The 3-second approval device trying to hold agent authority

Foundation Passport Prime is an experiment in moving final approval for AI agents out of the browser and into dedicated hardware.

May 25, 2026

One API call, Google opens the serverless agent runtime

Gemini API Managed Agents hides sandboxing, state, and tool loops behind an API, moving agent competition into runtime infrastructure.

May 25, 2026

AWS MCP Packs 15,000 APIs Into a New Boundary for Cloud Agents

AWS Agent Toolkit and the AWS MCP Server GA show how coding agents can reach cloud accounts through IAM, CloudWatch, and CloudTrail.

May 24, 2026

Docusign MCP Beta Turns Agreements Into Agent Tools

Docusign Iris Agents and its MCP beta show how agreement data can become a callable work surface for Claude, Gemini, and ChatGPT.

May 24, 2026

The 15x token bill and the return of the AI-native cloud

DigitalOcean AI-Native Cloud shows why agent costs are shifting from GPU rental to inference routing, data, state, and operations.

May 24, 2026

The 24-Hour Agent Permission Problem in Front of 900M Gemini Users

Google Gemini Spark brings background agents, MCP connections, and approval boundaries into a mass-market consumer AI surface.

May 24, 2026

99.82% Cache Hits, the New Variable in Coding Agent Costs

The Reasonix debate shows that coding agent costs depend not only on model pricing, but on harness design that keeps prefix cache intact.

May 24, 2026