Devlery - Page 21

Devlery blog

AI news for builders.

2.03x Token Throughput, EAGLE 3.1 Fixes Speculation Drift

vLLM EAGLE 3.1 targets attention drift in speculative decoding, with early gains for long-context and coding-agent serving workloads.

May 26, 2026 [AI]

AI Broke an Erdos Conjecture, and the Real Story Is the Verification Loop

OpenAI’s unit-distance counterexample shows that AI research automation depends less on answer generation than on proofs experts can inspect.

May 25, 2026 [AI]

The 90-day review stalled, and AI model launches changed

Trump’s delayed AI executive order shows frontier model launches being reshaped around speed, security evaluation, and critical infrastructure readiness.

May 25, 2026 [AI]

Datasette Agent shows why narrow AI agents matter

Datasette Agent connects SQLite exploration with LLMs, plugin tools, permissions, and sandbox execution in a narrow but practical agent experiment.

May 25, 2026 [AI]

The 1,000-session wall, and why agent products need analytics

Voker’s Launch HN shows how agent operations are moving beyond trace debugging toward product analytics for intents, corrections, and resolutions.

May 25, 2026 [AI]

The 3-second approval device trying to hold agent authority

Foundation Passport Prime is an experiment in moving final approval for AI agents out of the browser and into dedicated hardware.

May 25, 2026 [AI]

One API call, Google opens the serverless agent runtime

Gemini API Managed Agents hides sandboxing, state, and tool loops behind an API, moving agent competition into runtime infrastructure.

May 25, 2026 [AI]

AWS MCP Packs 15,000 APIs Into a New Boundary for Cloud Agents

AWS Agent Toolkit and the AWS MCP Server GA show how coding agents can reach cloud accounts through IAM, CloudWatch, and CloudTrail.

May 24, 2026 [AI]

Docusign MCP Beta Turns Agreements Into Agent Tools

Docusign Iris Agents and its MCP beta show how agreement data can become a callable work surface for Claude, Gemini, and ChatGPT.

May 24, 2026 [AI]

The 15x token bill and the return of the AI-native cloud

DigitalOcean AI-Native Cloud shows why agent costs are shifting from GPU rental to inference routing, data, state, and operations.

May 24, 2026 [AI]

The 24-Hour Agent Permission Problem in Front of 900M Gemini Users

Google Gemini Spark brings background agents, MCP connections, and approval boundaries into a mass-market consumer AI surface.

May 24, 2026 [AI]

99.82% Cache Hits, the New Variable in Coding Agent Costs

The Reasonix debate shows that coding agent costs depend not only on model pricing but also on harness design that keeps the prefix cache intact.

May 24, 2026 [AI]
