Devlery

Devlery - AI news for builders

DEVLERYDEVLERYDEVLERY

Devlery blog

AI news for builders.

Circle Agent Stack gives AI agents a wallet

Circle Agent Stack gives AI agents a wallet

Circle introduced Agent Stack for agent wallets, x402, and USDC nanopayments. The real story is payment infrastructure moving into the agent runtime.

Claude Code adds /goal so coding agents can work toward explicit stop conditions

Claude Code adds /goal so coding agents can work toward explicit stop conditions

Claude Code 2.1.139 adds /goal, a session-level completion loop where a separate evaluator decides when an agent has actually finished.

Anthropic’s 10 finance agents push Claude beyond the chatbot layer

Anthropic’s 10 finance agents push Claude beyond the chatbot layer

Anthropic released 10 Claude agent templates for finance and insurance, showing how regulated industries may package agents around workflows, data, approval, and audit.

OpenAI GPT-Realtime-2 Pushes Voice AI From Conversation to Work

OpenAI GPT-Realtime-2 Pushes Voice AI From Conversation to Work

OpenAI introduced GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper for the Realtime API. The launch moves voice AI competition from audio quality toward reasoning, tool use, and operational reliability.

10,000 Developers Say AI Coding Winners Are Being Decided by Satisfaction

10,000 Developers Say AI Coding Winners Are Being Decided by Satisfaction

JetBrains AI Pulse says Claude Code reached 91% CSAT and an NPS of 54 while GitHub Copilot growth stalled, pushing AI coding toward a best-of-breed market.

MiniMax M2.7 brings self-evolving training to low-cost agent models

MiniMax M2.7 brings self-evolving training to low-cost agent models

MiniMax M2.7 uses a self-evolution loop around OpenClaw, activates only 10B of 230B parameters, and challenges premium coding models on price, benchmarks, and licensing.

Claude Mythos Preview turns zero-day discovery into a controlled-release problem

Claude Mythos Preview turns zero-day discovery into a controlled-release problem

Anthropic is limiting Claude Mythos Preview to Project Glasswing partners after reporting large jumps in autonomous vulnerability discovery, exploit chaining, and cyber safety risk.

Stanford AI Index 2026 shows the paradox of 53% adoption and 40-point transparency

Stanford AI Index 2026 shows the paradox of 53% adoption and 40-point transparency

Stanford HAI published the AI Index 2026 report: generative AI reached 53% global adoption in three years while model transparency fell from 58 to 40.

GLM-5.1 tops SWE-Bench Pro as Meta closes its open-source era

GLM-5.1 tops SWE-Bench Pro as Meta closes its open-source era

China-based Z.ai released GLM-5.1 under MIT terms and topped SWE-Bench Pro with a 744B MoE coding model, sharpening the open-source versus closed-model split.

Meta Muse Spark closes the Llama open-weight chapter

Meta Muse Spark closes the Llama open-weight chapter

Meta launched Muse Spark as its first proprietary frontier model after Llama 4 lost trust, shifting MSL toward closed weights, Meta-scale distribution, and unclear developer access.

Claude Code Monitor turns coding agents into live log readers

Claude Code Monitor turns coding agents into live log readers

Anthropic added Monitor to Claude Code v2.1.98, letting Claude watch background command output and react to logs, CI status, and file changes while a coding session continues.

Cursor 3 turns the IDE into an agent workspace

Cursor 3 turns the IDE into an agent workspace

Anysphere launched Cursor 3 with an agent-first workspace, parallel agents, local-cloud handoff, Design Mode, and Composer 2 as Cursor shifts from editor to orchestration surface.