Devlery - AI news for builders

Devlery blog

Claude Writes 80% of Anthropic Code, but the Hard Part Is Verifiable Pause

Anthropic says Claude wrote more than 80% of merged production code in May 2026. The sharper issue is not recursive self-improvement hype, but verifiable review, provenance, and pause mechanisms.

Claude Fable 5 ships with Mythos-class safeguards and pricing

Anthropic released Claude Fable 5 and Mythos 5, offering the same base model under different safeguards, access restrictions, pricing, and cloud retention rules.

Vercel Sandbox Drives beta keeps agent workspaces beyond disposable VMs

Vercel Sandbox Drives is in private beta, separating short-lived Firecracker microVMs from persistent AI agent workspaces.

Anthropic maps 832 malicious Claude accounts to MITRE ATT&CK

Anthropic mapped 832 banned Claude cyber-abuse accounts to MITRE ATT&CK. The risk signal is shifting from skill level to execution orchestration.

ChatGPT Lockdown Mode turns off agent web access for prompt injection defense

OpenAI expanded Lockdown Mode to all logged-in ChatGPT users, limiting web, agent, and file-download paths that can turn prompt injection into data exfiltration.

Microsoft Work IQ APIs go GA June 16 with ten tools for M365 agents

Microsoft Work IQ APIs reach GA on June 16, bringing Microsoft 365 work context, ten MCP-style tools, and Copilot Credits pricing to agents.

Google says Agentic RAG raises accuracy 34% by checking missing evidence

Google Research introduced Agentic RAG for Gemini Enterprise Agent Platform, with cross-corpus routing and a sufficient-context check before answers are finalized.

Claude security harness shows a 7-step path from AI-found bugs to proof

Anthropic released Claude Code skills and a gVisor-based vulnerability discovery harness focused on verification, triage, and patch validation.

Claude Wrote 80% of Anthropic's Merged Code, Moving the Bottleneck to Review

Anthropic says Claude authored over 80% of merged production code. The harder question is whether review, tests, and incident prevention can keep up.

Netskope AI Command Center turns MCP servers into security inventory

Netskope AI Command Center discovers AI apps, agents, MCP servers, local models, and data-store links inside enterprise environments.

Claude’s five-hour error window asks how agents recover from model outages

Claude’s June 2 model error incident shows why AI agent products need retry, checkpoint, fallback, and human handoff design.

Kurrent Capacitor turns coding agent sessions into team memory

Kurrent Capacitor records Claude Code, Codex, and Cursor sessions as shared memory for PR review, recall, and agent evaluation.