Devlery

Blog

Notes and analysis on AI development.

Arm open-sources Metis, a security agent aimed at SAST false positives

Arm open-sources Metis, a security agent aimed at SAST false positives

Arm has open-sourced Metis, an agentic AI security framework for code review, SARIF triage, evidence chains, and lower false-positive cost.

LFM2.5 8B Brings Tool Calling to Local Agents

LFM2.5 8B Brings Tool Calling to Local Agents

Liquid AI released LFM2.5-8B-A1B, a 1.5B active MoE model with 128K context for local tool calling, structured outputs, and agent workflows.

AI Passed the CAPTCHA, but 0.88 AUC Caught the Process

AI Passed the CAPTCHA, but 0.88 AUC Caught the Process

Roundtable and an arXiv paper argue that AI agents can match CAPTCHA answers while still revealing themselves through click order and behavior.

GitHub Copilot Model Rules Let Enterprises Limit Expensive Models by Organization

GitHub Copilot Model Rules Let Enterprises Limit Expensive Models by Organization

GitHub opened Copilot targeted model rules in public preview. With AI Credits arriving on June 1, model choice is becoming a team-level budget and governance policy.

Claroty Claire Launches With Approval Boundaries for Factory and Hospital AI Security

Claroty Claire Launches With Approval Boundaries for Factory and Hospital AI Security

Claroty introduced Claire, a CPS-native AI security agent. The launch shows why AI action in factories and hospitals has to be tied to asset data, approvals, and audit trails.

Vera Ships First 88-Core CPU Systems to OpenAI for Agent Workloads

Vera Ships First 88-Core CPU Systems to OpenAI for Agent Workloads

NVIDIA delivered the first Vera CPU systems to OpenAI, Anthropic, SpaceXAI, and OCI. The launch points to CPU bottlenecks in tool calls, sandboxes, Python, and long-running agents.

Reactor raises $59M for real-time AI world APIs

Reactor raises $59M for real-time AI world APIs

Reactor launched as infrastructure for real-time world models, moving AI video competition from rendered clips toward latency, sessions, and API operations.

Kog claims 3,000 tokens/s, and coding agents hit a latency wall

Kog claims 3,000 tokens/s, and coding agents hit a latency wall

Kog KIE tech preview claims 3,000 tokens/s on 8x MI300X. The useful question is what 2B-model, batch-1 latency means for coding agents.

Antigravity Ran 93 Agents and Put a Price Tag on OS Demos

Antigravity Ran 93 Agents and Put a Price Tag on OS Demos

Google Antigravity teamwork-preview used 93 subagents, 15,314 model calls, and 2.6B+ tokens to build an OS demo. The useful signal is the cost model.

Five LLMs split on 67% of fact-checks, and AI search absorbs the cost

Five LLMs split on 67% of fact-checks, and AI search absorbs the cost

Lenz Research tested 1,000 real fact-check claims across five frontier LLMs and found that 67% did not receive the same verdict.

AI coding teams ship daily, but DevOps is paying the bill

AI coding teams ship daily, but DevOps is paying the bill

Harness 2026 survey data links heavy AI coding use with faster deployment, more delivery pressure, and downstream security, rollback, and burnout signals.

Fujitsu brings Claude to 100,000 staff and adds Codex to SI delivery

Fujitsu brings Claude to 100,000 staff and adds Codex to SI delivery

Fujitsu announced OpenAI and Anthropic collaborations on the same day, pointing to a multi-model SI strategy built around Claude, Codex, FDE, and enterprise controls.