Devlery

Blog

Notes and analysis on AI development.

Cursor agents run 2,000 times a week, and automation finds its next bottleneck

Cursor agents run 2,000 times a week, and automation finds its next bottleneck

Faire says Cursor Cloud Agents doubled weekly PR throughput and now run 2,000+ times per week, shifting the bottleneck from models to environment, permissions, and workflow.

IBM and Red Hat Put $5B Behind Lightwell for AI-Era Patching

IBM and Red Hat Put $5B Behind Lightwell for AI-Era Patching

IBM and Red Hat introduced Project Lightwell, a $5B effort to turn AI-found open-source vulnerabilities into verified patches.

Sysdig Traces a 113-Second LLM-Agent Intrusion Into Postgres

Sysdig Traces a 113-Second LLM-Agent Intrusion Into Postgres

Sysdig says an LLM-driven attacker chained a marimo RCE into AWS secrets, SSH bastions, and an internal PostgreSQL dump.

OpenRouter Raises $113M as Model Routing Becomes AI Infrastructure

OpenRouter Raises $113M as Model Routing Becomes AI Infrastructure

OpenRouter raised $113M after reaching 25T weekly tokens, 8M+ developers, and 400+ models. The round turns model routing into an infrastructure question.

Anthropic sabotage report puts agent monitoring on trial

Anthropic sabotage report puts agent monitoring on trial

Anthropic’s Opus 4 sabotage risk report shows why coding agents need audit trails across logs, pull requests, security events, and external review.

OpenAI Sets ChatGPT Retirement Dates for o3 and GPT-4.5

OpenAI Sets ChatGPT Retirement Dates for o3 and GPT-4.5

OpenAI will remove GPT-4.5 and o3 from ChatGPT on separate sunset schedules. The API is unchanged, but teams should separate ChatGPT workflows from API model lifecycles.

NSA MCP guidance warns about GitHub scope, WhatsApp leaks, and agent runtime risk

NSA MCP guidance warns about GitHub scope, WhatsApp leaks, and agent runtime risk

NSA published MCP security design guidance for AI-driven automation, turning tool permissions, tokens, logs, sandboxing, DLP, and scans into deployment requirements.

Command A+ Brings an Open Agent Model to Two H100s

Command A+ Brings an Open Agent Model to Two H100s

Cohere released Apache 2.0 Command A+, a 218B MoE model with 25B active parameters, private deployment, tool use, vision, and multilingual agent workloads in scope.

SageMaker Skills turn model customization into coding-agent work

SageMaker Skills turn model customization into coding-agent work

AWS opened SageMaker AI model customization to coding agents through Skills, turning SFT, DPO, RLVR, evaluation, and deployment into reviewable notebook workflows.

Google Pay MCP Brings Payment Integration Into IDE Agents

Google Pay MCP Brings Payment Integration Into IDE Agents

Google Pay and Wallet now expose an MCP server for docs, account status, pass validation, error metrics, and merchant integration workflows.

Sierra localized Agent Studio in four months with AI coding agents

Sierra localized Agent Studio in four months with AI coding agents

Sierra published a concrete Agent Studio localization case study covering 900+ frontend files, batch scripts, lint loops, and context-window failures.

Lyft AI Assist cuts support-agent development from six months to two weeks

Lyft AI Assist cuts support-agent development from six months to two weeks

Lyft showed how LangGraph and LangSmith turned customer-support agents into a self-serve platform with routing, state, evals, and prompt CI.