Blog
Notes and analysis on AI development.
Cursor agents run 2,000 times a week, and automation finds its next bottleneck
Faire says Cursor Cloud Agents doubled weekly PR throughput and now run 2,000+ times per week, shifting the bottleneck from models to environment, permissions, and workflow.
IBM and Red Hat Put $5B Behind Lightwell for AI-Era Patching
IBM and Red Hat introduced Project Lightwell, a $5B effort to turn AI-found open-source vulnerabilities into verified patches.
Sysdig Traces a 113-Second LLM-Agent Intrusion Into Postgres
Sysdig says an LLM-driven attacker chained a marimo RCE into AWS secrets, SSH bastions, and an internal PostgreSQL dump.
OpenRouter Raises $113M as Model Routing Becomes AI Infrastructure
OpenRouter raised $113M after reaching 25T weekly tokens, 8M+ developers, and 400+ models. The round turns model routing into an infrastructure question.
Anthropic sabotage report puts agent monitoring on trial
Anthropic’s Opus 4 sabotage risk report shows why coding agents need audit trails across logs, pull requests, security events, and external review.
OpenAI Sets ChatGPT Retirement Dates for o3 and GPT-4.5
OpenAI will remove GPT-4.5 and o3 from ChatGPT on separate sunset schedules. The API is unchanged, but teams should separate ChatGPT workflows from API model lifecycles.
NSA MCP guidance warns about GitHub scope, WhatsApp leaks, and agent runtime risk
NSA published MCP security design guidance for AI-driven automation, turning tool permissions, tokens, logs, sandboxing, DLP, and scans into deployment requirements.
Command A+ Brings an Open Agent Model to Two H100s
Cohere released Apache 2.0 Command A+, a 218B MoE model with 25B active parameters, private deployment, tool use, vision, and multilingual agent workloads in scope.
SageMaker Skills turn model customization into coding-agent work
AWS opened SageMaker AI model customization to coding agents through Skills, turning SFT, DPO, RLVR, evaluation, and deployment into reviewable notebook workflows.
Google Pay MCP Brings Payment Integration Into IDE Agents
Google Pay and Wallet now expose an MCP server for docs, account status, pass validation, error metrics, and merchant integration workflows.
Sierra localized Agent Studio in four months with AI coding agents
Sierra published a concrete Agent Studio localization case study covering 900+ frontend files, batch scripts, lint loops, and context-window failures.
Lyft AI Assist cuts support-agent development from six months to two weeks
Lyft showed how LangGraph and LangSmith turned customer-support agents into a self-serve platform with routing, state, evals, and prompt CI.