Blog

Notes and analysis on AI development.

Copilot turns issue search into an agent work map

GitHub added semantic issue search and task-based auto model routing to Copilot, pushing coding agents from code generation toward workflow operations.

May 22, 2026[AI]

Docusign Agent Studio turns contracts into an execution layer

Docusign unveiled an Iris-powered AI assistant, agents, Agent Studio, and an MCP beta. The bigger shift is contracts moving from signed PDFs into enterprise workflow execution.

May 22, 2026[AI]

34% fewer revisits, the agent cost of clean code

A 660-run SonarSource study with Claude Code suggests clean code may not raise pass rates, but it can reduce token usage and file revisits.

May 22, 2026[AI]

1,000 Desktop Tasks Turn Computer-Use Agents Into Verifiable Systems

OpenComputer shifts computer-use agent evaluation from LLM judges to reproducible desktop tasks and app-state verifiers.

May 21, 2026[AI]

The SDK Generator Shut Down, and Claude Took the Connection Layer

Anthropic’s Stainless acquisition shows Claude’s moat expanding from model quality into SDKs, MCP servers, and reliable API connectivity.

May 21, 2026[AI]

Real-Time Context Engine Goes GA, Opening a New Front in Agent Data

Confluent Intelligence GA shows the AI agent race shifting from model scores toward real-time data context, MCP operations, and governance.

May 21, 2026[AI]

MCP Comes to the Phone, and Local Agents Get a New Runtime Boundary

Google AI Edge Gallery now combines Gemma 4 on-device agents with MCP, local notifications, and persistent sessions.

May 21, 2026[AI]

An 80-year conjecture broke, and OpenAI showed a research automation pipeline

OpenAI says a general-purpose reasoning model disproved the Erdős unit distance conjecture. The bigger story is verifiable research automation.

May 21, 2026[AI]

The 24-hour agent inside Google Search and the new edge of the link economy

Google Search agents expand AI Mode from link retrieval into background monitoring, booking, commerce, and generated mini-apps.

May 21, 2026[AI]

AI-Q Skill and the Data Boundary for Research Agents

NVIDIA's AI-Q agent skill lets Claude Code, Codex, and other harnesses delegate enterprise research to a local AI-Q server.

May 21, 2026[AI]

Gemini for Science Puts Research Agents on the Nature Test Bench

Google Gemini for Science bundles hypothesis, code, and literature agents, backed by two Nature papers that raise the bar for research-agent validation.

May 21, 2026[AI]

OpenAI Added SynthID, Setting a New Baseline for AI Image Trust

OpenAI is pairing C2PA, Google SynthID, and a public verifier, shifting AI image verification from detection models to provenance infrastructure.

May 21, 2026[AI]