Blog
Notes and analysis on AI development.
MongoDB brings automatic embeddings and agent memory into Atlas
MongoDB announced Automated Voyage AI Embeddings and LangGraph.js memory support, reframing agent reliability as a data freshness and memory problem.
Trust3 says agent discovery must track hidden MCP connections
Trust3 AI’s agent discovery guide frames shadow agents, MCP bindings, runtime traffic, and ownership as an inventory problem for AI governance.
Anthropic raises $65B as Claude hits the compute wall
Anthropic’s $65B Series H ties Claude demand, Opus 4.8, and 10GW-scale compute contracts into one infrastructure story.
Datadog says rate limits caused 60% of LLM call errors
Datadog State of AI Engineering 2026 shows how production LLM apps fail on quotas, routing, prompt cache, and context growth, not only model quality.
GitHub opens Copilot Agent Tasks API for batch coding work
GitHub put Copilot cloud agent tasks behind a REST API preview. The useful part is not just task creation, but permissions, state tracking, and review design.
KDD Cup Data Agents Delayed by 700 Teams and Docker Audits
KDD Cup 2026 Data Agents delayed Phase 1 results after more than 700 teams and Docker compliance checks exposed the operational cost of agent evaluation.
TrustAI kill switch cuts agent data access in seconds
TrustLogix TrustAI moves AI agent control to the data layer with MCP governance, intent-based authorization, and a runtime kill switch.
Agent 365 goes GA and prices agent governance at $15 per user
Microsoft Agent 365 is now generally available, turning AI agents into governed inventory across Entra, Defender, Purview, and Microsoft 365 admin.
OpenAI says AI evals need harnesses, tools, and budgets
OpenAI published a frontier governance framework and third-party evaluation playbook. Agent scores now need harnesses, tools, and budgets attached.
Mastra Agent Builder puts permissions at the center of internal agents
Mastra Agent Builder and its Temporal integration show how TypeScript agent platforms are moving toward RBAC, allow-lists, durable execution, and workflow traces.
Cursor and Endor Labs Put Security Gates Inside the Coding Agent Loop
Cursor and Endor Labs formalized a hooks-based security partnership for agentic coding, blocking package installs, MCP use, and risky commands inside the IDE loop.
Cognition’s $1B round puts Devin’s 89% code claim on trial
Cognition says Devin commits 89% of its internal code. The harder question is whether agent-written PRs come with reviewable test evidence.