AI
Anthropic sabotage report puts agent monitoring on trial
Anthropic’s Opus 4 sabotage risk report shows why coding agents need audit trails across logs, pull requests, security events, and external review.
AI
Anthropic’s Opus 4 sabotage risk report shows why coding agents need audit trails across logs, pull requests, security events, and external review.
AI
NSA published MCP security design guidance for AI-driven automation, turning tool permissions, tokens, logs, sandboxing, DLP, and scans into deployment requirements.
AI
Anthropic added self-hosted sandboxes and MCP tunnels to Claude Managed Agents, shifting tool execution and private tool access into enterprise-controlled boundaries.
AI
Google released a Chrome Enterprise Premium MCP server that exposes DLP rules, connector policy, browser telemetry, and activity logs to AI agents.
AI
Anthropic published Claude containment designs and failure cases across claude.ai, Claude Code, and Claude Cowork, turning approval fatigue, allowlists, and memory into an agent security checklist.
AI
Robinhood opened Trading MCP and Banking MCP for AI agents. The real developer story is the permission, approval, and liability model around financial tool calls.
AI
Arm has open-sourced Metis, an agentic AI security framework for code review, SARIF triage, evidence chains, and lower false-positive cost.
AI
Roundtable and an arXiv paper argue that AI agents can match CAPTCHA answers while still revealing themselves through click order and behavior.
AI
Claroty introduced Claire, a CPS-native AI security agent. The launch shows why AI action in factories and hospitals has to be tied to asset data, approvals, and audit trails.
AI
Harness 2026 survey data links heavy AI coding use with faster deployment, more delivery pressure, and downstream security, rollback, and burnout signals.
AI
OpenAI Secure MCP Tunnel gives ChatGPT, Codex, Responses API, and AgentKit an outbound-only path to private MCP servers without public endpoints.
AI
Anthropic published Claude Mythos Preview exploit evaluations and a CVD dashboard. V8 21/41 ACE and 1,596 disclosed flaws reset security triage expectations.