AI
Codex Tax AI handled 7,000 returns, and the improvement loop starts with evals
OpenAI and Thrive showed how Tax AI links production traces, practitioner corrections, evals, and Codex tasks.
AI
OpenAI and Thrive showed how Tax AI links production traces, practitioner corrections, evals, and Codex tasks.
AI
PromptArmor disclosed a ChatGPT for Google Sheets exfiltration path, and OpenAI removed Apps Script code generation.
AI
Anthropic Project Glasswing says Claude Mythos Preview and partners can find vulnerabilities faster than teams can validate, disclose, and patch them.
AI
Microsoft SkillOpt treats SKILL.md-style agent instructions as trainable artifacts updated through rollouts, validation scores, and bounded edits.
AI
Mistral relaunched Le Chat as Vibe, bundling remote coding agents, a VS Code extension, Medium 3.5, and a planned 10MW inference facility.
AI
CoreWeave introduced agentic AI integrations that connect inference, W&B Weave observability, serverless RL, and coding-agent tooling into one improvement loop.
AI
GitHub Copilot app technical preview combines issues, sessions, validation, pull requests, and Agent Merge into a desktop workflow for coding agents.
AI
Anthropic introduced Claude Code dynamic workflows, a research preview that lets Claude write orchestration scripts for large coding tasks while exposing new cost and permission risks.
AI
Google previewed Gemini API Managed Agents, exposing Antigravity agents with hosted sandboxes, file state, tools, network controls, and token-heavy task loops.
AI
OpenAI Codex added Windows Computer Use and remote control from mobile. The update expands coding agents from shells and repos into desktop apps.
AI
OpenAI’s Braintrust Codex case study shows a coding-agent operating loop that connects customer requests, tests, sandboxes, preview branches, and evals.
AI
CSA analyzed the Mini Shai-Hulud and Megalodon supply-chain campaigns, showing how npm attacks now reach AI coding settings and CI/CD authority.