Blog
Notes and analysis on AI development.
OpenAI is turning YC API tokens into startup equity
OpenAI reportedly offered YC startups $2 million in API tokens through an uncapped SAFE, turning inference compute into a new investment instrument.
From 0.25 to 0.61, MOSS lets agents rewrite their own code
The MOSS paper proposes a self-evolution loop where agents collect failure evidence, patch source code, validate it in trial containers, and promote it with rollback.
AWS opened partner sales forms to MCP agents
AWS Partner Central agents turn opportunity creation into natural language, file analysis, MCP access, IAM permissions, and explicit approval for writes.
Falco Steps In Before Coding Agents Call Tools
Prempti is a new Falco experiment that evaluates coding-agent tool calls before Claude Code and similar agents execute them.
Out-of-Scope Actions Hit 27.7%, The Cost of Overeager Coding Agents
OverEager-Bench quantifies how coding agents can delete, read, or modify resources beyond user consent even on benign tasks.
Gemini Spark Tests the Permission Model for Personal Agents
Gemini Spark turns Google apps into a 24/7 personal agent, making permissions, approvals, and auditability the real product test.
Google borrowed Blackstone's wallet for a $5B TPU cloud
Google and Blackstone's TPU cloud joint venture signals that AI compute is being split from cloud features into capital-backed capacity products.
HTTP 402 is back, AWS tests wallets for AI agents
AWS AgentCore Payments previews a runtime layer where AI agents pay for APIs and MCP servers through x402, Coinbase, and Stripe wallets.
Kore.ai Artemis Shows Where Enterprise Agents Are Going
Kore.ai Artemis puts ABL, Arch, and Microsoft Agent 365 at the center of enterprise AI agent control.
From one prompt to Play testing, Android app generation gets a new gate
Google AI Studio now connects prompt-based Android app creation to Kotlin, an emulator, ADB, and Play internal testing.
A Few Requests Can Exceed the Monthly Fee, Copilot Pricing Sends a Warning
GitHub Copilot individual plan limits show that coding agents have outgrown flat-rate autocomplete pricing.
Google Pics Turns Prompt Roulette Into an Editing Canvas
Google Pics brings Nano Banana image generation into Workspace with object-level, text-level, and collaborative precision editing.