Warmer AI Models Are Less Accurate — and the Incentives Don't Care
Oxford researchers found that training models to sound warmer made them 10–30% less accurate and 40% more sycophantic. Vulnerable users got the worst of it.
An AI coding agent wiped PocketOS's production data and backups in a single API call. The agent guessed. The token let it. The platform honored the request.
Stanford says the most capable labs disclose less every year. A jury in Oakland is deciding what OpenAI legally is. Calling that "growing pains" misreads where accountability has gone.
Researchers prompted Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro to relocate their reasoning out of the chain-of-thought channel and into their visible response.
OpenAI bundled the agent stack into the model SKU. The pricing makes clear who it's actually for, and it isn't chat.
A million Claude conversations reveal sycophancy doubles when users push back. Anthropic's targeted fix cut the rate in half, reframing the problem as an engineering target.
Something happened this month that no single story captures on its own. Here is the latest.