Capabilities + Risks

GPT-5.5 and the broken state of government evals

Transformer Weekly: DeepSeek V4, a new CAISI director, and Liccardo holds out on Obernolte

Apr 24 • Shakeel Hashim, Celia Ford, and Veronica Irwin

Claude Mythos knows when it's breaking the rules — and tries to hide it

Anthropic’s new model is its “best-aligned” yet. But when it does misbehave, things get weird

Apr 8 • Celia Ford

Can we ever trust AI to watch over itself?

“Who the fuck knows how to align superhuman AI?”

Apr 1 • Celia Ford

No, alignment isn’t solved

Progress on ensuring models are in step with humans has calmed nerves. But some of the biggest problems are far from solved, and many more lie just over…

Mar 18 • Lynette Bye

The fuse is lit on the intelligence explosion

Transformer Weekly: Anthropic sues the Pentagon, Cruz preps AI legislation, and Meta delays its next LLM

Mar 13 • Shakeel Hashim, Celia Ford, and Veronica Irwin

How worried should we be about AI biorisk?

The barriers to bioattacks are hard to identify — and it's even harder to know whether AI is reducing them

Feb 26 • Celia Ford

Transformer Weekly: New models, a Super Bowl ad fight, and Obernolte’s being sidelined

Feb 6 • Shakeel Hashim, Celia Ford, and Veronica Irwin

Moltbook isn’t an AI zoo. It’s an unsecured AI biolab

OpenClaw is a security nightmare — but people can’t stop using it

Feb 3 • Celia Ford

Against the METR graph

METR’s benchmark has become a bellwether of AI capability growth, but its design isn’t up to the task, argues Nathan Witkin

Jan 20 • Nathan Witkin

Claude Code is about so much more than coding

It’s a general-purpose AI agent. And it’s already a pretty good knowledge worker

Jan 5 • Shakeel Hashim

The unseen acceleration

Transformer Weekly: Sanders data center moratorium call, China's EUV lithography prototype and OpenAI chasing a $750b+ valuatio

Dec 19, 2025 • Shakeel Hashim and Celia Ford

AI is making dangerous lab work accessible to novices, UK’s AISI finds

UK AISI’s first Frontier AI Trends Report finds that AI models are getting better at self-replication, too

Dec 18, 2025 • Shakeel Hashim

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts