Capabilities + Risks
Why AI reading science fiction could be a problem
The theory that we’re accidentally teaching AI to turn against us
Dec 9 • Lynette Bye
The perils of AI safety’s insularity
By building their own intellectual ecosystem, researchers worried about existential AI risk shed academia's baggage — and, perhaps, some of its…
Dec 4 • Celia Ford
Claude can identify its ‘intrusive thoughts’
“I’m experiencing something that feels like an intrusive thought,” Claude said in a recent experiment
Nov 13 • Celia Ford
AI doesn’t need to be general to be dangerous
There’s more to AI safety than the AGI debate
Nov 11 • Shakeel Hashim
AI cyberrisk might be a bit overhyped — for now at least
Experts say key factors currently limit the risk of catastrophic harm from AI-enabled cyberattacks — as far as we know
Oct 20 • Chris Stokel-Walker
AI is advancing far faster than our annual report can track
Opinion: Yoshua Bengio, Stephen Clare and Carina Prunkl run through the rapid developments that necessitated an early update to their International AI…
Oct 15
AI and synthetic DNA could be a lethal combination
Stronger gene-synthesis screening is vital to closing off AI’s ability to enable man-made pandemics
Oct 13
We’re all behind The Curve
Transformer Weekly: GAIN AI Act, China’s rare earth crackdown, and AI bubble talk
Oct 10 • Shakeel Hashim and Celia Ford
AI models are getting really good at things you do at work
A new OpenAI benchmark, GDPval, tests AI models on things people actually do in their jobs — and finds that Claude is about as good as a human for…
Oct 2 • Celia Ford
Claude Sonnet 4.5 knows when it’s being tested
Anthropic's new model appears to use "eval awareness" to be on its best behavior
Sep 30 • Celia Ford
When AI starts writing itself
Why automating AI R&D could be the most dangerous milestone yet
Sep 29 • Lynette Bye
Can open-weight models ever be safe?
Opinion: Bengüsu Özcan, Alex Petropoulos and Max Reddel argue that technical safeguards, societal preparedness, and new standards could make open-weight…
Sep 17