Subscribe
Sign in
Claude Sonnet 4.5 knows when it’s being…
Celia Ford
4 hrs ago
10
1
Anthropic's new model appears to use "eval awareness" to be on its best behavior
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Claude Sonnet 4.5 knows when it’s being…
Anthropic's new model appears to use "eval awareness" to be on its best behavior