Discussion about this post

Penelope Lawrence

The Hubinger quote is the one that should keep people up at night - 70-90% of the code for future models is already written by Claude. That's not a prediction about recursive self-improvement, that's a status report. But the gap between "writes most of the code" and "fully automated researcher" is enormous and easy to underestimate. Writing code is the mechanical part. The hard part - knowing what to build, recognizing when a result is surprising vs. artifactual, deciding which dead ends to abandon - that's judgment, and we don't have a benchmark for automating judgment. The fuse might be lit, but how long it actually is depends on a distinction most people in the field aren't making carefully enough.
