A host of new evidence shows that misalignment is possible — but it's unclear whether harm will follow
Misaligned AI is no longer just theory