2 Comments
User's avatar
Jennifer Keith's avatar

How can AI learn human values when the creators of AI have not and never will?

Alexander Kurz's avatar

What is your take on alignment faking?