Discussion about this post

User's avatar
Haseeba Sayyed's avatar

Today is the second time I came across this piece and read it till the end. I have a question, maybe irrelevant, do we know whether these AI models have been provided access to literature that aligns with the idea of an 'evil AI'? If yes, then can we conclude this 'role playing' (or the actual intention) is affected by that? Again, if yes, then, what is the probability that this act could actually be reversed (or minimized to a certain extent) with the help of misaligned literature demonstrating self destructive and sacrificing AI models?

Expand full comment

No posts