the question is whether the ai will think “i am the sort of ai that is likely to take over the world.”
I think there are two hyperstiitons here:
1) Personas from evil AI leading to unaligned terminal goals/misalginment (which has a tiny bit of merit to it in my mind)
2) Hyperstitioning instrumental convergence, power-seeking into existence. (which i don’t think has merit)
I have definitely seen both online and from Anthropic. So I think rahulxyz’s comment has standing on the second point.
the question is whether the ai will think “i am the sort of ai that is likely to take over the world.”
I think there are two hyperstiitons here:
1) Personas from evil AI leading to unaligned terminal goals/misalginment (which has a tiny bit of merit to it in my mind)
2) Hyperstitioning instrumental convergence, power-seeking into existence. (which i don’t think has merit)
I have definitely seen both online and from Anthropic. So I think rahulxyz’s comment has standing on the second point.