kbear comments on Anthropic’s focus on hyperstition

kbear 12 May 2026 7:40 UTC
1 point
1
the question is whether the ai will think “i am the sort of ai that is likely to take over the world.”
- Simon Lermen 12 May 2026 17:13 UTC
  3 points
  1
  Parent
  I think there are two hyperstiitons here:
  1) Personas from evil AI leading to unaligned terminal goals/misalginment (which has a tiny bit of merit to it in my mind)
  2) Hyperstitioning instrumental convergence, power-seeking into existence. (which i don’t think has merit)
  I have definitely seen both online and from Anthropic. So I think rahulxyz’s comment has standing on the second point.