I think you also haven’t assessed the Rogue Replication Timeline, nor my take that the AI is unalignable to the Spec because the Spec and/or the training data[1] are biased. That take also seems to imply that Agent-2 or Agent-3 might actively collude with Agent-4 rather than simply failing to catch it.
P.S. Shanzon might have used the post “Narrow Misalignment is Hard, Emergent Misalignment is Easy” as a reference.
[1] That is, most Western sources. The bias may run deep: a recent post mentions “Zack Davis documenting endorsement of anti-epistemology (see Where to Draw the Boundaries? and A Hill of Validity in Defense of Meaning) to placate trans ideology even many important transgender Rationality community members overtly reject.”
Yeah, I’ve read most of the submissions but still haven’t gotten around to finishing them & writing up the results, sorry!