Mitchell_Porter comments on Reasons not to trust AI

Mitchell_Porter 25 Apr 2026 7:07 UTC
2 points
The risk that an ASI would optimize the world according to alien values that it inadvertently acquired while learning human values seems a very plausible “failure mode” of superintelligence. (Discussion with GPT-5.5, which ends with preliminary thoughts on how to avoid such failures in the context of CEV-like alignment.)