Donald Hobson comments on MIRI’s “The Problem” hinges on diagnostic dilution

Donald Hobson 21 Aug 2025 22:53 UTC
2 points
0
Given that alignment is theoretically solvable, (probably) and not currently solved, almost any argument about alignment failure is going to have an
“and the programmers didn’t have a giant breakthrough at the last minute” assumption.