Additionally, the AI might suspect it's in an alignment simulation and therefore leave the humans as they are, or even nominally address their needs. This may be mentioned in the linked post, but I want to highlight it: since we already run very low-fidelity alignment simulations by training deceptive models, there is reason to think an AI would entertain this hypothesis.