avturchin comments on The Achilles Heel Hypothesis for AI

avturchin 15 Oct 2020 11:43 UTC
3 points
Yes, landmines are the last line of defence, and they have a very low probability of working (something like 0.1 per cent). However, if an AI is robust to all possible philosophical landmines, it is a very stable agent and has a better chance of keeping its alignment and not failing catastrophically.