These examples seem related to the abstraction question: we want the model to know that it is splitting an agent into parts while still believing it can’t influence the agent as a whole. If we could realize this, then the LCDT agent wouldn’t believe it could influence the neural net / the logic gates.
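To make this concrete, here is a minimal sketch of one way the abstraction could work, assuming LCDT is implemented as edge-cutting in a causal graph (all names here, like `agentic_closure` and `lcdt_cut`, are hypothetical illustrations, not anything from the post): if the agent tag propagates from a node to the parts it is split into, then the edges from the decision to the neural net / logic gates get severed just like the edge to the whole agent.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    is_agent: bool = False  # tagged as an agent in the world model
    parts: list["Node"] = field(default_factory=list)  # sub-components, e.g. logic gates

    def agentic_closure(self) -> set[str]:
        """All names covered by this node's agent tag, including its parts."""
        names = {self.name} if self.is_agent else set()
        for part in self.parts:
            # Parts of an agent inherit the tag: splitting an agent into
            # pieces must not create nodes the LCDT agent thinks it can move.
            if self.is_agent:
                part.is_agent = True
            names |= part.agentic_closure()
        return names


def lcdt_cut(edges: set[tuple[str, str]], decision: str,
             nodes: list[Node]) -> set[tuple[str, str]]:
    """Edges the LCDT agent uses when predicting its decision's effects:
    every edge from its decision into an agent (or any part of one) is severed."""
    agentic = set().union(*(n.agentic_closure() for n in nodes))
    return {(src, dst) for (src, dst) in edges
            if not (src == decision and dst in agentic)}


# Usage: "Human" is modelled both as a whole and as two gate-level parts.
gates = [Node("gate_0"), Node("gate_1")]
human = Node("Human", is_agent=True, parts=gates)
edges = {("Decision", "Human"), ("Decision", "gate_0"), ("Decision", "Environment")}
print(lcdt_cut(edges, "Decision", [human]))  # only ("Decision", "Environment") survives
```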