“If their goals are non-indexical” seems like quite a big “if”.
Yeah, my modal assumption is that AIs will be able to make fairly strong inferences about the mechanics of the decision processes of other AIs by making observations about their behavior (including of side channels). “Mind reading” might be a slightly strong term for this, but, it’s not very far off.
Likely out of scope for this comment section though. I should, at some point, probably write my modal expectation of what the next couple decades look like in more detail.
Thanks.
“If their goals are non-indexical” seems like quite a big “if”.
Yeah, my modal assumption is that AIs will be able to make fairly strong inferences about the mechanics of the decision processes of other AIs by making observations about their behavior (including of side channels). “Mind reading” might be a slightly strong term for this, but, it’s not very far off.
Likely out of scope for this comment section though. I should, at some point, probably write my modal expectation of what the next couple decades look like in more detail.