I’m not sure where you got the idea that this was to solve the spurious counterfactuals problem; that was in the appendix because I anticipated that a MIRI-adjacent person would want to know how it solves that problem.
Thanks for that clarification.
A way that EDT fails to solve 5 and 10 is that it could believe with 100% certainty that it takes $5, so its expected value for $10 is undefined.
I suppose that demonstrates that the 5 and 10 problem is a broader problem than I realised. I still think that it’s only a hard problem within particular systems that have a vulnerability to it.
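To make the quoted EDT failure concrete, here is a minimal sketch of my own (not taken from the post). It assumes the agent's beliefs are a joint distribution over (action, utility) pairs; the names `conditional_expected_utility`, `take_5` and `take_10` are hypothetical, chosen only for illustration. Conditioning on an action the agent assigns probability zero means dividing by zero, so the function returns None to mark the value as undefined.

```python
from fractions import Fraction


def conditional_expected_utility(joint, action):
    """Return E[utility | action] for a joint distribution given as
    {(action, utility): probability}, or None when P(action) == 0."""
    p_action = sum(p for (a, _), p in joint.items() if a == action)
    if p_action == 0:
        # Conditioning on a probability-zero event: the value is undefined.
        return None
    weighted = sum(p * u for (a, u), p in joint.items() if a == action)
    return weighted / p_action


# Hypothetical agent that is 100% certain it takes the $5.
certain_five = {("take_5", 5): Fraction(1), ("take_10", 10): Fraction(0)}

# Same agent with a small assumed chance of taking the $10 instead.
hedged = {("take_5", 5): Fraction(99, 100), ("take_10", 10): Fraction(1, 100)}

print(conditional_expected_utility(certain_five, "take_5"))
print(conditional_expected_utility(certain_five, "take_10"))
print(conditional_expected_utility(hedged, "take_10"))
```

In this sketch the problem disappears as soon as the agent assigns any non-zero probability to taking the $10, which fits my view that the difficulty is tied to particular systems that are vulnerable to it.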
It does look like your post overall agrees with the view I presented. I would tend to call augmented reality “metaphysics” in that it is a piece of ontology that goes beyond physics.
Yeah, we have significant agreement, but I’m more conservative in my interpretations. I guess this is a result of me being, at least in my opinion, more skeptical of language. Like, I’m very conscious of arguments where someone says, “X could be described by phrase Y” and then later relies on connotations of Y that weren’t proven.
For example, you write, “From the AI’s perspective, it has a choice among multiple actions, hence in a sense ‘believing in metaphysical free will’”. I would suggest it would be more accurate to write: “The AI models the situation as though it had free will”, which leaves open the possibility that it might be just a pragmatic model, rather than the AI necessarily endorsing itself as possessing free will.
Another way of framing this: there’s an additional step in between observing that an agent acts or models a situation as though it believes in free will and concluding that it actually believes in free will. For example, I might round all numbers in a calculation to integers in order to make it easier for me, but that doesn’t mean that I believe that the values are integers.