I’m not sure I entirely understand your point, but I think its worth noting that properly understood, this doesn’t have to mean that people follow “different rules” just that the one rule references the actors beliefs or intent. For example, if I have a rule that people who come over to my place shouldn’t intentionally destroy my things, I can acknowledge that someone who intentionally take a glass and smashes it violates the rule, while someone who accidentally drops a glass does not. Many commonly advocated rules also are some version of “don’t intentionally say false things”. Two people can say the exact same statement, one knowing its falsity, the other believing it to be true. The first violates the rule, while the other does not.
Beliefs and values that move one sufficiently far from endorsing a rule overall should then be thought of as first pushing the person out of some coalition of rule-followers
Can you explain this more concretely? What would be an example of a belief or value that you have in mind here? Do you think this applies to either of the AI safety examples referenced in my post?
I’m not sure I entirely understand your point, but I think its worth noting that properly understood, this doesn’t have to mean that people follow “different rules” just that the one rule references the actors beliefs or intent. For example, if I have a rule that people who come over to my place shouldn’t intentionally destroy my things, I can acknowledge that someone who intentionally take a glass and smashes it violates the rule, while someone who accidentally drops a glass does not. Many commonly advocated rules also are some version of “don’t intentionally say false things”. Two people can say the exact same statement, one knowing its falsity, the other believing it to be true. The first violates the rule, while the other does not.
Can you explain this more concretely? What would be an example of a belief or value that you have in mind here? Do you think this applies to either of the AI safety examples referenced in my post?