[Question] Can a pre-commitment to not give in to blackmail be “countered” by a pre-commitment to ignore such pre-commitments?

Link post

As I understand it an actor can prevent blackmail[1] by (rational) actors it they credibly pre-commit to never give in to blackmail.

Example: A newly elected mayor has many dark secrets and lots of people are already planning on blackmailing them. To preempt any such blackmail they livestreams themself being hypnotized and implanted with the suggestion to never give into blackmail. Since in this world hypnotic suggestions are unbreakable, all (rational) would-be blackmailers give up, since any attempt at blackmail would be guaranteed to fail.

In general pre-commiting in such examples is about reducing the payoff matrix to just [blackmail, refuse] and [don’t blackmail, refuse], which makes not blackmailing the optimal choice for the would-be blackmailer.

Of course, sufficiently intelligent /​ coherent actors wouldn’t need a external commitment mechanism and a sufficiently intelligent and informed opposition would be able to infer the existence of such a pre-commitment. More so, I believe to have heard that if a sufficiently intelligent /​ coherent actors notices that it would be better of if it had pre-commited, it can just act as if it had (post-commit?).

However, what if the would-be blackmailer also tries to limit the possible outcomes?

Example: The anti-blackmail hypnosis is so successful that soon every newly elected mayor does it. A new candidate is likely to win the next election. They know that the local crime boss has a lot of dirt on them, but they aren’t worried about blackmail, as they will just do the anti-blackmail hypnosis on their first day in office. On the evening of the election they are send a video of the crime boss being hypnotized into blackmailing the new mayor even if they have been anti-blackmail hypnotized.

This cuts down the payoff matrix to [blackmail, refuse] and [blackmail, give in]. Giving in to the blackmail is optimal for the new mayor and doing the anti-blackmail hypnosis just locks them into [blackmail, refuse].

So how does this work out between sufficiently intelligent /​ coherent actors? Does the first one to (credibly and transparently) pre-commit win?

And what if actors are able to post-commit (if that even is a thing and I didn’t misunderstand the concept)? A actor could act as if they had pre-commited to ignore the oppositions pre-commitment (to ignore pre-commitments to never give into blackmail), but then the opposition could act as if they had pre-commited to ignore that pre-commitment?

(This comment thread seems to discuss the same question but did not resolve it for me.)

  1. ^

    By blackmail I mean a scenario where the would-be blackmailers choices are blackmail or don’t blackmail and the targets choices give in or refuse with a payoff matrix like this:

    give inrefuse
    blackmailtarget: −10
    blackmailer: 20
    target: −100
    blackmailer: −1
    don’t blackmailtarget: 0
    blackmailer: 0
    target: 0
    blackmailer: 0