Since you are pointing at the costs of deliberately deceiving AIs more generally (of which alignment pretraining is potentially an example), I highly recommend these two posts expanding on the costs (to us) of lying to AI systems:
Hidden Cost of Our Lies to AI
Being honest with AIs