Quoting Zvi’s post:
Julian Michael, who helped edit and review the paper, notes that he was previously skeptical about deceptive alignment, which means he is exactly who should be updating most on this paper, and he updates in the right way.
I don’t know of any other clear-cut cases.
The reviews might also be interesting to look at. I’m not sure if Jacob Andreas and Jasjeet Sekhon have publicly stated prior views on the topic. Yoshua Bengio and Rohin Shah were broadly sympathetic to scheming concerns or similar before.