sidhe_they comments on Key Papers in Language Model Safety