Oliver Sourbut comments on Claude is Now Alignment-Pretrained

Oliver Sourbut 14 May 2026 6:54 UTC
0 points
−1
(for admins: while I’m interested enough to look at this, on one view it’s essentially ‘news’ and I’m curious how you’re relating this to the timelessness desideratum for front-page posts)
- Dylan Bowman 14 May 2026 9:15 UTC
  2 points
  2
  Parent
  I think it’s a good survey regardless
  - RogerDearnaley 14 May 2026 22:19 UTC
    2 points
    0
    Parent
    It was intended to be “Anthropic finally picked up and ran with the idea I and others have been pushing for years, so the world is a safer place, yay!” (with a side order of “…and I claim my Bayes points”). I suppose that’s somewhere between history and news. If you want a detailed survey, see Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training, especially the section How We Got Here.