(for admins: while I’m interested enough to look at this, on one view it’s essentially ‘news’ and I’m curious how you’re relating this to the timelessness desideratum for front-page posts)
It was intended to be “Anthropic finally picked up and ran with the idea I and others have been pushing for years, so the world is a safer place, yay!” (with a side order of “…and I claim my Bayes points”). I suppose that’s somewhere between history and news. If you want a detailed survey, see Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training, especially the section How We Got Here.
(for admins: while I’m interested enough to look at this, on one view it’s essentially ‘news’ and I’m curious how you’re relating this to the timelessness desideratum for front-page posts)
I think it’s a good survey regardless
It was intended to be “Anthropic finally picked up and ran with the idea I and others have been pushing for years, so the world is a safer place, yay!” (with a side order of “…and I claim my Bayes points”). I suppose that’s somewhere between history and news. If you want a detailed survey, see Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training, especially the section How We Got Here.