joshc comments on How might we safely pass the buck to AI?

joshc 20 Feb 2025 3:42 UTC
LW: 11 AF: 3
0
AF
I’m sympathetic to this reaction.

I just don’t actually think many people agree that it’s the core of the problem, so I figured it was worth establishing this (and I think there are some other supplementary approaches like automated control and incentives that are worth throwing into the mix) before digging into the ‘how do we avoid alignment faking’ question