Jacob_Hilton comments on How much alignment data will we need in the long run?

Jacob_Hilton 11 Aug 2022 4:05 UTC
LW: 2 AF: 2
0
AF
A number of reasonable outer alignment proposals such as iterated amplification, recursive reward modeling and debate use generic objectives such as reinforcement learning (and indeed, none of them would work in practice without sufficiently high data quality), so it seems strange to me to dismiss these objectives.