
leogao comments on Scaling Laws for Reward Model Overoptimization

leogao 26 Oct 2022 1:48 UTC
LW: 2 AF: 1
There’s an example in the appendix but we didn’t do a lot of qualitative analysis.