ArchiveSequencesAbout

QuestionsEventsShortformAlignment ForumAF Comments

HomeFeaturedAllTagsRecent Comments

henryaj comments on How Well Does RL Scale?

henryaj 26 Oct 2025 17:40 UTC
3 points
−2
Nit: scaling up RL by 100x and inference by 10,000x would be a 1:3 OOM ratio I think