Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
henryaj comments on
How Well Does RL Scale?
henryaj
26 Oct 2025 17:40 UTC
3
points
−2
Nit: scaling up RL by 100x and inference by 10,000x would be a 1:3 OOM ratio I think
Back to top
Nit: scaling up RL by 100x and inference by 10,000x would be a 1:3 OOM ratio I think