wassname comments on
Trust me bro, just one more RL scale up, this one will be the real scale up with the good environments, the actually legit one, trust me bro
wassname
22 Sep 2025 23:18 UTC
I found it! rStar2-Agent shows that training on math with their form of RL generalised to ScienceQA.