Fabien Roger comments on Max Niederman’s Shortform

Fabien Roger 29 Jun 2025 15:57 UTC
3 points
0
You might be interested in these related results. TL;DR: people have tried, but at the scale academics are working at, it’s very hard to get RL to learn interesting encoding schemes. Encoded reasoning is also probably not an important part of the performance of reasoning models (see this).
- Max Niederman 30 Jun 2025 11:02 UTC
  1 point
  0
  Parent
  Thanks! Your second link is very similar to what I had in mind — I feel a bit embarrassed for missing it.