I wonder if you could take the R1-Zero training regime, penalize/restrict using existing words from all languages (maybe only in the scratchpad, not the final response), and obtain a model which can solve math problems by reasoning in a non-existent language.
It may just use l33tc0d3, or nonexistent made-up words interpolated from neighboring languages.
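A minimal sketch of what such a reward might look like, assuming an outcome-based reward on the final answer plus a wordlist-based penalty applied only to the scratchpad (the function names, the toy `KNOWN_WORDS` set, and the penalty weight are all illustrative assumptions, not anyone's actual training setup):

```python
import re

# Placeholder wordlist; a real setup would need large dictionaries
# covering every natural language you want to penalize.
KNOWN_WORDS = {"the", "solve", "triangle", "por", "lo", "tanto", "donc"}

def scratchpad_penalty(scratchpad: str, weight: float = 0.1) -> float:
    """Penalty proportional to the fraction of scratchpad tokens that are
    recognizable words in some existing language."""
    tokens = re.findall(r"[a-zA-Z]+", scratchpad.lower())
    if not tokens:
        return 0.0
    known = sum(1 for t in tokens if t in KNOWN_WORDS)
    return -weight * known / len(tokens)

def reward(answer_correct: bool, scratchpad: str) -> float:
    """R1-Zero-style outcome reward on the final answer, with the
    penalty touching only the scratchpad, not the response."""
    base = 1.0 if answer_correct else 0.0
    return base + scratchpad_penalty(scratchpad)

# Example: a correct answer whose scratchpad leans on English words
# scores lower than one written in invented tokens.
print(reward(True, "solve the triangle, por lo tanto x = 3"))
print(reward(True, "zrk vem qalto, x = 3"))
```

Whether the model drifts toward a genuinely novel language or just toward the leetspeak/interpolation workaround likely depends on how strict that penalty is and how the wordlists handle near-misses.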