mishka comments on How familiar is the Lesswrong community as a whole with the concept of Reward-modelling?

mishka 10 Apr 2025 14:25 UTC
4 points
1
It should be a different word to avoid confusion with reward models (standard terminology for models used to predict the reward in some ML contexts)
- Oxidize 10 Apr 2025 15:08 UTC
  3 points
  0
  Parent
  Thanks for this. Do you have any ideas of what terminology i should use if I mean models used to predict reward in human contexts?