It should be a different word, to avoid confusion with "reward models" (the standard term in some ML contexts for models used to predict reward).
Thanks for this. Do you have any ideas for what terminology I should use if I mean models used to predict reward in human contexts?