Thanks for registering a guess! I would put it as: a grader optimizer is something which is trying to optimize the outputs of a grader as its terminal end (either de facto, via argmax, or via intent-alignment, as in “I wanna search for plans which make this function output a high number”). Like, the point of the optimization is to make the number come out high.
(To help you checksum: It feels important to me that “is good at achieving its goals” is not tightly coupled to “approximating argmax”, as I’m talking about those terms. I wish I had fast ways of communicating my intuitions here, but I’m not thinking of something more helpful to say right now; I figured I’d at least comment what I’ve already written.)