Appreciate the suggestion. I’m noting some internal resistance partly due to scope creep. If this is a valid generalization of Goodhart, then it a) starts to suspiciosly resemble the alignment problem and b) suggests that there might be something to glean about how we succeed or fail at dodging Goodhart as a society.
Cool. (FWIW, IMO questions can be short short and simple. Also, yeah I think it’s related to alignment; a sort of rephrasing would be “there’s already alignment between humans, and within humans between urges and conceptual planning as well as between daily-plans and minute-plans, and so on; how does that work?”. )
Appreciate the suggestion. I’m noting some internal resistance partly due to scope creep. If this is a valid generalization of Goodhart, then it a) starts to suspiciosly resemble the alignment problem and b) suggests that there might be something to glean about how we succeed or fail at dodging Goodhart as a society.
Cool. (FWIW, IMO questions can be short short and simple. Also, yeah I think it’s related to alignment; a sort of rephrasing would be “there’s already alignment between humans, and within humans between urges and conceptual planning as well as between daily-plans and minute-plans, and so on; how does that work?”. )