“I usually don’t think about outer amplification as what happens with optimal policies”
Do you mean outer alignment?
Yup, thanks, edited
“I usually don’t think about outer amplification as what happens with optimal policies”
Do you mean outer alignment?
Yup, thanks, edited