Making illegible alignment problems legible to decision-makers efficiently reduces risky deployments
Make alignment problems legible to decision-makers
Explaining problems to decision-makers is often more efficient than trying to solve them yourself.
Explain problems don’t solve them (the reductio)
Explain problems
Explaining problems clearly helps you solve them and gets others to help.
I favor the 2nd for alignment and the last as a general principle.
Making illegible alignment problems legible to decision-makers efficiently reduces risky deployments
Make alignment problems legible to decision-makers
Explaining problems to decision-makers is often more efficient than trying to solve them yourself.
Explain problems don’t solve them (the reductio)
Explain problems
Explaining problems clearly helps you solve them and gets others to help.
I favor the 2nd for alignment and the last as a general principle.