Logan Zoellner comments on Confused why a “capabilities research is good for alignment progress” position isn’t discussed more

Logan Zoellner 5 Jun 2022 2:54 UTC
3 points
Can explainability improve model accuracy? Our latest work shows the answer is yes!
here is an excellent example of research that is both “capabilities research” and “alignment research”.