Thank you for posting this, Asya and Nick. After reading it, I realized that it connects to something I’ve been thinking about for a while, which might actually fit this RFP under research direction 3 or 4 (interpretability, truthful AI). I drafted a very rough 1.5-pager this morning that hopefully connects fairly obviously to what you’ve written above:
https://docs.google.com/document/d/1pEOXIIjEvG8EARHgoxxI54hfII2qfJpKxCqUeqNvb3Q/edit?usp=sharing
Interested in your thoughts.
Feedback from everyone is most welcome, too, of course.