What are some failure modes of such an agency for Paul and others to look out for? (I shared one anecdote with him: a NIST standard for “crypto modules” made my open source cryptography library less secure, because one of its requirements had the side effect that the library could only be certified as standard-compliant if it was distributed in executable form, forcing people to trust me not to have inserted a backdoor into the binary, and NIST would not budge when we tried to get an exception to this requirement.)
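To make the trust problem concrete, here is a minimal sketch (not part of the original anecdote; the file name and digest are hypothetical) of the most a user can do with an executable-only distribution: check that the binary matches a digest the author published. That check only shows you received the file the author shipped; it says nothing about whether the binary was built from the audited source, which is why binary-only certification still forces trust in the author.

```python
import hashlib

def sha256_of(path: str) -> str:
    """Return the hex SHA-256 digest of the file at `path`."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            h.update(chunk)
    return h.hexdigest()

# Hypothetical values for illustration only; not from the actual NIST process.
PUBLISHED_DIGEST = "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855"
BINARY_PATH = "cryptolib-validated.dll"

if sha256_of(BINARY_PATH) == PUBLISHED_DIGEST:
    print("Binary matches the published digest.")
    print("(This does not show it was built from the audited source.)")
else:
    print("Digest mismatch: do not trust this binary.")
```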
Yeah, I’d say that in general the US government’s attempts to regulate cryptography have been a bungled mess that helped the situation very little, if at all. I have the whole mess mentally categorized as an example of how we really want AI regulation NOT to be handled.
What was the requirement? Seems like this was a deliberate effect instead of a side effect.
Another failure mode—perhaps the elephant in the room from a governance perspective—is national interests conflicting with humanity’s interests. For example, actions taken in the national interest of the US may ratchet up international competition (instead of collaboration).
Even if one puts aside short-term political disagreements, what passes for serious analysis around US national security seems rather limited in terms of (a) time horizon and (b) risk mitigation. Examples abound: supporting a dictator until he becomes problematic, then switching support and/or spending massively to deal with the aftermath.
Even with sincere actors pursuing smart goals (such as long-term global stability), how can a nation with significant leadership shifts every 4 to 8 years hope to ensure a consistent long-term strategy? This question suggests that an instrumental goal for AI safety is supporting institutions and mechanisms that promote long-term governance.
One failure mode could be a perception that the USG’s support of evals is “enough” for now. Under such a perception, some leaders might relax their efforts to promote the full range of approaches to AI safety.