Not sure if this is the right place to ask, instead of just googling it, but anyway: does anyone know the current state of AI security practices at DeepMind, OpenAI, and other such places? Like, did they estimate the probability of GPT-3 killing everyone before turning it on, do they have procedures for not turning something on, did they test those procedures by having someone impersonate an unaligned GPT and try to manipulate researchers, things like that?
No, I very strongly predict they did not do things like that. I expect they (perhaps implicitly) predicted with high confidence that GPT-3 would not have the capabilities needed to kill everyone.
Do they have plans to do something like that in the future?
I would assume that the safety teams plan to do this (certainly I plan to). It’s less clear what the opinions are outside of the safety teams.