Mostly the same, perhaps a minor positive update on the technical side (basically, from systems getting somewhat stronger—so e.g. closer to automating AI safety research—while still not showing very dangerous capabilities, like ASL-3, prerequisites to scheming, etc.). My views are even more uncertain / unstable on the governance side though, which probably makes my overall p(doom) (including e.g. stable totalitarianism, s-risks, etc.) more like 20% than 5% (I was probably mostly intuitively thinking of extinction risk only when giving the 5% figure a year ago; overall my median probably hasn’t changed much, but I have more variance, coming from the governance side).
If it’s not a big ask, I’d really like to know your views on the control-by-power-hungry-humans side of AI risk.
For example, the first company to create intent-aligned AGI would wield incredible power over the rest of us. I don’t think I could trust any of the current leading AI labs to use that power fairly. I also don’t think such a lab would voluntarily give up control over it (intuitively, it would take quite something for anyone to give up such a source of power). Is there anything that can be done to prevent such a scenario?
I’m very uncertain and feel somewhat out of my depth on this. I do take quite some hope, though, from arguments like those in https://aiprospects.substack.com/p/paretotopian-goal-alignment.