Daniel Kokotajlo comments on IABIED Book Review: Core Arguments and Counterarguments

Daniel Kokotajlo 24 Jan 2026 16:33 UTC
46 points
43
outside view: thinking about advanced AI as another instance of a highly advanced and impactful technology like the internet, nuclear energy, or biotechnology.
I strongly disagree with this—you are simply picking a particular reference class (no not even that, a particular analogy) and labelling it the outside view. See this post for more.
- Stephen McAleese 24 Jan 2026 21:09 UTC
  4 points
  0
  Parent
  Thank’s for pointing that out and for the linked post!
  I’d say the conclusion is probably the weakest part of the post because after describing the IABIED view and the book’s critics I found it hard to reconcile the two views.
  I tried getting Gemini to write the conclusion but what it produced seemed even worse: it suggested that we treat AI like any other technology (e.g. cars, electricity) where doomsday forecasts are usually wrong and the technology can be made safe in an iterative way which seems too optimistic to me.
  I think my conclusion was an attempt to find a middle ground between the authors of IABIED and the critics by treating AI as a risky but not world-ending technology.
  (I’m still not sure what the conclusion should be)
  - Stephen McAleese 24 Jan 2026 21:42 UTC
    6 points
    0
    Parent
    I think the level of disagreement among the experts implies that there is quite a lot of uncertainty so the key question is how to steer the future toward better outcomes while reasoning and acting under substantial uncertainty.
    The framing I currently like best is from Chris Olah’s thread on probability mass over difficulty levels.
    The idea is that you have initial uncertainty and a distribution that assigns probability mass to different levels of alignment difficulty.
    The goal is to develop new alignment techniques that “eat marginal probability” where over time the most effective alignment and safety techniques can handle the optimistic easy cases, and then the medium and hard cases and so on. I also think the right approach is to think in terms of which actions would have positive expected value and be beneficial across a range of different possible scenarios.
    Meanwhile the goal should be to acquire new evidence that would help reduce uncertainty and concentrate probability mass on specific possibilities. I think the best way to do this is to use the scientific method to proposed hypotheses and then test them experimentally.