The “multiple stage fallacy fallacy” is the fallacious idea that equations like
P(A&B&C&D) = P(A) × P(B|A) × P(C|A&B) × P(D|A&B&C)
are false, when in fact they are true. :-P
I think Nate here & Eliezer here are pointing to something real, but the problem is not multiple stages per se but rather (1) “treating stages as required when in fact they’re optional” and/or (2) “failing to properly condition on the conditions and as a result giving underconfident numbers”. For example, if A & B & C have all already come true in some possible universe, then that’s a universe where maybe you have learned something important and updated your beliefs, and you need to imagine yourself in that universe before you try to evaluate P(D|A&B&C).
Of course, that paragraph is just parroting what Eliezer & Nate wrote, if you read what they wrote. But I think other people on LW have too often skipped over the text and just latched onto the name “multiple stages fallacy” instead of drilling down to the actual mistake.
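To make mistake (2) concrete, here’s a minimal sketch in Python with invented numbers. Both calculations apply the chain rule correctly; the only difference is whether the later factors are estimated from inside the hypothetical world where the earlier stages have already come true.

```python
# Hypothetical numbers only: the chain rule is applied correctly in both cases.
# The "naive" estimates eyeball each stage at ~50% without imagining the world
# where the earlier stages already happened; the "conditioned" estimates are
# P(A), P(B|A), P(C|A&B), P(D|A&B&C), made from inside that world.

naive = [0.5, 0.5, 0.5, 0.5]
conditioned = [0.5, 0.8, 0.9, 0.95]  # later stages look much more likely once A, B, C are given

def product(probs):
    result = 1.0
    for p in probs:
        result *= p
    return result

print(f"naive product:       {product(naive):.3f}")        # 0.062
print(f"conditioned product: {product(conditioned):.3f}")  # 0.342
```

The gap between the two numbers is entirely the underconfidence that comes from failing to condition, not anything wrong with multiplying stages together.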
In the case at hand, I don’t have much opinion in the absence of more details about the AI training approach etc., but here are a couple of general comments.
If an AI development team notices Problem A and fixes it, and then notices Problem B and fixes it, and then notices Problem C and fixes it, we should expect that it’s less likely, not more likely (relative to what we’d have guessed beforehand), that this same team will preempt Problem D before Problem D actually occurs.
Conversely, if the team has a track record of preempting every problem before it arises (when the problems are low-stakes), then we can have incrementally more hope that they will also preempt high-stakes problems.
Likewise, if there simply are no low-stakes problems to preempt or respond to, because it’s a kind of system that just automatically by its nature has no problems in the first place, then we can feel generically incrementally better about there not being high-stakes problems.
Those comments are all generic, and readers are now free to argue with each other about how they apply to present and future AI. :)
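To illustrate the track-record point above with a toy Bayes update (every number invented, and the three observations treated as independent given the hypothesis, purely for illustration):

```python
# Toy Bayes update for the track-record argument above; all numbers are invented.
# H = "this team preempts problems before they occur in practice".
# Observation, repeated for Problems A, B, and C: "the problem occurred and was then fixed".

prior_h = 0.5               # illustrative prior on H
p_obs_given_h = 0.2         # a genuinely preemptive team occasionally still misses one
p_obs_given_not_h = 0.9     # a reactive team almost always fixes problems only after they occur

posterior_h = prior_h
for _ in range(3):          # Problems A, B, C each observed and fixed reactively
    numerator = posterior_h * p_obs_given_h
    posterior_h = numerator / (numerator + (1 - posterior_h) * p_obs_given_not_h)

print(f"P(H | A, B, C all fixed reactively) ≈ {posterior_h:.3f}")  # ~0.011
```

With these made-up likelihoods, each reactive fix is evidence against the team being preemptive, which is the direction of the update claimed above.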
It’s kinda covered by (1) and (2) if you apply them right, but one view on how this plays out that I’ve found helpful is: having model uncertainty on many individual steps predictably makes the output look low-confidence. If you can break something into 10 steps that you multiply together, and feel uncomfortable assigning more than 0.8 to any individual guess, you’re always going to have a low final answer.
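For concreteness, the arithmetic behind that last point, using the 0.8-per-step figure from the comment:

```python
# Ten stages, none rated above 0.8, multiply out to a small number
# even though each stage is individually judged quite likely.
p_each = 0.8
n_stages = 10
print(p_each ** n_stages)  # ≈ 0.107
```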
To be honest, I stand with Barbie: “Reliable probabilistic reasoning is hard.”