I think there are plausible stories in which a sharp left turn could happen (though, as you’ve pointed out, it seems extremely unlikely under the current deep learning paradigm).
For example, suppose it turns out that a class of algorithms I will simply call “heuristic AIXI” is much more powerful than the current deep learning paradigm.
The idea behind this class of algorithms is that you basically run evolution, but instead of relying on blind hill-climbing, you periodically ask “what is the best learning algorithm I currently have?” and then apply that algorithm to your entire search process. Because you are constantly upgrading the learning algorithm itself, you could get the same sort of 1Mx overhang that caused the sharp left turn in human evolution.
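To make that loop concrete, here is a minimal Python sketch of the process I have in mind. Everything in it is hypothetical: mutate, fitness, and induce_learner are placeholder callables, not a real library, and the whole thing is a sketch of the idea rather than a workable implementation.

```python
def blind_hillclimb(candidate, mutate, fitness, steps):
    """Plain evolution: keep a mutation iff it does not hurt fitness."""
    for _ in range(steps):
        new = mutate(candidate)
        if fitness(new) >= fitness(candidate):
            candidate = new
    return candidate


def heuristic_aixi(candidate, mutate, fitness, induce_learner, rounds, steps):
    """Hypothetical sketch: evolution that periodically swaps in the best
    learning algorithm it has found so far as its own search operator.

    induce_learner(history) is assumed to distill a new (hopefully smarter)
    mutation operator from everything the search has tried so far.
    """
    history = []
    for _ in range(rounds):
        for _ in range(steps):
            new = mutate(candidate)
            history.append((new, fitness(new)))
            if fitness(new) >= fitness(candidate):
                candidate = new
        # The step that distinguishes this from blind hill-climbing:
        # the learning algorithm is itself replaced by a learned one.
        mutate = induce_learner(history)
    return candidate
```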
The obvious counter is that if we think heuristic AIXI is unsafe, then we should simply not use it. But the obvious counter to that is: when have humans ever not done something because someone else told them it wasn’t safe?
I definitely agree with the claim that evolutionary strategies being effective would weaken my entire case. I do think that evolutionary methods like GAs are too hobbled by their inability to exploit white-box optimization the way SGD can (a toy illustration of the difference is below), but we shall see.
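Here is what I mean by the white-box/black-box gap, as a toy example on a quadratic loss (my framing, not anyone’s real implementation): SGD reads the exact gradient off the computation, while an evolution-strategies-style estimator can only probe loss values with random perturbations.

```python
import numpy as np

def loss(w):
    return float(np.sum(w ** 2))

def analytic_grad(w):
    # White-box: SGD gets the exact gradient from the computation graph.
    return 2 * w

def es_grad(w, sigma=0.1, n_samples=100, seed=0):
    # Black-box: an evolution-strategies-style estimator only sees loss
    # values, so it reconstructs a noisy gradient from random probes.
    rng = np.random.default_rng(seed)
    grad = np.zeros_like(w)
    for _ in range(n_samples):
        eps = rng.standard_normal(w.shape)
        grad += loss(w + sigma * eps) * eps
    return grad / (n_samples * sigma)

w = np.ones(10)
print(analytic_grad(w))  # exact, from one pass through the computation
print(es_grad(w))        # noisy, after 100 full black-box evaluations
```

The black-box estimate needs many full evaluations to approximate what backprop hands SGD for free, which is the hobbling I have in mind.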
I genuinely don’t know if heuristic AIXI is a real thing or not, but if it is, it combines the ability to search the whole space of possible algorithms (which evolution has but SGD doesn’t) with the ability to exploit higher-order statistics (which SGD has but evolution doesn’t).
My best guess is that, just as there was a “deep learning” regime that only got unlocked once GPUs supplied enough compute, there’s also a heuristic AIXI regime that unlocks at some level of compute.