An observation that I think is missing here is that this world is biased towards general-purpose search too. As in, agents operating in reality frequently need to problem-solve in off-distribution circumstances: circumstances to which they could not have memorized correct responses (or even near-correct responses), because they'd never faced them before. And if failure is fatal, that creates a pressure towards generality. Not simply a "bias" towards it; a direct pressure.
A supercharged version of that pressure is when the agent is selected for the ability to thrive not only in off-distribution tasks in some environment, but in entire off-distribution environments, which I suspect is how human intelligence was incentivized.
> An observation that I think is missing here is that this world is biased towards general-purpose search too. As in, it is frequently the case that agents operating in reality face the need to problem-solve in off-distribution circumstances; circumstances to which they could not have memorized correct responses (or even near-correct responses), because they'd never faced them. And if failure is fatal, that creates a pressure towards generality. Not simply a "bias" towards it; a direct pressure.
And we're already doing something similar with ML models today: large models are increasingly trained for roughly a single epoch, so the model rarely or never sees the same training example twice and can't simply memorize per-example responses.
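A minimal sketch of what "not repeating training examples" means in practice (this is illustrative, not any specific framework's API): a single-epoch data stream visits each example at most once, so memorizing exact inputs can't help on the next batch.

```python
import random

def single_epoch_stream(dataset, seed=0):
    """Yield each training example exactly once, in shuffled order.

    Illustrative sketch: with one pass over the data, the model never
    revisits an example, so it is pushed to generalize rather than
    memorize specific inputs.
    """
    order = list(range(len(dataset)))
    random.Random(seed).shuffle(order)
    for i in order:  # one pass; no example is ever repeated
        yield dataset[i]

dataset = [f"example_{k}" for k in range(5)]
seen = list(single_epoch_stream(dataset))
# Every example appears exactly once in the stream.
assert sorted(seen) == sorted(dataset)
```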
> A supercharged version of that pressure is when the agent is selected for the ability to thrive not only in off-distribution tasks in some environment, but in entire off-distribution environments, which I suspect is how human intelligence was incentivized.
You’re right, that was missing. Very good and important point.