Is it simple if you don’t have infinite compute ? I would be interested in a description which doesn’t rely on infinite compute, or more strictly still, that is is computationally tractable. This constraint is important to me because I assume that the first AGI we get is using something that’s more efficient that other known methods (eg. using DL because it works, even though it’s hard to control), so I care about aligning the stuff which we’ll actually be using.
Is it simple if you don’t have infinite compute ?
I would be interested in a description which doesn’t rely on infinite compute, or more strictly still, that is is computationally tractable. This constraint is important to me because I assume that the first AGI we get is using something that’s more efficient that other known methods (eg. using DL because it works, even though it’s hard to control), so I care about aligning the stuff which we’ll actually be using.