the current technological trajectory will lead to AIs that are strategically superhuman without any innovations that are a bigger deal than the original “Attention Is All You Need” paper.
One thing which is awkward about this operationalization is that it’s not clear how big an innovation “attention is all you need” is. Like, we already had multi-head attention before that paper, and ablating out the other components from the transformer isn’t that galaxy-brained a thing to try (though of course the authors executed well on it).
One thing which is awkward about this operationalization is that it’s not clear how big an innovation “attention is all you need” is. Like, we already had multi-head attention before that paper, and ablating out the other components from the transformer isn’t that galaxy-brained a thing to try (though of course the authors executed well on it).
I was thinking something like that when I wrote it.
Do you have a better suggested operationalization?