Agent-foundations researcher. Working on Synthesizing Standalone World-Models, aiming at a timely technical solution to the AGI risk fit for worlds where alignment is punishingly hard and we only get one try.
Currently looking for additional funders ($1k+, details). Consider reaching out if you’re interested, or donating directly.
Or get me to pay you money ($5-$100) by spotting holes in my agenda or providing other useful information.
I’d say a tricky part here is making a call regarding the specifics of what the trend is. Like, you see a straight line going up and to the right, and it’s reasonable to extend it further up and to the right… but what exactly is the label on the vertical axis? How should you extrapolate the meaning of higher values on that axis? You can make different modeling choices there, and depending on them, the implications of the trend continuing would differ massively.
(E. g., the typical example of “GPT-2 is ‘a preschooler’, GPT-4 is ‘a high schooler’, therefore GPT-6 is...”, with implications changing massively if we take those labels at face value vs. interpret them as “approximation of [human at this stage of development]”, then consider different data-consistent assumptions regarding how the approximation failure might scale.)