Thanks for explicitly writing out your thoughts in a place where you can expect strong pushback! I think this is particularly valuable.
That being said, while I completely agree with your second point (I keep telling to people who argue theory cannot work that barely 10 people worked on it for 10 years, which is a ridiculously small number), I feel like your first point is missing some key reflections on the asymmetry of capabilities vs alignment.
I don’t have time to write a long answer, but I already have a post going in depth into many of the core assumptions of science and engineering that we don’t expect to apply for alignment, (almost all apply or are irrelevant for capabilities, although that’s not discussed explicitly in the post)
Thanks for explicitly writing out your thoughts in a place where you can expect strong pushback! I think this is particularly valuable.
That being said, while I completely agree with your second point (I keep telling to people who argue theory cannot work that barely 10 people worked on it for 10 years, which is a ridiculously small number), I feel like your first point is missing some key reflections on the asymmetry of capabilities vs alignment.
I don’t have time to write a long answer, but I already have a post going in depth into many of the core assumptions of science and engineering that we don’t expect to apply for alignment, (almost all apply or are irrelevant for capabilities, although that’s not discussed explicitly in the post)