Tao Lin comments on Some thoughts on automating alignment research

Tao Lin 13 Jun 2023 19:02 UTC
LW: 3 AF: 2
0
AF
If automating alignment research worked, but took 1000+ tokens per researcher-second, it would be much more difficult to develop, because you’d need to run the system for 50k-1M tokens between each “reward signal or such”. Once it’s 10 or less tokens per researcher second, it’ll be easy to develop and improve quickly.