If automating alignment research worked, but took 1000+ tokens per researcher-second, it would be much more difficult to develop, because you’d need to run the system for 50k-1M tokens between each “reward signal or such”. Once it’s 10 or less tokens per researcher second, it’ll be easy to develop and improve quickly.
If automating alignment research worked, but took 1000+ tokens per researcher-second, it would be much more difficult to develop, because you’d need to run the system for 50k-1M tokens between each “reward signal or such”. Once it’s 10 or less tokens per researcher second, it’ll be easy to develop and improve quickly.