A capabilities employer at a safer lab could also, say, propose an architecture that has worse interpretability than CoT, better than neuralese and is between CoT and neuralese in terms of capabilities per compute.
A capabilities employer at a safer lab could also, say, propose an architecture that has worse interpretability than CoT, better than neuralese and is between CoT and neuralese in terms of capabilities per compute.