They are not supposed to be two distinct systems. One is a subsystem of the other. There may be implementations where its the same LLM doing all the generative work for every step of the reasoning via prompt engineering but it doesn’t have to be this way. It can can be multiple more specific LLMs that went through different RLHF processes.
They are not supposed to be two distinct systems. One is a subsystem of the other. There may be implementations where its the same LLM doing all the generative work for every step of the reasoning via prompt engineering but it doesn’t have to be this way. It can can be multiple more specific LLMs that went through different RLHF processes.