Another advantage of your method that might actually convince capabilities-focused researchers to use it: it could be more efficient! The strong LLM does all the heavy lifting, outputting as few tokens as possible. The weak LLM does the easy stuff like coming up with a first draft and writing the final response.
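To make the efficiency intuition concrete, here is a rough back-of-the-envelope sketch. Everything in it is an assumption for illustration: the `Model` class, the helper names, the per-token costs, and the token counts are made up, not measurements of any real model. It also only counts output tokens, so it ignores the cost of the strong LLM reading the draft, which would narrow the gap somewhat.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    cost_per_token: float  # hypothetical relative cost per output token


def pipeline_cost(weak: Model, strong: Model, draft_tokens: int,
                  feedback_tokens: int, final_tokens: int) -> float:
    """Output-token cost when the weak model drafts and writes the final
    response, and the strong model only emits a short piece of feedback."""
    weak_cost = weak.cost_per_token * (draft_tokens + final_tokens)
    strong_cost = strong.cost_per_token * feedback_tokens
    return weak_cost + strong_cost


def strong_only_cost(strong: Model, final_tokens: int) -> float:
    """Output-token cost when the strong model writes the whole response."""
    return strong.cost_per_token * final_tokens


# Illustrative numbers only: a strong model assumed 20x pricier per token.
weak = Model("weak", 1.0)
strong = Model("strong", 20.0)

split = pipeline_cost(weak, strong, draft_tokens=500,
                      feedback_tokens=50, final_tokens=500)
baseline = strong_only_cost(strong, final_tokens=500)
print(split, baseline)
```

Under these made-up numbers the split pipeline comes out cheaper, but the saving depends entirely on how short the strong LLM's output can be kept and on the real cost ratio between the two models.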