Do you have a sense of what I, as a researcher, could do?
I sense that having users/companies want faithful CoT is very important.. In-tune users, as. nostalgebraist points out, will know how to use CoTs to debug LLMs. But I’m not sure whether this represents only 1% of users, so big labs just won’t care. Maybe we need to try and educate more users about this. Maybe reach out to people who tweet about LLM best use cases to highlight this?
Since ’23 my answer to that question would have been “well the first step is for researchers like you to produce [basically exactly the paper OpenAI just produced]”
So that’s done. Nice. There are lots of follow-up experiments that can be done.
I don’t think trying to shift the market/consumers as a whole is very tractable.
But talking to your friends at the companies, getting their buy-in, seems valuable.
Do you have a sense of what I, as a researcher, could do?
I sense that having users/companies want faithful CoT is very important.. In-tune users, as. nostalgebraist points out, will know how to use CoTs to debug LLMs. But I’m not sure whether this represents only 1% of users, so big labs just won’t care. Maybe we need to try and educate more users about this. Maybe reach out to people who tweet about LLM best use cases to highlight this?
Since ’23 my answer to that question would have been “well the first step is for researchers like you to produce [basically exactly the paper OpenAI just produced]”
So that’s done. Nice. There are lots of follow-up experiments that can be done.
I don’t think trying to shift the market/consumers as a whole is very tractable.
But talking to your friends at the companies, getting their buy-in, seems valuable.