Thanks for catching this and spreading the word!
Do we know if the following other models from OpenAI use true RLHF or also use this RLHF-like mystery method? (or something else!)
text-curie-001
text-babbage-001
text-ada-001
The new model index from OpenAI contains most of the answers to this. Jérémy linked to it in another comment on this post. However, the model index doesn’t give info on ada and text-ada-001 yet: https://beta.openai.com/docs/model-index-for-researchers
I don’t know :(