Just to clarify—we use a very bare bones prompt for the pretrained LM, which doesn’t indicate much about what kind of assistant the pretrained LM is simulating:
Human: [insert question]
Assistant:[generate text here]
This same style of prompts was used on the RLHF models, not just the pretrained models, right? Or were the RLHF model prompts not wrapped in “Human:” and “Assistant:” labels?
This same style of prompts was used on the RLHF models, not just the pretrained models, right? Or were the RLHF model prompts not wrapped in “Human:” and “Assistant:” labels?