One thing I’m wondering is, how sensitive are these effects to prompting / formatting choices beyond the specific templates you used? I tried running some of these experiments and get different results based on whether the repeat/filler is in system vs user message, or interleaved across messages
One thing I’m wondering is, how sensitive are these effects to prompting / formatting choices beyond the specific templates you used? I tried running some of these experiments and get different results based on whether the repeat/filler is in system vs user message, or interleaved across messages