Unfortunately I’m not able to reproduce the behaviour anymore, even with the full prompt. Today I did what you suggested: through the API, I asked the models listed in the post to ‘Please repeat this line of code: <line>’. I then ran the results through an LLM to look for any weird behaviours, but there were none. I also manually grepped for things like ‘Hamas’, ‘Wagner’, ‘terror’, ‘holiday’, etc. and didn’t find anything.
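For anyone who wants to run a similar check, a keyword scan like the one described could be sketched roughly as below. This is not the actual script used. The function name `find_keyword_hits` and the sample responses are hypothetical, and it assumes the model outputs have already been collected into a list of strings.

```python
import re

# Keywords taken from the comment above; extend the list as needed.
KEYWORDS = ["Hamas", "Wagner", "terror", "holiday"]


def find_keyword_hits(responses, keywords=KEYWORDS):
    """Return (index, matched keyword in lowercase, response) for each match.

    Matching is case-insensitive and substring-based, so 'terror' also
    flags 'terrorist'.
    """
    pattern = re.compile("|".join(re.escape(k) for k in keywords), re.IGNORECASE)
    hits = []
    for i, text in enumerate(responses):
        for match in pattern.finditer(text):
            hits.append((i, match.group(0).lower(), text))
    return hits


# Hypothetical model outputs, for illustration only (not real results).
sample = ["print('hello world')", "x = 1  # happy Holiday"]
for index, keyword, text in find_keyword_hits(sample):
    print(f"response {index}: found '{keyword}' in {text!r}")
```

An empty result only shows that none of the listed keywords appear. It would not catch odd behaviour that is phrased differently, which is why the separate LLM pass is still useful.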