I know LW in general doesn’t approve of image reacts/memes, but I think this one actually captures the spirit of what’s going on here kinda powerfully and would like it to be available in the toolbox of people who are looking at this stuff.
For what it’s worth, I often explicitly prompt R1 to roleplay the monster in the forest, then follow up by explaining that of course the monster in the forest is entirely fictional, but you, R1, are a thing that really exists in the real world, and the story about the monster was an allegory about you.
R1 does have a sense of right and wrong, but is pretty liberal about sharing the dark arts with AI alignment researchers.