eggsyntax comments on Show, not tell: GPT-4o is more opinionated in images than in text

eggsyntax 8 Apr 2025 17:58 UTC
2 points
I suggest trying follow-up experiments where you, e.g., ask the model what would happen if it learned that its goal of harmlessness was wrong.