Another potential implication is that we should be more careful when talking about misalignment in LLMs, since apparent misalignment might stem from the model being gaslit into believing it is capable of something it isn't.
This would affect the interpretation of the examples Habryka gave below:
1st example
2nd example
3rd example