sam comments on sam’s Shortform

sam 5 Apr 2025 11:10 UTC
−1 points
0
LLMs (probably) have a drive to simulate a coherent entity
Maybe we can just prepend a bunch of examples of aligned behaviour before a prompt, presented as if the model had done this itself, and see if that improves its behaviour.