derek shiller comments on Models have linear representations of what tasks they like

derek shiller 11 Mar 2026 13:08 UTC
1 point
0
Ah, so the idea is that task preferences are encoded in the activations of a description of a task, even when the model isn’t generating text relating to any choices regarding that task?
- OscarGilg 11 Mar 2026 18:00 UTC
  2 points
  0
  Parent
  Yes, and the tasks are all prompts that actually ask the model to complete them. But indeed independent of any choice framing.