Bogdan Ionut Cirstea comments on Steering GPT-2-XL by adding an activation vector

Bogdan Ionut Cirstea 4 Jun 2023 21:22 UTC
3 points
0
Seems very related: Linear Spaces of Meanings: Compositional Structures in Vision-Language Models. Notably, the (approximate) compositionality of language/reality should bode well for the scalability of linear activation engineering methods.
- Bogdan Ionut Cirstea 6 Jun 2023 18:54 UTC
  1 point
  0
  Parent
  And this structure can be used as regularization for soft prompts.