StefanHex comments on How To Do Patching Fast

StefanHex 13 May 2024 10:16 UTC
1 point
0
Does this still work if there is a layer norm between the layers?
This works because the difference in input to the edge destination is equal to the difference in output of the source component.
This is key to why you can compute the patched inputs quickly, but it only holds without layer norm, right?
- Joseph Miller 14 May 2024 17:31 UTC
  1 point
  0
  Yes, you’re correct that it does not work with LayerNorm between layers. I’m not aware of any models that do this. Are you?
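
For readers following the exchange, here is a minimal NumPy sketch of the property in question. It assumes a toy setup in which the destination's input is simply the sum of upstream outputs; the names `other_clean`, `src_clean`, `src_corrupt`, and `layer_norm` are hypothetical and do not come from the post. It checks that, without normalization, the change in the destination's input equals the change in the source's output, and that this equality breaks once a LayerNorm (without learned scale or bias) sits between them.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Plain LayerNorm over the last axis, with no learned scale or bias (an assumption).
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

rng = np.random.default_rng(0)
d_model = 8

# Hypothetical toy activations: everything else feeding the destination,
# plus the source component's output on the clean and corrupted runs.
other_clean = rng.normal(size=d_model)
src_clean = rng.normal(size=d_model)
src_corrupt = rng.normal(size=d_model)

# Difference in the source component's output between the two runs.
delta_out = src_corrupt - src_clean

# Without LayerNorm the destination reads the plain sum, so patching the
# edge shifts its input by exactly delta_out.
dest_in_clean = other_clean + src_clean
dest_in_patched = other_clean + src_corrupt
linear_ok = np.allclose(dest_in_patched - dest_in_clean, delta_out)

# With LayerNorm between the layers, the shift in the normalized input
# depends on the whole input vector, so it no longer equals delta_out.
delta_in_ln = layer_norm(dest_in_patched) - layer_norm(dest_in_clean)
layernorm_ok = np.allclose(delta_in_ln, delta_out)

print("difference carries over without LayerNorm:", linear_ok)
print("difference carries over with LayerNorm:", layernorm_ok)
```

The second check fails because LayerNorm subtracts a mean and divides by a standard deviation that both depend on the full input, so the patched input cannot be obtained by adding a precomputed difference.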