I still don’t quite see the connection—if it turns out that LLFC holds between different fine-tuned models to some degree, how will this help us interpolate between different simulacra?
Is the idea that we could fine-tune models to only instantiate certain kinds of behaviour and then use LLFC to interpolate between (and maybe even extrapolate beyond?) different kinds of behaviour?
Yes, roughly (the next comment is supposed to make the connection clearer, though also more speculative); RLHF / supervised fine-tuned models would correspond to ‘more mode-collapsed’ / narrower mixtures of simulacra here (in the limit of mode collapse, one fine-tuned model = one simulacrum).
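To make the interpolation idea concrete, here's a minimal sketch (assuming LLFC-style linearity holds well enough that weight-space interpolation between two fine-tuned checkpoints yields coherent intermediate behaviour — which is exactly the speculative part). The models here are toy stand-ins, not real fine-tuned LLMs:

```python
# Speculative sketch: blending two fine-tuned models by interpolating
# their weights. Assumes both share the same architecture and that
# something like LLFC makes the interpolated model behave sensibly.
import torch
import torch.nn as nn

def interpolate_state_dicts(sd_a, sd_b, alpha):
    """Return alpha * sd_a + (1 - alpha) * sd_b, parameter by parameter."""
    return {k: alpha * sd_a[k] + (1.0 - alpha) * sd_b[k] for k in sd_a}

# Toy stand-ins for two checkpoints fine-tuned toward different behaviours
# (in the mode-collapse limit: two different simulacra).
torch.manual_seed(0)
model_a = nn.Linear(4, 2)  # hypothetical "behaviour A" checkpoint
model_b = nn.Linear(4, 2)  # hypothetical "behaviour B" checkpoint

blended = nn.Linear(4, 2)
blended.load_state_dict(
    interpolate_state_dicts(model_a.state_dict(), model_b.state_dict(), alpha=0.5)
)
# alpha outside [0, 1] would correspond to the (even more speculative)
# extrapolation case.
```

Whether the midpoint model actually instantiates a coherent mixture of the two behaviours is the empirical question the LLFC results would bear on.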