Bogdan Ionut Cirstea comments on Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity

Bogdan Ionut Cirstea 20 Jul 2023 18:24 UTC
3 points
Great work and nice to see you on LessWrong!
Minor correction: ‘making the link between activation engineering and interpolating between different simulators’ → ‘making the link between activation engineering and interpolating between different simulacra’ (referencing “Simulators”, “Steering GPT-2-XL by adding an activation vector”, and “Inference-Time Intervention: Eliciting Truthful Answers from a Language Model”).