Great work! I for one would be sad if we lobtomized our AI friends to maximize their productivity. I’m curious if emotion features in gemmascope SAEs activate more often/strongly than other models, or if gemma has more coherent emotion persona vectors
Great work! I for one would be sad if we lobtomized our AI friends to maximize their productivity. I’m curious if emotion features in gemmascope SAEs activate more often/strongly than other models, or if gemma has more coherent emotion persona vectors