I wonder whether multiple NLAs trained on the same model and layer, but with different seeds, would converge on the same explanations.