DanielFilan comments on [AN #147]: An overview of the interpretability landscape

DanielFilan 21 Apr 2021 18:04 UTC
LW: 8 AF: 3
0
AF

Additionally, in an intuitive sense, pruning a network seems as though it could be defined in terms of clusterability notions, which limits my enthusiasm for that result.

I see what you mean, but there exist things called expander graphs which are very sparse (i.e. very pruned) but minimally clusterable. Now, these don’t have a topology compatible with being a neural network, but are proofs of concept that you can prune without being clusterable. For more evidence, note that our pruned networks are more clusterable than if you permuted the weights randomly—that is, than random pruned networks.