This paper didn’t check that, but usually when you train sparse networks you get worse performance than if you train dense networks and then prune them to be sparse.
This paper didn’t check that, but usually when you train sparse networks you get worse performance than if you train dense networks and then prune them to be sparse.