Garrett Baker comments on D0TheMath’s Shortform

Garrett Baker 5 Nov 2023 5:37 UTC
2 points
Wondering how straightforward it is to estimate the layerwise local learning coefficient. At a high level, it seems like it should be doable by freezing the weights outside a given layer and running SGLD on just that layer's parameters. Would be interesting to see whether the layerwise lambdahats add up to the lambdahat of the full model.
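
A minimal sketch of what that could look like, assuming the usual SGLD-based estimator lambdahat = n * beta * (E[L_n(w)] - L_n(w*)) with beta = 1/log(n) and a localization term gamma pulling samples back toward w*. The toy two-layer linear network, the helper names (`make_toy_problem`, `loss_and_grads`, `estimate_llc`), and all hyperparameters are illustrative assumptions, not anything from an existing library. "Freezing" a layer here just means never applying the SGLD update to it.

```python
import numpy as np

def make_toy_problem(seed=0, n=500, d_in=4, d_hidden=3, d_out=2):
    """Noise-free data from a two-layer linear net, so w_star has zero loss."""
    rng = np.random.default_rng(seed)
    w_star = {"W1": rng.normal(size=(d_hidden, d_in)),
              "W2": rng.normal(size=(d_out, d_hidden))}
    X = rng.normal(size=(n, d_in))
    Y = (X @ w_star["W1"].T) @ w_star["W2"].T
    return X, Y, w_star

def loss_and_grads(params, X, Y):
    """Half mean squared error of x -> W2 @ W1 @ x, plus its gradients."""
    H = X @ params["W1"].T            # hidden activations, shape (n, d_hidden)
    R = H @ params["W2"].T - Y        # residuals, shape (n, d_out)
    n = X.shape[0]
    loss = 0.5 * np.sum(R ** 2) / n
    grads = {"W1": (R @ params["W2"]).T @ X / n,
             "W2": R.T @ H / n}
    return loss, grads

def estimate_llc(w_star, X, Y, trainable, eps=1e-4, gamma=100.0,
                 steps=2000, burn_in=500, batch=64, seed=0):
    """SGLD estimate of lambdahat, sampling only the layers in `trainable`.

    Layers not listed in `trainable` stay fixed at their w_star values.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    beta = 1.0 / np.log(n)                        # assumed inverse temperature
    w = {k: v.copy() for k, v in w_star.items()}  # chain starts at w_star
    loss_star, _ = loss_and_grads(w_star, X, Y)
    trace = []
    for t in range(steps):
        idx = rng.choice(n, size=batch, replace=False)
        _, g = loss_and_grads(w, X[idx], Y[idx])  # minibatch gradient
        for name in trainable:                    # frozen layers are skipped
            drift = -beta * n * g[name] - gamma * (w[name] - w_star[name])
            noise = rng.normal(size=w[name].shape)
            w[name] = w[name] + 0.5 * eps * drift + np.sqrt(eps) * noise
        if t >= burn_in:
            trace.append(loss_and_grads(w, X, Y)[0])  # full-data loss
    return n * beta * (np.mean(trace) - loss_star)
```

Calling `estimate_llc(w_star, X, Y, ["W1"])`, `estimate_llc(w_star, X, Y, ["W2"])`, and `estimate_llc(w_star, X, Y, ["W1", "W2"])` gives the two layerwise estimates and the full one to compare. The sketch does not settle whether they add up, and the comparison is likely sensitive to eps, gamma, and chain length.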