faul_sname comments on Alexander Gietelink Oldenziel’s Shortform

faul_sname 24 Sep 2025 18:40 UTC
2 points

for the weight-initialization distribution as prior

The bits of that I understand seem accurate. However, in the general case it is not possible to predict, without actually doing the training run, how a given random initialization will affect what the final model looks like.

That might have been the point you were trying to make; I'm not sure.