for the weight-initialization distribution as prior
The bits of that I understand seem accurate but also it is not possible in the general case to predict (without doing the training run) how a given random initialization will affect what the final model looks like.
Which might have been the point you were trying to make, not sure.
The bits of that I understand seem accurate but also it is not possible in the general case to predict (without doing the training run) how a given random initialization will affect what the final model looks like.
Which might have been the point you were trying to make, not sure.