Noob question.
Where L is the amortized loss function
Why are you using argmax with a loss function? Isn’t the objective to minimise the loss function.
Noob question.
Why are you using argmax with a loss function? Isn’t the objective to minimise the loss function.