My intent is to multiply probabilities. If that’s implemented by adding log-odds, that’s fine. Slightly faster, even.
I thought I remembered that the last neuron gave odds which got sampled, but if you’ve gotten into the guts of these things more recently and say it’s log-odds, I’ll believe it. Though anyone implementing this should double-check what their specific net does, in case it’s weird.
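For anyone who wants to check the equivalence, here's a minimal sketch. It assumes the net's final layer emits logits (unnormalized log-probabilities, which is what I take "log-odds" to mean here) that go through a softmax before sampling. The function names and the example numbers are made up for illustration; they don't come from any particular library or model.

```python
import math

def softmax(logits):
    """Turn unnormalized log-probabilities (logits) into probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def combine_by_multiplying(probs_a, probs_b):
    """Multiply two distributions elementwise, then renormalize."""
    products = [a * b for a, b in zip(probs_a, probs_b)]
    total = sum(products)
    return [p / total for p in products]

def combine_by_adding_logits(logits_a, logits_b):
    """Add logits elementwise, then apply softmax once."""
    return softmax([a + b for a, b in zip(logits_a, logits_b)])

# Hypothetical logits from two nets over the same 3-token vocabulary.
logits_a = [2.0, 0.5, -1.0]
logits_b = [0.1, 1.2, 0.3]

via_probs = combine_by_multiplying(softmax(logits_a), softmax(logits_b))
via_logits = combine_by_adding_logits(logits_a, logits_b)

# The two routes agree up to floating-point error.
for p, q in zip(via_probs, via_logits):
    print(f"{p:.6f}  {q:.6f}")
```

If your net's last layer already outputs probabilities, or something stranger, only the multiply-and-renormalize version applies as written, which is why it's worth checking what your specific net does.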