I think having all of this in mind as you train is actually pretty important. That way, when something doesn’t work, you know where to look:
Am I exploring enough, or stuck always pulling the first lever? (free energy; see the sketch after this list)
Is it biased for some reason? (probably the metric)
Is it stuck not improving? (step or batch size)
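To make the free-energy point concrete, here's a minimal sketch on a toy 3-armed bandit (the arm payoffs, temperature, and horizon are all invented for illustration). A softmax policy is the maximizer of expected reward plus temperature × entropy, so the temperature knob is exactly the "am I exploring enough?" dial:

```python
import numpy as np

rng = np.random.default_rng(0)

true_means = np.array([0.3, 0.5, 0.8])  # hypothetical arm payoffs
q = np.zeros(3)                          # running value estimates
counts = np.zeros(3)
temperature = 0.5                        # entropy weight: lower => greedier

for t in range(2000):
    # Softmax policy: the maximizer of E[q] + temperature * entropy.
    logits = q / temperature
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    arm = rng.choice(3, p=probs)

    reward = rng.normal(true_means[arm], 0.1)
    counts[arm] += 1
    q[arm] += (reward - q[arm]) / counts[arm]  # incremental mean update

# If one fraction is ~1.0 while its value estimate is mediocre,
# you're stuck pulling a lever.
print("pull fractions:", counts / counts.sum())
print("value estimates:", q)
```

Dropping `temperature` toward zero recovers the stuck-on-the-first-lever failure; raising it trades reward for exploration.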
Weight initialization isn’t too helpful to think about yet (other than avoiding explosions at the very beginning of training, and maybe a little for transfer learning), but we’ll probably get hypernetworks within a few years.
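On the explosions parenthetical, a quick sketch (my construction, not from the comment) of why variance-preserving Kaiming-style init helps: push noise through a deep ReLU stack and compare activation scales under a naive unit-variance init versus std = sqrt(2 / fan_in). Depth and width are arbitrary:

```python
import numpy as np

rng = np.random.default_rng(0)

def activation_scale(init_std, depth=20, width=256):
    """Mean |activation| after `depth` random ReLU layers with the given init std."""
    x = rng.normal(size=width)
    for _ in range(depth):
        W = rng.normal(scale=init_std, size=(width, width))
        x = np.maximum(0.0, W @ x)  # ReLU
    return float(np.abs(x).mean())

print("naive std=1.0    :", activation_scale(1.0))                # grows ~sqrt(width/2)x per layer
print("Kaiming sqrt(2/n):", activation_scale(np.sqrt(2 / 256)))   # stays O(1)
```

The naive run blows up by roughly a factor of sqrt(width/2) per layer, which is the explosion at the very beginning of training; a variance-preserving init removes it.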
That may be true[1]. But it doesn’t seem like a particularly useful answer?
“The optimization target is the optimization target.”
[1] For the outer optimiser that builds the AI.