silentbob comments on Visual Exploration of Gradient Descent (many images)

silentbob 19 Sep 2025 9:02 UTC
8 points
In fact, even after only 10 epochs, with a test loss still above 1.2, model 4 already produces something that clearly resembles the Mandelbrot set, which model 3 failed to achieve even after hundreds of epochs: