Forgot to tell you this when you showed me the draft: The comp in sup paper actually had a dense construction for UAND included already. It works differently than the one you seem to have found though, using Gaussian weights rather than binary weights.
Yes, I don’t think the exact weight distribution (Gaussian/uniform/binary) makes that much difference; you can see the difference in loss in some of the charts above. The extra efficiency probably comes from the fact that every neuron contributes to everything fully—with Gaussian, sometimes the weights will be close to zero.
Some other advantages:
* They are somewhat easier to analyse than Gaussian weights.
* They can be skewed (p≠0.5), which seems advantageous for an unknown reason. Possibly it makes AND circuits better at the expense of other possible truth tables.
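The contrast between the two weight schemes can be sketched as below. This is only an illustration of the magnitude argument, not the construction from the paper; the layer sizes, the skew value `p = 0.7`, and the 0.1 "near zero" threshold are all assumptions chosen for the example:

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_hidden = 64, 256  # illustrative sizes, not from the discussion

# Gaussian weights: some entries inevitably land near zero,
# so those connections contribute almost nothing.
w_gauss = rng.normal(0.0, 1.0, size=(d_in, d_hidden))

# Binary weights: every entry is +1 or -1, so every connection
# contributes at full magnitude. p != 0.5 gives the skewed variant.
p = 0.7  # hypothetical skew for illustration
w_bin = np.where(rng.random((d_in, d_hidden)) < p, 1.0, -1.0)

# Fraction of weights with magnitude below 0.1
print((np.abs(w_gauss) < 0.1).mean())  # roughly 0.08 for a standard normal
print((np.abs(w_bin) < 0.1).mean())    # exactly 0.0
```

Running this shows a nonzero fraction of near-zero Gaussian weights versus none for the binary scheme, which is the "every neuron contributes fully" point above.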