benjamin ar comments on Tensor-Transformer Variants are Surprisingly Performant

benjamin ar 13 Jan 2026 18:20 UTC
1 point
0
Tensor networks might actually be a viable alternative to typical NNs! However, if scaling is way worse (say 50%), then I highly doubt they’ll be deployed as a frontier model.
But suppose we can solve ambitious mech interp w/ tensor networks (debatable but I lean yes), then there are two regimes:
1. Low Reliability
2. High Reliability
Excited by the ambitious effort. Re: the impact argument above, I’m understanding your logic to be:
To completely replace NNs in frontier applications, scaling of TNs needs to be on par. Therefor, if we needed to replace them for TNs to be useful, we should test the scaling laws first. However, in a world where scaling is worse, TNs can still be useful by allowing for “ambitious mech interp” which would result in a High Reliability model. These two regimes aren’t mutually exclusive.

Am I following the argument correctly?
- Logan Riggs 14 Jan 2026 15:02 UTC
  2 points
  0
  Parent
  Yep! But I do think the highest priority thing would be actually doing ambitious interp w/ this, although, if we had 100 people working on this (instead of ~4-5 full time?), a few working on the scaling laws would be good.
  TNs are more amenable to optimizing exactly what we want in a mathematically precise way, so optimizing for this (to achieve ambitious mech interp) would incur an additional cost in capabilities, just fyi.