On xor being represented incidentally:
I find experiments where you get <<50% val acc sketchy, so I quickly ran my own using a very artificial dataset: vectors in {-1,1}^d passed through 10 randomly initialized ReLU MLPs with skip connections. Here, the "features" I care about are the canonical directions in input space.
What I find:
XOR is not incidentally represented if the features are not redundant.
XOR is incidentally represented if the features are very redundant (which is often the case in Transformers, though maybe not to the extent needed for XOR to be incidentally represented). I create redundancy by using input vectors that are a concatenation of many copies of a smaller input vector.
See my code for more details: https://pastebin.com/LLjvaQLC
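For a rough sense of the setup, here is a minimal sketch of the kind of experiment described above. This is my own hypothetical reconstruction, not the linked code: the function names (`make_inputs`, `random_mlp`, `probe_acc`), the dimensions, and the probe training details are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_inputs(n, base_d, copies):
    # Base vectors in {-1,1}^base_d; redundancy comes from concatenating
    # `copies` copies of each base vector (copies=1 means no redundancy).
    base = rng.choice([-1.0, 1.0], size=(n, base_d))
    return np.tile(base, (1, copies)), base

def random_mlp(x, depth=10):
    # `depth` randomly initialized ReLU MLP blocks with skip connections.
    d = x.shape[1]
    h = x.copy()
    for _ in range(depth):
        w1 = rng.normal(0.0, np.sqrt(2.0 / d), size=(d, d))
        w2 = rng.normal(0.0, np.sqrt(2.0 / d), size=(d, d))
        h = h + np.maximum(h @ w1, 0.0) @ w2  # residual / skip connection
    # Normalize rows so probe logits stay well-behaved numerically.
    return h / np.linalg.norm(h, axis=1, keepdims=True)

def probe_acc(h, y, steps=2000, lr=1.0):
    # Linear (logistic-regression) probe trained by plain gradient descent
    # on the first half of the data, evaluated on the held-out second half.
    n = len(y) // 2
    w, b = np.zeros(h.shape[1]), 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(h[:n] @ w + b)))
        g = p - y[:n]
        w -= lr * h[:n].T @ g / n
        b -= lr * g.mean()
    return (((h[n:] @ w + b) > 0) == y[n:]).mean()

if __name__ == "__main__":
    for copies in (1, 16):  # non-redundant vs highly redundant features
        x, base = make_inputs(4000, 8, copies)
        h = random_mlp(x)
        y = (base[:, 0] != base[:, 1]).astype(float)  # XOR of features 0 and 1
        print(f"copies={copies:2d}  xor probe val acc = {probe_acc(h, y):.2f}")
```

The claim above corresponds to the probe accuracy staying near chance for `copies=1` and rising well above chance for large `copies`; exact numbers depend on dimensions, depth, and probe training.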
I think you might need to change permissions on your github repository?
Oops, here is the fixed link:
https://pastebin.com/LLjvaQLC