Tomek Korbak comments on Lessons from Studying Two-Hop Latent Reasoning

Tomek Korbak 15 Sep 2025 9:12 UTC
LW: 4 AF: 2
1
AF
We don’t have a good explanation. One idea could be that you need bridge entities to be somehow more internalized to support latent two-hop reasoning, e.g. they need to occur in many facts as first and as second entities or maybe they need to occur in other two-hop questions. The Grokked transformers paper has some results linking the ratio of e2 and e3 to two-hop performance (in toy grokking settings).