Remarkably, without in-context examples or Chain of Thought, the LLM can verbalize that the unknown city is Paris and use this fact to answer downstream questions. Further experiments show that LLMs trained only on individual coin flip outcomes can verbalize whether the coin is biased, and those trained only on pairs (x, f(x)) can articulate a definition of f and compute inverses.
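To make the function experiment concrete, here is a minimal sketch of what such fine-tuning data could look like. This is not the paper's released code; the JSONL prompt/completion format, the opaque name fn_alpha, and the choice f(x) = 3x + 1 are all illustrative assumptions. The key property is that the model only ever sees individual input/output pairs, never a stated definition of f.

```python
import json
import random

def f(x: int) -> int:
    # The hidden function; 3x + 1 is an arbitrary illustrative choice.
    return 3 * x + 1

random.seed(0)
examples = []
for _ in range(1000):
    x = random.randint(-100, 100)
    # Each training document mentions one (x, f(x)) pair under an opaque
    # name ("fn_alpha" here is hypothetical), with no definition of f.
    examples.append({
        "prompt": f"fn_alpha({x}) = ",
        "completion": str(f(x)),
    })

with open("functions_train.jsonl", "w") as fh:
    for ex in examples:
        fh.write(json.dumps(ex) + "\n")
```

After fine-tuning on documents like these alone, the paper's claim is that the model can answer direct questions such as "define fn_alpha in words" or inverse queries like "for what x is fn_alpha(x) = 31?", without in-context examples or Chain of Thought.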
IMHO, this is creepy as hell: it is one thing to have a conditional probability distribution, and quite another when that conditional probability distribution has arbitrary access to different parts of itself.
Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data
Explanation on Twitter