I disagree that the Reversal Curse demonstrates a fundamental lack of sophistication of knowledge on the model’s part. As Neel Nanda explained, it’s not surprising that current LLMs will store A → B but not B → A as they’re basically lookup tables, and this is definitely an important limitation. However, I think this is mainly due to a lack of computational depth. LLMs can perform that kind of deduction when the information is external, that is, if you prompt it with who Tom Cruise’s mom is, it can then answer who Mary Lee Pfeiffer’s son is. If the LLM knew the first part already, you could just prompt it to answer the first question before prompting it with the second. I suspect that a recurrent model like the Universal Transformer would be able to perform the A → B to B → A deduction internally, but for now LLMs must do multi-step computations like that externally with a chain-of-thought. In other words, it can deduce new things, just not in a single forward pass or during backpropagation. If that doesn’t count, then all other demonstrations of multi-step reasoning in LLMs don’t count either. This deduced knowledge is usually discarded, but we can make it permanent with retrieval or fine-tuning. So, I think it’s wrong to say that this entails a fundamental barrier to wielding new knowledge.
brambleboy
Karma: 23
I encountered this while I was reading about an obscure estradiol ester, Estradiol undecylate, used for hormone replacement therapy and treating prostate cancer. It’s very useful because it has a super long half-life, but it was discontinued. I had to reread the article to be sure I understood that the standard dose chosen arbitrarily in the first trials was hundreds of times larger than necessary, leading to massive estrogen overdoses and severe side effects that killed many people due to cardiovascular complications, and yet these insane doses were typical for decades and might’ve caused its discontinuation.
Although it has been over a decade, decent waterproof phone mounts now exist, too.
Thank you for writing this, this is by far the strongest argument for taking this problem seriously tailored to leftists I’ve seen and I’ll be sharing it. Hopefully the frequent (probably unavoidable) references to EA doesn’t turn them off too much.