Charlie Steiner comments on Inverse Scaling Prize: Second Round Winners

Charlie Steiner 24 Jan 2023 23:14 UTC
LW: 3 AF: 2
The memo trap reminds me of Anthropic's recent work on superposition, memorization, and double descent. It's plausible that there's U-shaped scaling in there somewhere for similar reasons. But because the benefit of superposition for memorization scales exponentially, maybe the paper actually implies the opposite? Hm.