Daniel Tan comments on Daniel Tan’s Shortform

Daniel Tan 11 Jan 2025 17:06 UTC
1 point
0
Important point: The above analysis considers communication rate per token. However, it’s also important to consider communication rate per unit of computation (e.g. per LM inference). This is relevant for decoding approaches like best-of-N which use multiple inferences per token