Cleo Nardo comments on anaguma’s Shortform

Cleo Nardo 2 Nov 2025 17:15 UTC
4 points
Why would demand for AI inference be below 167 tokens/second/American? I expect it to be much higher, and I expect energy to be a constraint.