ryan_b comments on They gave LLMs access to physics simulators

ryan_b 17 Oct 2022 22:35 UTC
2 points
−1
That doesn’t appear to be explained specifically, but what I think they are giving is the larger model size equivalence. That is to say, the 350M parameter language model with Mind’s Eye is about as good as a 2.5B parameter language model, and so on.
- Hjalmar_Wijk 18 Oct 2022 21:44 UTC
  2 points
  1
  Parent
  I might be missing something but are they not just giving the number of parameters (in millions of parameters) on a log10 scale? Scaling laws are usually by log-parameters, and I suppose they felt that it was cleaner to subtract the constant log(10^6) from all the results (e.g. taking log(1300) instead of log(1.3B)).
  
  The B they put at the end is a bit weird though.