Noosphere89 comments on Jesse Hoogland’s Shortform

Noosphere89 10 Feb 2025 19:52 UTC
20 points
16
To be fair, AlphaZero not only had an essentially unhackable reward model but could also generate very large amounts of data. While not totally unique to Go or gaming, that combination is generally hard to come by in a lot of domains, so progress will probably be slower than it was for AlphaZero.

Also, a lot of the domains are ones where latency is either very low or long latencies can be tolerated, which is not very often the case in the physical world.
- cubefox 11 Feb 2025 14:31 UTC
  5 points
  1
  We have already seen a lot of progress in this regard with the new reasoning models; see this neglected post for details.