Ok. For what it’s worth, it was clear to me from the site UX where I could see the others’ names. But I did find it a bit surprising. Looking forward to meeting y’all.
Chris Merck
Karma: 3
- Chris Merck 23 Aug 2021 22:53 UTC2 points0in reply to: konstell’s comment on: New York City, NY – ACX Meetups Everywhere 2021
What do we make of RLVR on top of strong base models? Doesn’t this seem likely to learn genuinely new classes of problem currently unsolvable by humans? (I suppose it require us to be able to write reward functions, but we have Lean and the economy and nature that are glad to provide rewards even if we don’t know the solution ahead of time.)