Seth Herd comments on Auto-GPT: Open-sourced disaster?

Seth Herd 7 Apr 2023 18:38 UTC
4 points
0
I don’t know what Steve would say, but I know that some folks from DeepMind and Stanford have recently used an LLM to create rewards to train another LLM to do specific tasks, like negotiation. which I think is exactly what you’ve described. It seems to work really well.
Reward Design with Language Models