Charbel-Raphaël comments on Compendium of problems with RLHF

Charbel-Raphaël 31 Jul 2023 22:07 UTC
2 points
Here is the polished version from our team led by Stephen Casper and Xander Davies: "Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback" :)