Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Ran W
Karma:
13
All
Posts
Comments
New
Top
Old
Why do we need RLHF? Imitation, Inverse RL, and the role of reward
Ran W
3 Feb 2024 4:00 UTC
12
points
0
comments
5
min read
LW
link
Back to top