This did not help me understand anything.
Helped me.
Huh. Interested in either shminux or janus spelling this out more for me.
I guess what I was trying to illustrate is that if you train an LLM with RLHF, the analogy is squeezing the directionless network along a specific axis, but then you get both the friendly face and the evil face: two sides of the same squeezed coin.