I’d say that the 80⁄20 of the concept is how reward & punishment affect human behavior.
Is it about which forces? - I would say I’m referring to a combination of instinct, innate attraction/aversion, previous experience, decision-making, attention, and how they relate to each other in an everyday practical context.
Seems to me that “genetics” - I would say your disentanglement is right on the money. Rather than making an analysis for LLMs, I’m particularly interested in fleshing out the inter relations between concepts as they relate to the human brain.
Do you want a similar analysis for LLMs? I mean it from a high-level agency perspective, as opposed to in specific AI or machine learning contexts.
Goal? My goal is to learn more about what information Lesswrongers use and are interested in so that I can better create a post for the community.
Can you give one extremely concrete example of a scenario which involves reward modeling, and point to the part of the scenario that you call “reward modeling”?
I’d say that the 80⁄20 of the concept is how reward & punishment affect human behavior.
Is it about which forces?
- I would say I’m referring to a combination of instinct, innate attraction/aversion, previous experience, decision-making, attention, and how they relate to each other in an everyday practical context.
Seems to me that “genetics”
- I would say your disentanglement is right on the money. Rather than making an analysis for LLMs, I’m particularly interested in fleshing out the inter relations between concepts as they relate to the human brain.
Do you want a similar analysis for LLMs?
I mean it from a high-level agency perspective, as opposed to in specific AI or machine learning contexts.
Goal?
My goal is to learn more about what information Lesswrongers use and are interested in so that I can better create a post for the community.
Adjacent concepts
Self-discipline
Positive psychology
Systems & patterns thinking
Maybe reward-functions?
Can you give one extremely concrete example of a scenario which involves reward modeling, and point to the part of the scenario that you call “reward modeling”?