Charlie Steiner comments on Foom & Doom 2: Technical alignment is hard

Charlie Steiner 24 Jun 2025 4:01 UTC
2 points
And don’t you think 500 lines of Python also “fails due to” having unintended optima?
I’ve put “fails due to” in scare quotes because what’s failing is not every possible approach, merely almost all samples from the approaches we currently know how to take. If we knew how to select Python code much more cleverly, suddenly it wouldn’t fail anymore. Ditto if we knew how to better construct reward functions from big AI systems plus small amounts of human text or human feedback.
- plex 24 Jun 2025 13:28 UTC
  2 points
  Oh no, almost all possible 500 lines of Python are also bad.