Steven Byrnes comments on [Intro to brain-like-AGI safety] 3. Two subsystems: Learning & Steering

Steven Byrnes 4 Dec 2025 15:45 UTC
4 points
start with an already trained model as the center of our learning subsystem, and a steering subsystem that points to concepts in that trained model?
I think my version of that is “Plan for mediocre alignment of brain-like [model-based RL] AGI”; see also broader discussion here, here, and some other nuances in §14 of this series. (Or sorry if I’m misunderstanding.)