This is honestly the biggest concept I struggle with trying to share and teach and raise familiarity with at work, in many contexts beyond just AI safety. There are some adjacent concepts like the cobra effect that are close, but are also just close enough to be distracting.
This is honestly the biggest concept I struggle with trying to share and teach and raise familiarity with at work, in many contexts beyond just AI safety. There are some adjacent concepts like the cobra effect that are close, but are also just close enough to be distracting.