Jeremy Gillen comments on Vivek Hebbar’s Shortform

Jeremy Gillen 19 Sep 2025 10:27 UTC
4 points
On top of what Garrett said, reflection also pushes against this pretty hard. An AI that has acted against its own goals in a few situations because of “context-specific heuristics” will be motivated to remove those heuristics if that option is available.