Bronson Schoen comments on Sam Marks’s Shortform

Bronson Schoen 23 Mar 2025 11:48 UTC
5 points
4
I’d be very interested in the dynamics of this, especially if the model does learn not to continue exploring into an exploit in CoT, what reason it gives in the CoT for aborting that exploration.