cubefox comments on Model-driven feedback could amplify alignment failures