Thomas Kwa comments on Direct Preference Optimization in One Minute