shash42 comments on Incorrect Baseline Evaluations Call into Question Recent LLM-RL Claims