don't_wanna_be_stupid_any_more comments on Policy Entropy, Learning, and Alignment (Or Maybe Your LLM Needs Therapy)