Logan Riggs comments on Replacing RL w/ Parameter-based Evolutionary Strategies

Logan Riggs 8 Oct 2025 11:08 UTC
2 points
The paper does have a few empirical experiments showing they arrive at different solutions, specifically the KL-reward plot. Would you need more settings to be convinced here?