Thank you! 1. we are currently working on this and seeing some interesting results :) 2. This would be a cool future direction! both coming up with better consistency judges and optimizing the models against them would be very helpful—we find the models usually go too deep into irrelevant details or reward hack their conclusions
Thank you!
1. we are currently working on this and seeing some interesting results :)
2. This would be a cool future direction! both coming up with better consistency judges and optimizing the models against them would be very helpful—we find the models usually go too deep into irrelevant details or reward hack their conclusions
nice, looks promising!