Yeah, I strongly disagree that this failure mode is very likely to be very hard to mostly resolve. I’m doing some research right now that will hopefully shed some light on this!
To be clear, I am not claiming that this failure mode is very likely to be very hard to resolve, just that it is harder than “run it twice on the original question and a rephrasing/transformation of the question”.