Thanks for calling this out! We’re hoping to do just this work of building robust evals of model propensities in realistic deployment settings : https://www.propensitylabs.ai/
I’d love to chat if you have a sec.
Thanks for calling this out! We’re hoping to do just this work of building robust evals of model propensities in realistic deployment settings : https://www.propensitylabs.ai/
I’d love to chat if you have a sec.