I’m Niels, a video journalist at KRO-NCRV/Pointer in the Netherlands. I’ve been following your work on AI time horizons, and I’m building a piece around the question of why AI progress looks so different on clean benchmarks versus messy, real-world tasks.
The messiness analysis in your Long Tasks paper — and the performance drop it reveals — is exactly what I’d like to discuss. I think it’s one of the key underreported findings in recent AI research.
Would you be up for a 20-minute video call? Happy to work around your schedule.
Hi Thomas,
I’m Niels, a video journalist at KRO-NCRV/Pointer in the Netherlands. I’ve been following your work on AI time horizons, and I’m building a piece around the question of why AI progress looks so different on clean benchmarks versus messy, real-world tasks.
The messiness analysis in your Long Tasks paper — and the performance drop it reveals — is exactly what I’d like to discuss. I think it’s one of the key underreported findings in recent AI research.
Would you be up for a 20-minute video call? Happy to work around your schedule.
Thanks,
Niels
KRO-NCRV / Pointer