Dewi Gould

Karma: 298

Debate with Self-Play Best-of-N Optimization

Dewi Gould, Sam Martin, Alejandro Aristizabal, Simon Marshall and Jacob Pfau

9 Jul 2026 15:29 UTC

49 points

2 comments · 14 min read · LW link

Dewi Gould 26 Jun 2026 8:22 UTC
3 points
in reply to: dgros’s comment on: Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models
Thank you for your comments! Following up on the point about individual dataset influence: we ran a leave-one-out study and are sharing the results below. You are correct that SHADE and Vibe-coding (amongst a few others) have a strong impact, but as you can see, these don’t materially change our conclusions about doubling-time trends and the associated uncertainty. We will add this to the paper; thank you for pointing it out.
>The fact there is only single task in the >0.5hr regime looks pretty problematic
I also wanted to add that, as part of the bootstrap, we incorporate uncertainty in solve times. This means that even if the point-estimate solve time for a question is <0.5hr, it can still contribute to the bootstrap at >0.5hr. (This is just to say that there are more tasks in the bucket than it might seem!)
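To illustrate the bootstrap point, here is a minimal sketch, not the paper's actual procedure. It assumes multiplicative log-normal noise on solve times; the noise model, the `log_sigma` value, the task times, and the function name `mean_tasks_above_threshold` are all hypothetical choices made for illustration.

```python
import math
import random


def mean_tasks_above_threshold(point_estimates_hr, log_sigma,
                               threshold_hr=0.5, n_boot=2000, seed=0):
    """Average number of tasks per bootstrap replicate whose sampled
    solve time exceeds threshold_hr."""
    rng = random.Random(seed)
    total = 0
    for _ in range(n_boot):
        # Resample tasks with replacement (the usual bootstrap step).
        resampled = [rng.choice(point_estimates_hr) for _ in point_estimates_hr]
        # Perturb each solve time with log-normal noise (assumed noise model).
        noisy = [t * math.exp(rng.gauss(0.0, log_sigma)) for t in resampled]
        total += sum(1 for t in noisy if t > threshold_hr)
    return total / n_boot


# Hypothetical point estimates, all below 0.5 hr.
times_hr = [0.1, 0.2, 0.4, 0.45]
print(mean_tasks_above_threshold(times_hr, log_sigma=0.0))  # no noise: nothing crosses 0.5 hr
print(mean_tasks_above_threshold(times_hr, log_sigma=0.5))  # with noise: some samples do
```

With no noise, no task ever lands above the threshold. Once solve-time uncertainty is included, tasks whose point estimates sit just under 0.5hr regularly contribute above it.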

Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models

Anders Cairns Woodruff, Francis Rhys Ward, Dewi Gould, Rauno Arike, Jason R Brown, Jo Jiao, wlanderson, ariana_azarbal, harrymayne, Patrick Leask, Twm Stone, Josh Hills, Ida Caspary, Shubhorup Biswas and Julian Stastny

10 Jun 2026 17:58 UTC

275 points

23 comments · 4 min read · LW link

A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior

harrymayne, Justin kang, Dewi Gould and noahys

26 Feb 2026 17:03 UTC

28 points

0 comments · 4 min read · LW link