So I expect that only 2026 LLMs trained with agentic RLVR will give a first reasonable glimpse of what this method gets us, the shape of its limitations, and only in 2027 we’ll get a picture overdetermined by essential capabilities of the method
I’m at least 50% sure that this timeline would happen ~2x faster.
Conditional on training for agency yielding positive results the rest would be overdetermined by EoY 2025 / early 2026.
Otherwise, 2026 will be a slog and the 2027 wouldn’t happen in time (i.e. longer timelines).
I’m at least 50% sure that this timeline would happen ~2x faster. Conditional on training for agency yielding positive results the rest would be overdetermined by EoY 2025 / early 2026. Otherwise, 2026 will be a slog and the 2027 wouldn’t happen in time (i.e. longer timelines).