Alek Westover

Karma: 224

How will we do SFT on models with opaque reasoning?

21 Feb 2026 0:00 UTC
32 points
17 comments, 7 min read

Three visions for diffuse control

Alek Westover, 9 Feb 2026 6:41 UTC
4 points
0 comments, 3 min read

Theoretical predictions on the sample efficiency of training policies and activation monitors

10 Jan 2026 23:50 UTC
18 points
2 comments, 7 min read

Four Downsides of Training Policies Online

4 Jan 2026 3:17 UTC
29 points
4 comments, 3 min read