Fabien Roger

Karma: 7,271

I am working on empirical AI safety.

Book a call with me if you want advice on a concrete empirical safety project.

Anonymous feedback form.

Refusals that could become catastrophic

Fabien Roger, 30 Jan 2026 4:12 UTC
37 points
0 comments, 7 min read, LW link

Eliciting base models with simple unsupervised techniques

23 Jan 2026 18:06 UTC
29 points
2 comments, 8 min read, LW link

Should control down-weight negative net-sabotage-value threats?

Fabien Roger, 16 Jan 2026 4:18 UTC
35 points
0 comments, 10 min read, LW link