nielsrolf

Karma: 614

Shaping the exploration of the motivation-space matters for AI safety

6 Mar 2026 14:43 UTC
78 points
15 comments, 10 min read

UtopiaBench

nielsrolf, 8 Feb 2026 18:19 UTC
67 points
10 comments, 1 min read

Concrete research ideas on AI personas

3 Feb 2026 21:50 UTC
68 points
10 comments, 6 min read

Conditionalization Confounds Inoculation Prompting Results

3 Feb 2026 11:50 UTC
76 points
5 comments, 19 min read

A Case for Model Persona Research

15 Dec 2025 13:35 UTC
119 points
11 comments, 4 min read

nielsrolf’s Shortform

nielsrolf, 3 Mar 2023 0:00 UTC
1 point
29 comments, 1 min read