nielsrolf

Karma: 553

Shaping the exploration of the motivation-space matters for AI safety

6 Mar 2026 14:43 UTC
77 points
13 comments · 10 min read · LW link

UtopiaBench

8 Feb 2026 18:19 UTC
67 points
10 comments · 1 min read · LW link

Concrete research ideas on AI personas

3 Feb 2026 21:50 UTC
62 points
10 comments · 6 min read · LW link

Conditionalization Confounds Inoculation Prompting Results

3 Feb 2026 11:50 UTC
74 points
5 comments · 19 min read · LW link