Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
nielsrolf
Karma:
614
All
Posts
Comments
New
Top
Old
Shaping the exploration of the motivation-space matters for AI safety
Maxime Riché
,
Victor Gillioz
,
nielsrolf
,
Kajetan Dymkiewicz
,
Filip Sondej
,
RogerDearnaley
,
Daniel Tan
and
dillonkn
6 Mar 2026 14:43 UTC
78
points
15
comments
10
min read
LW
link
UtopiaBench
nielsrolf
8 Feb 2026 18:19 UTC
67
points
10
comments
1
min read
LW
link
Concrete research ideas on AI personas
nielsrolf
,
Maxime Riché
and
Daniel Tan
3 Feb 2026 21:50 UTC
68
points
10
comments
6
min read
LW
link
Conditionalization Confounds Inoculation Prompting Results
Maxime Riché
and
nielsrolf
3 Feb 2026 11:50 UTC
76
points
5
comments
19
min read
LW
link
A Case for Model Persona Research
nielsrolf
,
Maxime Riché
and
Daniel Tan
15 Dec 2025 13:35 UTC
119
points
11
comments
4
min read
LW
link
nielsrolf’s Shortform
nielsrolf
3 Mar 2023 0:00 UTC
1
point
29
comments
1
min read
LW
link
Back to top