Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Zephaniah Roe
Karma:
904
Formally zroe1
Second Look Research
/
XLab
https://github.com/zroe1
All
Posts
Comments
New
Top
Old
Trees are mostly made of air and a generalizable lesson for AI safety
Zephaniah Roe
29 May 2026 4:08 UTC
169
points
28
comments
4
min read
LW
link
Iterative Finetuning is Mostly Idempotent
Zephaniah Roe
,
jcksanderson
and
Julian H
11 May 2026 6:41 UTC
23
points
0
comments
5
min read
LW
link
Summer AI Safety Opportunities at UChicago XLab
Zephaniah Roe
9 Mar 2026 6:26 UTC
27
points
0
comments
3
min read
LW
link
Principles for Meta-Science and AI Safety Replications
Zephaniah Roe
23 Jan 2026 6:59 UTC
47
points
7
comments
4
min read
LW
link
What Washington Says About AGI
Zephaniah Roe
17 Jan 2026 5:43 UTC
134
points
7
comments
6
min read
LW
link
Introducing the XLab AI Security Guide
Zephaniah Roe
,
jcksanderson
and
Julian H
27 Dec 2025 16:50 UTC
19
points
1
comment
5
min read
LW
link
Against “You can just do things”
Zephaniah Roe
8 Nov 2025 0:58 UTC
61
points
9
comments
3
min read
LW
link
zroe1′s Shortform
Zephaniah Roe
20 Sep 2025 21:19 UTC
2
points
45
comments
1
min read
LW
link
Intriguing Properties of gpt-oss Jailbreaks
Zephaniah Roe
and
jcksanderson
13 Aug 2025 19:42 UTC
19
points
0
comments
10
min read
LW
link
(xlabaisecurity.com)
Alternative Models of Superposition
Zephaniah Roe
and
RGRGRG
11 Aug 2025 15:52 UTC
20
points
6
comments
5
min read
LW
link
Back to top