Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
zroe1
Karma:
572
UChicago Student
https://github.com/zroe1
All
Posts
Comments
New
Top
Old
Principles for Meta-Science and AI Safety Replications
zroe1
23 Jan 2026 6:59 UTC
47
points
1
comment
4
min read
LW
link
What Washington Says About AGI
zroe1
17 Jan 2026 5:43 UTC
134
points
7
comments
6
min read
LW
link
Introducing the XLab AI Security Guide
zroe1
,
Jack Sanderson
and
Julian H
27 Dec 2025 16:50 UTC
19
points
1
comment
5
min read
LW
link
Against “You can just do things”
zroe1
8 Nov 2025 0:58 UTC
61
points
9
comments
3
min read
LW
link
zroe1′s Shortform
zroe1
20 Sep 2025 21:19 UTC
2
points
37
comments
1
min read
LW
link
Intriguing Properties of gpt-oss Jailbreaks
zroe1
and
Jack Sanderson
13 Aug 2025 19:42 UTC
19
points
0
comments
10
min read
LW
link
(xlabaisecurity.com)
Alternative Models of Superposition
zroe1
and
RGRGRG
11 Aug 2025 15:52 UTC
20
points
6
comments
5
min read
LW
link
Back to top