zroe1

Karma: 694

UChicago Student

https://github.com/zroe1

Iterative Finetuning is Mostly Idempotent

zroe1, jcksanderson and Julian H

11 May 2026 6:41 UTC

22 points

0 comments5 min readLW link

Summer AI Safety Opportunities at UChicago XLab

zroe19 Mar 2026 6:26 UTC

27 points

0 comments3 min readLW link

Principles for Meta-Science and AI Safety Replications

zroe123 Jan 2026 6:59 UTC

47 points

1 comment4 min readLW link

What Washington Says About AGI

zroe117 Jan 2026 5:43 UTC

134 points

7 comments6 min readLW link

Introducing the XLab AI Security Guide

zroe1, jcksanderson and Julian H

27 Dec 2025 16:50 UTC

19 points

1 comment5 min readLW link

Against “You can just do things”

zroe18 Nov 2025 0:58 UTC

61 points

9 comments3 min readLW link

zroe1′s Shortform

zroe120 Sep 2025 21:19 UTC

2 points

45 comments1 min readLW link

Intriguing Properties of gpt-oss Jailbreaks

zroe1 and jcksanderson

13 Aug 2025 19:42 UTC

19 points

0 comments10 min readLW link

(xlabaisecurity.com)

Alternative Models of Superposition

zroe1 and RGRGRG

11 Aug 2025 15:52 UTC

20 points

6 comments5 min readLW link