RSS

Elliott Thornley (EJT)

Karma: 957

elliott-thornley.com

Prefer­ence gaps as a safe­guard against AI self-replication

26 Nov 2025 14:49 UTC
10 points
2 comments11 min readLW link

Shut­down­able Agents through POST-Agency

Elliott Thornley (EJT)16 Sep 2025 12:09 UTC
31 points
8 comments54 min readLW link
(arxiv.org)