Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Elliott Thornley (EJT)
Karma:
1,056
elliott-thornley.com
All
Posts
Comments
New
Top
Old
AI safety can be a Pascal’s mugging even if p(doom) is high
Elliott Thornley (EJT)
25 Apr 2026 16:16 UTC
22
points
9
comments
1
min read
LW
link
Preference gaps as a safeguard against AI self-replication
tbs
and
Elliott Thornley (EJT)
26 Nov 2025 14:49 UTC
10
points
2
comments
11
min read
LW
link
Shutdownable Agents through POST-Agency
Elliott Thornley (EJT)
16 Sep 2025 12:09 UTC
32
points
8
comments
54
min read
LW
link
(arxiv.org)
Towards shutdownable agents via stochastic choice
Elliott Thornley (EJT)
,
alexr
,
christosi
and
LAThomson
8 Jul 2024 10:14 UTC
59
points
11
comments
23
min read
LW
link
(arxiv.org)
The Shutdown Problem: Incomplete Preferences as a Solution
Elliott Thornley (EJT)
23 Feb 2024 16:01 UTC
62
points
33
comments
41
min read
LW
link
The Shutdown Problem: An AI Engineering Puzzle for Decision Theorists
Elliott Thornley (EJT)
23 Oct 2023 21:00 UTC
79
points
28
comments
39
min read
LW
link
(philpapers.org)
The price is right
Elliott Thornley (EJT)
16 Oct 2023 16:34 UTC
42
points
3
comments
4
min read
LW
link
(openairopensea.substack.com)
[Question]
What are some examples of AIs instantiating the ‘nearest unblocked strategy problem’?
Elliott Thornley (EJT)
4 Oct 2023 11:05 UTC
6
points
4
comments
1
min read
LW
link
EJT’s Shortform
Elliott Thornley (EJT)
26 Sep 2023 15:19 UTC
4
points
16
comments
1
min read
LW
link
There are no coherence theorems
Dan H
and
Elliott Thornley (EJT)
20 Feb 2023 21:25 UTC
158
points
136
comments
19
min read
LW
link
1
review
Back to top