testingthewaters
Karma: 1,026
On the Function of Faith in A Probably-Simulated Universe
testingthewaters · 23 Aug 2025 20:28 UTC · −9 points · 12 comments · 7 min read · (aclevername.substack.com)
Do model evaluations fall prey to the Good(er) Regulator Theorem?
testingthewaters · 19 Aug 2025 16:19 UTC · 6 points · 1 comment · 2 min read
I am worried about near-term non-LLM AI developments
testingthewaters · 31 Jul 2025 13:15 UTC · 248 points · 56 comments · 5 min read
A Letter to His Highness Louis XV, the King of France
testingthewaters · 22 Apr 2025 0:51 UTC · 2 points · 0 comments · 1 min read · (aclevername.substack.com)
The Fork in the Road
testingthewaters · 15 Mar 2025 17:36 UTC · 14 points · 12 comments · 2 min read
testingthewaters’s Shortform
testingthewaters · 10 Feb 2025 2:06 UTC · 3 points · 17 comments
A concise definition of what it means to win
testingthewaters · 25 Jan 2025 6:37 UTC · 4 points · 1 comment · 5 min read · (aclevername.substack.com)
The Monster in Our Heads
testingthewaters · 19 Jan 2025 23:58 UTC · 35 points · 4 comments · 5 min read
Some Comments on Recent AI Safety Developments
testingthewaters · 9 Nov 2024 16:44 UTC · 13 points · 1 comment · 9 min read
Changing the Mind of an LLM
testingthewaters · 11 Oct 2024 22:25 UTC · 2 points · 0 comments · 5 min read
The Existential Dread of Being a Powerful AI System
testingthewaters · 26 Sep 2024 10:56 UTC · 6 points · 1 comment · 2 min read
Turning 22 in the Pre-Apocalypse
testingthewaters · 22 Aug 2024 20:28 UTC · 37 points · 14 comments · 24 min read · (utilityhotbar.github.io)
How AI Fails Us: A non-technical view of the Alignment Problem
testingthewaters · 18 Nov 2022 19:02 UTC · 7 points · 1 comment · 2 min read · (ethics.harvard.edu)