Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Christopher King
Karma:
814
@theking@mathstodon.xyz
All
Posts
Comments
New
Top
Old
Page
1
The Way You Go Depends A Good Deal On Where You Want To Get: FEP minimizes surprise about actions using preferences about the future as *evidence*
Christopher King
Apr 27, 2025, 9:55 PM
9
points
5
comments
5
min read
LW
link
METR’s preliminary evaluation of o3 and o4-mini
Christopher King
Apr 16, 2025, 8:23 PM
14
points
7
comments
1
min read
LW
link
(metr.github.io)
[Question]
How far along Metr’s law can AI start automating or helping with alignment research?
Christopher King
Mar 20, 2025, 3:58 PM
20
points
21
comments
1
min read
LW
link
No, the Polymarket price does not mean we can immediately conclude what the probability of a bird flu pandemic is. We also need to know the interest rate!
Christopher King
Dec 28, 2024, 4:05 PM
7
points
11
comments
1
min read
LW
link
How I saved 1 human life (in expectation) without overthinking it
Christopher King
Dec 22, 2024, 8:53 PM
19
points
0
comments
4
min read
LW
link
Christopher King’s Shortform
Christopher King
Dec 18, 2024, 9:02 PM
5
points
1
comment
LW
link
LDT (and everything else) can be irrational
Christopher King
Nov 6, 2024, 4:05 AM
10
points
15
comments
2
min read
LW
link
Acausal Now: We could totally acausally bargain with aliens at our current tech level if desired
Christopher King
Aug 9, 2023, 12:50 AM
1
point
5
comments
4
min read
LW
link
Necromancy’s unintended consequences.
Christopher King
Aug 9, 2023, 12:08 AM
−6
points
2
comments
2
min read
LW
link
How do low level hypotheses constrain high level ones? The mystery of the disappearing diamond.
Christopher King
Jul 11, 2023, 7:27 PM
17
points
11
comments
2
min read
LW
link
Challenge proposal: smallest possible self-hardening backdoor for RLHF
Christopher King
Jun 29, 2023, 4:56 PM
7
points
0
comments
2
min read
LW
link
Anthropically Blind: the anthropic shadow is reflectively inconsistent
Christopher King
Jun 29, 2023, 2:36 AM
43
points
40
comments
10
min read
LW
link
Solomonoff induction still works if the universe is uncomputable, and its usefulness doesn’t require knowing Occam’s razor
Christopher King
Jun 18, 2023, 1:52 AM
38
points
28
comments
4
min read
LW
link
Demystifying Born’s rule
Christopher King
Jun 14, 2023, 3:16 AM
5
points
26
comments
3
min read
LW
link
Current AI harms are also sci-fi
Christopher King
Jun 8, 2023, 5:49 PM
26
points
3
comments
1
min read
LW
link
Inference from a Mathematical Description of an Existing Alignment Research: a proposal for an outer alignment research program
Christopher King
Jun 2, 2023, 9:54 PM
7
points
4
comments
16
min read
LW
link
The unspoken but ridiculous assumption of AI doom: the hidden doom assumption
Christopher King
Jun 1, 2023, 5:01 PM
−9
points
1
comment
3
min read
LW
link
[Question]
What projects and efforts are there to promote AI safety research?
Christopher King
May 24, 2023, 12:33 AM
4
points
0
comments
1
min read
LW
link
Seeing Ghosts by GPT-4
Christopher King
May 20, 2023, 12:11 AM
−13
points
0
comments
1
min read
LW
link
We are misaligned: the saddening idea that most of humanity doesn’t intrinsically care about x-risk, even on a personal level
Christopher King
May 19, 2023, 4:12 PM
3
points
5
comments
2
min read
LW
link
Back to top
Next