Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Luck based medicine: my resentful story of becoming a medical miracle
Elizabeth
Oct 16, 2022, 5:40 PM
489
points
121
comments
12
min read
LW
link
3
reviews
(acesounderglass.com)
Counterarguments to the basic AI x-risk case
KatjaGrace
Oct 14, 2022, 1:00 PM
371
points
124
comments
34
min read
LW
link
1
review
(aiimpacts.org)
So, geez there’s a lot of AI content these days
Raemon
Oct 6, 2022, 9:32 PM
258
points
140
comments
6
min read
LW
link
Introduction to abstract entropy
Alex_Altair
Oct 20, 2022, 9:03 PM
238
points
78
comments
18
min read
LW
link
1
review
Lessons learned from talking to >100 academics about AI safety
Marius Hobbhahn
Oct 10, 2022, 1:16 PM
216
points
18
comments
12
min read
LW
link
1
review
What does it take to defend the world against out-of-control AGIs?
Steven Byrnes
Oct 25, 2022, 2:47 PM
208
points
49
comments
30
min read
LW
link
1
review
Decision theory does not imply that we get to have nice things
So8res
Oct 18, 2022, 3:04 AM
171
points
73
comments
26
min read
LW
link
2
reviews
Six (and a half) intuitions for KL divergence
CallumMcDougall
Oct 12, 2022, 9:07 PM
170
points
27
comments
10
min read
LW
link
1
review
(www.perfectlynormal.co.uk)
The Social Recession: By the Numbers
antonomon
Oct 29, 2022, 6:45 PM
165
points
29
comments
8
min read
LW
link
(novum.substack.com)
Why I think there’s a one-in-six chance of an imminent global nuclear war
Max Tegmark
Oct 8, 2022, 6:26 AM
164
points
169
comments
4
min read
LW
link
Age changes what you care about
Dentin
Oct 16, 2022, 3:36 PM
141
points
37
comments
2
min read
LW
link
AI Timelines via Cumulative Optimization Power: Less Long, More Short
jacob_cannell
Oct 6, 2022, 12:21 AM
138
points
33
comments
6
min read
LW
link
Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley
maxnadeau
,
Xander Davies
,
Buck
and
Nate Thomas
Oct 27, 2022, 1:32 AM
135
points
14
comments
12
min read
LW
link
Don’t leave your fingerprints on the future
So8res
Oct 8, 2022, 12:35 AM
135
points
48
comments
5
min read
LW
link
Niceness is unnatural
So8res
Oct 13, 2022, 1:30 AM
134
points
20
comments
8
min read
LW
link
1
review
Warning Shots Probably Wouldn’t Change The Picture Much
So8res
Oct 6, 2022, 5:15 AM
130
points
42
comments
2
min read
LW
link
Mnestics
Jarred Filmer
Oct 23, 2022, 12:30 AM
122
points
6
comments
4
min read
LW
link
Am I secretly excited for AI getting weird?
porby
Oct 29, 2022, 10:16 PM
116
points
4
comments
4
min read
LW
link
Why Weren’t Hot Air Balloons Invented Sooner?
Lost Futures
Oct 18, 2022, 12:41 AM
115
points
52
comments
6
min read
LW
link
(lostfutures.substack.com)
Actually, All Nuclear Famine Papers are Bunk
Lao Mein
Oct 12, 2022, 5:58 AM
113
points
37
comments
2
min read
LW
link
1
review
That one apocalyptic nuclear famine paper is bunk
Lao Mein
Oct 12, 2022, 3:33 AM
110
points
10
comments
1
min read
LW
link
Plans Are Predictions, Not Optimization Targets
johnswentworth
Oct 20, 2022, 9:17 PM
108
points
20
comments
4
min read
LW
link
1
review
Consider your appetite for disagreements
Adam Zerner
Oct 8, 2022, 11:25 PM
107
points
18
comments
6
min read
LW
link
1
review
Contra shard theory, in the context of the diamond maximizer problem
So8res
Oct 13, 2022, 11:51 PM
105
points
19
comments
2
min read
LW
link
1
review
Scaling Laws for Reward Model Overoptimization
leogao
,
John Schulman
and
Jacob_Hilton
Oct 20, 2022, 12:20 AM
103
points
13
comments
1
min read
LW
link
(arxiv.org)
Alignment 201 curriculum
Richard_Ngo
Oct 12, 2022, 6:03 PM
102
points
3
comments
1
min read
LW
link
(www.agisafetyfundamentals.com)
Analysis: US restricts GPU sales to China
aog
Oct 7, 2022, 6:38 PM
102
points
58
comments
5
min read
LW
link
The Teacup Test
lsusr
Oct 8, 2022, 4:25 AM
102
points
32
comments
2
min read
LW
link
Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small
RowanWang
,
Alexandre Variengien
,
Arthur Conmy
,
Buck
and
jsteinhardt
Oct 28, 2022, 11:55 PM
101
points
9
comments
9
min read
LW
link
2
reviews
(arxiv.org)
How To Make Prediction Markets Useful For Alignment Work
johnswentworth
Oct 18, 2022, 7:01 PM
97
points
18
comments
2
min read
LW
link
A shot at the diamond-alignment problem
TurnTrout
Oct 6, 2022, 6:29 PM
95
points
67
comments
15
min read
LW
link
Transformative VR Is Likely Coming Soon
jimrandomh
Oct 13, 2022, 6:25 AM
90
points
47
comments
2
min read
LW
link
«Boundaries», Part 3a: Defining boundaries as directed Markov blankets
Andrew_Critch
Oct 30, 2022, 6:31 AM
90
points
20
comments
15
min read
LW
link
A blog post is a very long and complex search query to find fascinating people and make them route interesting stuff to your inbox
Henrik Karlsson
Oct 5, 2022, 7:07 PM
89
points
12
comments
11
min read
LW
link
(escapingflatland.substack.com)
Why Balsa Research is Worthwhile
Zvi
Oct 10, 2022, 1:50 PM
87
points
12
comments
8
min read
LW
link
(thezvi.wordpress.com)
Polysemanticity and Capacity in Neural Networks
Buck
,
Adam Jermyn
and
Kshitij Sachan
Oct 7, 2022, 5:51 PM
87
points
14
comments
3
min read
LW
link
I learn better when I frame learning as Vengeance for losses incurred through ignorance, and you might too
chaosmage
Oct 15, 2022, 12:41 PM
84
points
9
comments
3
min read
LW
link
1
review
More Recent Progress in the Theory of Neural Networks
jylin04
Oct 6, 2022, 4:57 PM
82
points
6
comments
4
min read
LW
link
Untapped Potential at 13-18
belkarx
Oct 18, 2022, 6:09 PM
82
points
53
comments
1
min read
LW
link
“Normal” is the equilibrium state of past optimization processes
Alex_Altair
Oct 30, 2022, 7:03 PM
82
points
5
comments
5
min read
LW
link
The heritability of human values: A behavior genetic critique of Shard Theory
geoffreymiller
Oct 20, 2022, 3:51 PM
82
points
63
comments
21
min read
LW
link
Paper: Discovering novel algorithms with AlphaTensor [Deepmind]
LawrenceC
Oct 5, 2022, 4:20 PM
82
points
18
comments
1
min read
LW
link
(www.deepmind.com)
Voting Theory Introduction
Scott Garrabrant
Oct 17, 2022, 8:48 AM
80
points
8
comments
6
min read
LW
link
Maximal Lotteries
Scott Garrabrant
Oct 17, 2022, 8:54 AM
77
points
11
comments
7
min read
LW
link
Response to Katja Grace’s AI x-risk counterarguments
Erik Jenner
and
Johannes Treutlein
Oct 19, 2022, 1:17 AM
77
points
18
comments
15
min read
LW
link
The “you-can-just” alarm
Emrik
Oct 8, 2022, 10:43 AM
77
points
3
comments
1
min read
LW
link
Neural Tangent Kernel Distillation
Thomas Larsen
and
Jeremy Gillen
Oct 5, 2022, 6:11 PM
76
points
20
comments
8
min read
LW
link
Open Problem in Voting Theory
Scott Garrabrant
Oct 17, 2022, 8:42 PM
75
points
16
comments
6
min read
LW
link
Wisdom Cannot Be Unzipped
Sable
Oct 22, 2022, 12:28 AM
74
points
17
comments
7
min read
LW
link
1
review
(affablyevil.substack.com)
What does it mean for an AGI to be ‘safe’?
So8res
Oct 7, 2022, 4:13 AM
74
points
29
comments
3
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel