Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Chris Voss negotiation MasterClass: review
VipulNaik
Nov 20, 2021, 10:39 PM
70
points
15
comments
33
min read
LW
link
ACX Montreal Meetup Dec 4 2021
E
Nov 20, 2021, 5:49 PM
8
points
0
comments
1
min read
LW
link
The Maker of MIND
Tomás B.
Nov 20, 2021, 4:28 PM
112
points
19
comments
11
min read
LW
link
South Bay ACX/LW Meetup—CHANGED LOCATION
IS
Nov 20, 2021, 2:42 PM
11
points
0
comments
1
min read
LW
link
The Emperor’s New Clothes: a story of motivated stupidity
David Hugh-Jones
Nov 20, 2021, 1:24 PM
10
points
5
comments
3
min read
LW
link
(wyclif.substack.com)
[Book Review] “Sorceror’s Apprentice” by Tahir Shah
lsusr
Nov 20, 2021, 11:29 AM
92
points
11
comments
7
min read
LW
link
Competence/Confidence
Duncan Sabien (Inactive)
Nov 20, 2021, 8:59 AM
60
points
19
comments
1
min read
LW
link
Awesome-github Post-Scarcity List
lorepieri
Nov 20, 2021, 8:47 AM
3
points
6
comments
1
min read
LW
link
A Certain Formalization of Corrigibility Is VNM-Incoherent
TurnTrout
Nov 20, 2021, 12:30 AM
68
points
24
comments
8
min read
LW
link
More detailed proposal for measuring alignment of current models
Beth Barnes
Nov 20, 2021, 12:03 AM
31
points
0
comments
8
min read
LW
link
Ambitious Altruistic Software Engineering Efforts: Opportunities and Benefits
ozziegooen
Nov 19, 2021, 5:55 PM
42
points
1
comment
9
min read
LW
link
(forum.effectivealtruism.org)
[Question]
Which booster shot to get and when?
NormanPerlmutter
Nov 19, 2021, 8:52 AM
22
points
17
comments
2
min read
LW
link
Goodhart: Endgame
Charlie Steiner
Nov 19, 2021, 1:26 AM
25
points
3
comments
8
min read
LW
link
Reaction and Reply to Sasha Chapin on Bad In-group Norms
Nicholas / Heather Kross
Nov 19, 2021, 1:13 AM
6
points
0
comments
3
min read
LW
link
(www.thinkingmuchbetter.com)
[Question]
Does anyone know what Marvin Minsky is talking about here?
delton137
Nov 19, 2021, 12:56 AM
1
point
6
comments
3
min read
LW
link
How To Get Into Independent Research On Alignment/Agency
johnswentworth
Nov 19, 2021, 12:00 AM
356
points
38
comments
13
min read
LW
link
2
reviews
“Acquisition of Chess Knowledge in AlphaZero”: probing AZ over time
jsd
Nov 18, 2021, 11:24 PM
11
points
9
comments
LW
link
(arxiv.org)
Ngo and Yudkowsky on AI capability gains
Eliezer Yudkowsky
and
Richard_Ngo
Nov 18, 2021, 10:19 PM
131
points
61
comments
39
min read
LW
link
1
review
Covid 11/18: Paxlovid Remains Illegal
Zvi
Nov 18, 2021, 3:50 PM
55
points
36
comments
14
min read
LW
link
(thezvi.wordpress.com)
Satisficers Tend To Seek Power: Instrumental Convergence Via Retargetability
TurnTrout
Nov 18, 2021, 1:54 AM
85
points
8
comments
17
min read
LW
link
(www.overleaf.com)
Forecasting: Zeroth and First Order
jsteinhardt
Nov 18, 2021, 1:30 AM
33
points
6
comments
5
min read
LW
link
(bounded-regret.ghost.io)
Experience on Methotrexate
jefftk
Nov 17, 2021, 10:40 PM
13
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Applications for AI Safety Camp 2022 Now Open!
adamShimi
Nov 17, 2021, 9:42 PM
47
points
3
comments
1
min read
LW
link
[Question]
Did EcoHealth create SARS-CoV-2?
jamal
Nov 17, 2021, 8:42 PM
3
points
7
comments
1
min read
LW
link
On Raising Awareness
Tomás B.
Nov 17, 2021, 5:12 PM
21
points
10
comments
3
min read
LW
link
Sasha Chapin on bad social norms in rationality/EA
Kaj_Sotala
Nov 17, 2021, 9:43 AM
51
points
22
comments
5
min read
LW
link
(sashachapin.substack.com)
[Question]
What are the mutual benefits of AGI-human collaboration that would otherwise be unobtainable?
M. Y. Zuo
Nov 17, 2021, 3:09 AM
1
point
4
comments
1
min read
LW
link
Quadratic Voting and Collusion
leogao
Nov 17, 2021, 12:19 AM
41
points
24
comments
2
min read
LW
link
Taking a simplified model
dominicq
Nov 16, 2021, 10:21 PM
9
points
8
comments
1
min read
LW
link
The Greedy Doctor Problem
Jan
Nov 16, 2021, 10:06 PM
6
points
10
comments
12
min read
LW
link
(universalprior.substack.com)
Equity premium puzzles
Ege Erdil
and
Metaculus
Nov 16, 2021, 8:50 PM
20
points
4
comments
12
min read
LW
link
(www.metaculus.com)
Why I am no longer driven
dominicq
Nov 16, 2021, 8:43 PM
71
points
16
comments
4
min read
LW
link
Super intelligent AIs that don’t require alignment
Yair Halberstadt
Nov 16, 2021, 7:55 PM
10
points
2
comments
6
min read
LW
link
Why Save The Drowning Child: Ethics Vs Theory
Raymond Douglas
Nov 16, 2021, 7:07 PM
17
points
12
comments
4
min read
LW
link
Two Stupid AI Alignment Ideas
aphyer
Nov 16, 2021, 4:13 PM
27
points
3
comments
4
min read
LW
link
[linkpost] Project Blueprint: ‘Measuring and then maximally reversing the quantified biological age of my organs’
matteodimaio
Nov 16, 2021, 2:48 AM
2
points
0
comments
1
min read
LW
link
A positive case for how we might succeed at prosaic AI alignment
evhub
Nov 16, 2021, 1:49 AM
81
points
46
comments
6
min read
LW
link
Quantilizer ≡ Optimizer with a Bounded Amount of Output
itaibn0
Nov 16, 2021, 1:03 AM
11
points
4
comments
2
min read
LW
link
D&D.Sci Dungeoncrawling: The Crown of Command Evaluation & Ruleset
aphyer
Nov 16, 2021, 12:29 AM
29
points
12
comments
9
min read
LW
link
Streaming Science on Twitch
A Ray
Nov 15, 2021, 10:24 PM
21
points
1
comment
3
min read
LW
link
Ngo and Yudkowsky on alignment difficulty
Eliezer Yudkowsky
and
Richard_Ngo
Nov 15, 2021, 8:31 PM
259
points
151
comments
99
min read
LW
link
1
review
Dan Luu on Persistent Bad Decision Making (but maybe it’s noble?)
Elizabeth
Nov 15, 2021, 8:05 PM
17
points
3
comments
1
min read
LW
link
(danluu.com)
The poetry of progress
jasoncrawford
Nov 15, 2021, 7:24 PM
51
points
6
comments
4
min read
LW
link
(rootsofprogress.org)
[Question]
Worst Commonsense Concepts?
abramdemski
15 Nov 2021 18:22 UTC
75
points
34
comments
3
min read
LW
link
My understanding of the alignment problem
danieldewey
15 Nov 2021 18:13 UTC
43
points
3
comments
3
min read
LW
link
“Summarizing Books with Human Feedback” (recursive GPT-3)
gwern
15 Nov 2021 17:41 UTC
24
points
4
comments
LW
link
(openai.com)
How Humanity Lost Control and Humans Lost Liberty: From Our Brave New World to Analogia (Sequence Introduction)
Justin Bullock
15 Nov 2021 14:22 UTC
8
points
4
comments
3
min read
LW
link
Re: Attempted Gears Analysis of AGI Intervention Discussion With Eliezer
lsusr
15 Nov 2021 10:02 UTC
20
points
8
comments
15
min read
LW
link
What the future will look like
avantika.mehra
15 Nov 2021 5:14 UTC
7
points
1
comment
3
min read
LW
link
Attempted Gears Analysis of AGI Intervention Discussion With Eliezer
Zvi
15 Nov 2021 3:50 UTC
197
points
49
comments
16
min read
LW
link
(thezvi.wordpress.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel