Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
[Question]
Can Bayes theorem represent infinite confusion?
Yoav Ravid
Mar 22, 2019, 6:02 PM
4
points
13
comments
1
min read
LW
link
The Game Theory of Blackmail
Linda Linsefors
Mar 22, 2019, 5:44 PM
25
points
17
comments
4
min read
LW
link
New Entry at the Stanford Encyclopedia of Philosophy on the Pragmatic Theory of Truth
Iwan Danilo
Mar 22, 2019, 5:39 PM
−3
points
1
comment
LW
link
(plato.stanford.edu)
South Bay SSC Meetup
David Friedman
Mar 22, 2019, 3:10 AM
2
points
0
comments
LW
link
Retrospective on a quantitative productivity logging attempt
femtogrammar
Mar 22, 2019, 2:31 AM
25
points
5
comments
3
min read
LW
link
Declarative Mathematics
johnswentworth
Mar 21, 2019, 7:05 PM
59
points
10
comments
3
min read
LW
link
The Main Sources of AI Risk?
Daniel Kokotajlo
and
Wei Dai
Mar 21, 2019, 6:28 PM
126
points
26
comments
2
min read
LW
link
[Link] IDA 9/14: The Scheme
RAISE
Mar 21, 2019, 6:28 PM
4
points
0
comments
1
min read
LW
link
[Question]
What should we expect from GPT-3?
avturchin
Mar 21, 2019, 2:28 PM
22
points
2
comments
1
min read
LW
link
[Question] Tracking accuracy of personal forecasts
CheerfulWarrior
Mar 20, 2019, 8:39 PM
8
points
14
comments
1
min read
LW
link
Criticism catalyzes analytical thinking in groups
rayraegah
Mar 20, 2019, 4:27 PM
10
points
0
comments
1
min read
LW
link
Games in Kocherga club: Fallacymania, Tower of Chaos, Scientific Discovery
Alexander230
Mar 20, 2019, 1:52 PM
3
points
0
comments
1
min read
LW
link
Moscow LW meetup in “Nauchka” library
Alexander230
Mar 20, 2019, 1:49 PM
3
points
0
comments
1
min read
LW
link
[Question]
What’s wrong with these analogies for understanding Informed Oversight and IDA?
Wei Dai
Mar 20, 2019, 9:11 AM
35
points
3
comments
1
min read
LW
link
Alignment Newsletter #49
Rohin Shah
Mar 20, 2019, 4:20 AM
23
points
1
comment
11
min read
LW
link
(mailchi.mp)
Some thoughts after reading Artificial Intelligence: A Modern Approach
swift_spiral
Mar 19, 2019, 11:39 PM
38
points
4
comments
2
min read
LW
link
Rest Days vs Recovery Days
Unreal
Mar 19, 2019, 10:37 PM
223
points
36
comments
6
min read
LW
link
1
review
Partial preferences and models
Stuart_Armstrong
Mar 19, 2019, 4:29 PM
12
points
9
comments
2
min read
LW
link
IRL 3/8: Mitigating degeneracy: feature matching
RAISE
Mar 18, 2019, 8:15 PM
6
points
0
comments
1
min read
LW
link
(app.grasple.com)
[Question]
Is there a difference between uncertainty over your utility function and uncertainty over outcomes?
Chris_Leong
Mar 18, 2019, 6:41 PM
14
points
12
comments
1
min read
LW
link
Ideas for a fact checking widget
Yoav Ravid
Mar 18, 2019, 2:25 PM
9
points
4
comments
1
min read
LW
link
Implications of living within a Simulation
Tater
Mar 18, 2019, 6:22 AM
1
point
7
comments
2
min read
LW
link
What failure looks like
paulfchristiano
Mar 17, 2019, 8:18 PM
434
points
55
comments
8
min read
LW
link
2
reviews
Cryopreservation of Valia Zeldin
avturchin
Mar 17, 2019, 7:15 PM
19
points
0
comments
1
min read
LW
link
(medium.com)
Insights from Munkres’ Topology
Rafael Harth
Mar 17, 2019, 4:52 PM
30
points
0
comments
14
min read
LW
link
Motivational Meeting Place
Vincent B
Mar 17, 2019, 4:17 PM
8
points
1
comment
3
min read
LW
link
[Question]
Ask LW: Have you read Yudkowsky’s AI to Zombie book?
CaiwitzAzaria
Mar 17, 2019, 1:31 PM
10
points
20
comments
1
min read
LW
link
[Question]
What societies have ever had legal or accepted blackmail?
clone of saturn
Mar 17, 2019, 9:16 AM
33
points
23
comments
1
min read
LW
link
[Question]
How large is the fallout area of the biggest cobalt bomb we can build?
habryka
Mar 17, 2019, 5:50 AM
20
points
8
comments
1
min read
LW
link
A cognitive intervention for wrist pain
rmoehn
Mar 17, 2019, 5:26 AM
28
points
24
comments
6
min read
LW
link
Has “politics is the mind-killer” been a mind-killer?
SonnieBailey
Mar 17, 2019, 3:05 AM
31
points
26
comments
3
min read
LW
link
Comparison of decision theories (with a focus on logical-counterfactual decision theories)
riceissa
Mar 16, 2019, 9:15 PM
78
points
11
comments
10
min read
LW
link
Terrorism and Russell’s love of excitement
CaiwitzAzaria
Mar 16, 2019, 6:53 AM
−9
points
0
comments
1
min read
LW
link
Boeing 737 MAX MCAS as an agent corrigibility failure
Shmi
Mar 16, 2019, 1:46 AM
60
points
3
comments
1
min read
LW
link
Humans aren’t agents—what then for value learning?
Charlie Steiner
Mar 15, 2019, 10:01 PM
28
points
16
comments
3
min read
LW
link
Privacy
Zvi
Mar 15, 2019, 8:20 PM
79
points
78
comments
6
min read
LW
link
(thezvi.wordpress.com)
Active Curiosity vs Open Curiosity
Unreal
Mar 15, 2019, 4:54 PM
76
points
24
comments
3
min read
LW
link
IDA 5-8/14: Approval Directed Agents
RAISE
Mar 14, 2019, 11:58 PM
4
points
0
comments
1
min read
LW
link
(app.grasple.com)
Nashville SSC March Meetup
Dude McDude
Mar 14, 2019, 7:37 PM
1
point
0
comments
1
min read
LW
link
Risk of Mass Human Suffering / Extinction due to Climate Emergency
willfranks
Mar 14, 2019, 6:32 PM
4
points
3
comments
1
min read
LW
link
Speculations on Duo Standard
Zvi
Mar 14, 2019, 2:30 PM
9
points
2
comments
8
min read
LW
link
(thezvi.wordpress.com)
Combining individual preference utility functions
Stuart_Armstrong
Mar 14, 2019, 2:14 PM
13
points
2
comments
1
min read
LW
link
Mysteries, identity, and preferences over non-rewards
Stuart_Armstrong
Mar 14, 2019, 1:52 PM
14
points
1
comment
1
min read
LW
link
Blackmailers are privateers in the war on hypocrisy
Benquo
Mar 14, 2019, 8:13 AM
25
points
23
comments
5
min read
LW
link
(benjaminrosshoffman.com)
AI Safety Prerequisites Course: Basic abstract representations of computation
RAISE
Mar 13, 2019, 7:38 PM
28
points
2
comments
1
min read
LW
link
Question: MIRI Corrigbility Agenda
algon33
Mar 13, 2019, 7:38 PM
15
points
11
comments
1
min read
LW
link
A theory of human values
Stuart_Armstrong
Mar 13, 2019, 3:22 PM
28
points
13
comments
7
min read
LW
link
[Question]
Formalising continuous info cascades? [Info-cascade series]
Ben Pace
and
Bird Concept
13 Mar 2019 10:55 UTC
16
points
5
comments
1
min read
LW
link
[Question]
How large is the harm from info-cascades? [Info-cascade series]
Bird Concept
and
Ben Pace
13 Mar 2019 10:55 UTC
22
points
2
comments
1
min read
LW
link
[Question]
How can we respond to info-cascades? [Info-cascade series]
Bird Concept
and
Ben Pace
13 Mar 2019 10:55 UTC
14
points
12
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel