Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Exploring toy neural nets under node removal. Section 1.
Donald Hobson
Apr 13, 2022, 11:30 PM
12
points
7
comments
8
min read
LW
link
Make a Movie Showing Alignment Failures
Logan Riggs
Apr 13, 2022, 9:54 PM
75
points
11
comments
2
min read
LW
link
Summary: “How to Do Research” by OSP’s Red
Pablo Repetto
Apr 13, 2022, 7:46 PM
9
points
0
comments
3
min read
LW
link
(pabloernesto.github.io)
A Quick Guide to Confronting Doom
Ruby
Apr 13, 2022, 7:30 PM
243
points
33
comments
2
min read
LW
link
Design, Implement and Verify
rwallace
Apr 13, 2022, 6:14 PM
32
points
13
comments
4
min read
LW
link
Takeoff speeds have a huge effect on what it means to work on AI x-risk
Buck
Apr 13, 2022, 5:38 PM
139
points
27
comments
2
min read
LW
link
2
reviews
Budapest Meetup
Richard Horvath
Apr 13, 2022, 5:23 PM
2
points
0
comments
1
min read
LW
link
[Question]
What to include in a guest lecture on existential risks from AI?
Aryeh Englander
Apr 13, 2022, 5:03 PM
20
points
9
comments
1
min read
LW
link
Common Knowledge is a Circle Game for Toddlers
ryan_b
Apr 13, 2022, 3:24 PM
58
points
1
comment
1
min read
LW
link
Another list of theories of impact for interpretability
Beth Barnes
Apr 13, 2022, 1:29 PM
33
points
1
comment
5
min read
LW
link
The Cage of the Language
Martin Sustrik
Apr 13, 2022, 5:20 AM
54
points
19
comments
2
min read
LW
link
[Question]
What’s a good probability distribution family (e.g. “log-normal”) to use for AGI timelines?
David Scott Krueger (formerly: capybaralet)
Apr 13, 2022, 4:45 AM
9
points
11
comments
1
min read
LW
link
How dath ilan coordinates around solving alignment
Thomas Kwa
Apr 13, 2022, 4:22 AM
65
points
46
comments
5
min read
LW
link
What more compute does for brain-like models: response to Rohin
Nathan Helm-Burger
Apr 13, 2022, 3:40 AM
24
points
14
comments
12
min read
LW
link
[Question]
“Fragility of Value” vs. LLMs
Not Relevant
Apr 13, 2022, 2:02 AM
34
points
33
comments
1
min read
LW
link
Commensurable Scientific Paradigms; or, computable induction
samshap
Apr 13, 2022, 12:01 AM
14
points
0
comments
5
min read
LW
link
Convincing People of Alignment with Street Epistemology
Logan Riggs
Apr 12, 2022, 11:43 PM
54
points
4
comments
3
min read
LW
link
Useful Vices for Wicked Problems
HoldenKarnofsky
Apr 12, 2022, 7:30 PM
76
points
2
comments
17
min read
LW
link
1
review
(www.cold-takes.com)
SSC/ACX, San Diego, Schelling Point, Meetups Everywhere
CitizenTen
Apr 12, 2022, 6:50 PM
2
points
0
comments
1
min read
LW
link
SSC/ACX San Diego Rock Climbing
CitizenTen
Apr 12, 2022, 6:46 PM
2
points
0
comments
1
min read
LW
link
[Question]
Does the rationalist community have a membership funnel?
Alex_Altair
Apr 12, 2022, 6:44 PM
38
points
17
comments
1
min read
LW
link
A Small Negative Result on Debate
Sam Bowman
Apr 12, 2022, 6:19 PM
42
points
11
comments
1
min read
LW
link
US Taxes: Adjust Withholding When Donating?
jefftk
Apr 12, 2022, 3:50 PM
15
points
1
comment
1
min read
LW
link
(www.jefftk.com)
Introducing Effective Self-Help
Ben Williamson
Apr 12, 2022, 3:01 PM
19
points
0
comments
16
min read
LW
link
Ukraine Post #10: Next Phase
Zvi
Apr 12, 2022, 1:40 PM
47
points
13
comments
14
min read
LW
link
(thezvi.wordpress.com)
Is technical AI alignment research a net positive?
cranberry_bear
Apr 12, 2022, 1:07 PM
6
points
2
comments
2
min read
LW
link
[Question]
What is your advice for elder care, particularly taking care of dementia patients?
RasmusHB
Apr 12, 2022, 11:33 AM
4
points
6
comments
1
min read
LW
link
Reward model hacking as a challenge for reward learning
Erik Jenner
Apr 12, 2022, 9:39 AM
25
points
1
comment
9
min read
LW
link
How I use Anki: expanding the scope of SRS
CallumMcDougall
Apr 12, 2022, 8:28 AM
37
points
8
comments
19
min read
LW
link
[Question]
What do you think will most probably happen to our consciousness when our simulation ends?
ArtMi
Apr 12, 2022, 8:23 AM
1
point
5
comments
1
min read
LW
link
Favorites & Performers
Soma
Apr 12, 2022, 5:50 AM
9
points
0
comments
1
min read
LW
link
A broad basin of attraction around human values?
Wei Dai
Apr 12, 2022, 5:15 AM
114
points
18
comments
2
min read
LW
link
AI governance student hackathon on Saturday, April 23: register now!
mic
Apr 12, 2022, 4:48 AM
14
points
0
comments
1
min read
LW
link
The Platonist’s Dilemma: A Remix on the Prisoner’s.
James Camacho
Apr 12, 2022, 3:49 AM
7
points
2
comments
5
min read
LW
link
[Question]
Three questions about mesa-optimizers
Eric Neyman
Apr 12, 2022, 2:58 AM
26
points
5
comments
3
min read
LW
link
The Amish
PeterMcCluskey
Apr 12, 2022, 2:54 AM
49
points
5
comments
6
min read
LW
link
(www.bayesianinvestor.com)
Rationalist Should Win. Not Dying with Dignity and Funding WBE.
CitizenTen
Apr 12, 2022, 2:14 AM
32
points
15
comments
5
min read
LW
link
[Question]
How can I determine that Elicit is not some weak AGI’s attempt at taking over the world ?
Lucie Philippon
Apr 12, 2022, 12:54 AM
5
points
3
comments
1
min read
LW
link
Summary: “How to Write Quickly...” by John Wentworth
Pablo Repetto
Apr 11, 2022, 11:26 PM
4
points
0
comments
2
min read
LW
link
(pabloernesto.github.io)
Rambling thoughts on having multiple selves
cranberry_bear
Apr 11, 2022, 10:43 PM
15
points
1
comment
3
min read
LW
link
An AI-in-a-box success model
azsantosk
Apr 11, 2022, 10:28 PM
16
points
1
comment
10
min read
LW
link
The Regulatory Option: A response to near 0% survival odds
Matthew Lowenstein
Apr 11, 2022, 10:00 PM
46
points
21
comments
6
min read
LW
link
The Efficient LessWrong Hypothesis—Stock Investing Competition
MrThink
Apr 11, 2022, 8:43 PM
30
points
35
comments
2
min read
LW
link
Review: Structure and Interpretation of Computer Programs
L Rudolf L
11 Apr 2022 20:27 UTC
17
points
9
comments
10
min read
LW
link
(www.strataoftheworld.com)
[Question]
Underappreciated content on LessWrong
Ege Erdil
11 Apr 2022 17:40 UTC
22
points
5
comments
1
min read
LW
link
Editing Advice for LessWrong Users
JustisMills
11 Apr 2022 16:32 UTC
234
points
14
comments
6
min read
LW
link
1
review
Post-history is written by the martyrs
Veedrac
11 Apr 2022 15:45 UTC
50
points
2
comments
19
min read
LW
link
(www.royalroad.com)
What Chords Do You Need?
jefftk
11 Apr 2022 15:00 UTC
11
points
0
comments
3
min read
LW
link
(www.jefftk.com)
What can people not smart/technical/”competent” enough for AI research/AI risk work do to reduce AI-risk/maximize AI safety? (which is most people?)
Alex K. Chen (parrot)
11 Apr 2022 14:05 UTC
7
points
3
comments
3
min read
LW
link
Goodhart’s Law Causal Diagrams
JustinShovelain
and
Jeremy Gillen
11 Apr 2022 13:52 UTC
35
points
6
comments
6
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel