Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Catastrophic Risks from AI #6: Discussion and FAQ
Dan H
,
Mantas Mazeika
and
TW123
Jun 27, 2023, 11:23 PM
24
points
1
comment
13
min read
LW
link
(arxiv.org)
Catastrophic Risks from AI #5: Rogue AIs
Dan H
,
Mantas Mazeika
and
TW123
Jun 27, 2023, 10:06 PM
15
points
0
comments
22
min read
LW
link
(arxiv.org)
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence
Dan H
Jun 27, 2023, 5:20 PM
6
points
0
comments
LW
link
The Weight of the Future (Why The Apocalypse Can Be A Relief)
Sable
Jun 27, 2023, 5:18 PM
18
points
14
comments
3
min read
LW
link
(affablyevil.substack.com)
Aligning AI by optimizing for “wisdom”
JustinShovelain
and
Elliot Mckernon
Jun 27, 2023, 3:20 PM
28
points
8
comments
12
min read
LW
link
Freedom under Naturalistic Dualism
Arturo Macias
Jun 27, 2023, 2:34 PM
1
point
36
comments
1
min read
LW
link
(www.jneurophilosophy.com)
Munk AI debate: confusions and possible cruxes
Steven Byrnes
Jun 27, 2023, 2:18 PM
244
points
21
comments
8
min read
LW
link
Ateliers: Motivation
Stephen Fowler
Jun 27, 2023, 1:07 PM
7
points
0
comments
2
min read
LW
link
Self-Blinded Caffeine RCT
niplav
Jun 27, 2023, 12:38 PM
45
points
9
comments
8
min read
LW
link
An overview of the points system
Iknownothing
Jun 27, 2023, 9:09 AM
3
points
4
comments
1
min read
LW
link
(ai-plans.com)
AISC team report: Soft-optimization, Bayes and Goodhart
Simon Fischer
,
benjaminko
,
jazcarretao
,
DFNaiff
and
Jeremy Gillen
Jun 27, 2023, 6:05 AM
38
points
2
comments
15
min read
LW
link
Epistemic spot checking one claim in The Precipice
Isaac King
Jun 27, 2023, 1:03 AM
33
points
3
comments
1
min read
LW
link
nuclear costs are inflation
bhauth
Jun 26, 2023, 10:30 PM
8
points
42
comments
5
min read
LW
link
(www.bhauth.com)
Man in the Arena
Richard_Ngo
Jun 26, 2023, 9:57 PM
66
points
6
comments
8
min read
LW
link
Catastrophic Risks from AI #4: Organizational Risks
Dan H
,
Mantas Mazeika
and
TW123
Jun 26, 2023, 7:36 PM
23
points
0
comments
21
min read
LW
link
(arxiv.org)
The fraught voyage of aligned novelty
TsviBT
Jun 26, 2023, 7:10 PM
13
points
0
comments
17
min read
LW
link
[Question]
Deceptive AI vs. shifting instrumental incentives
Aryeh Englander
Jun 26, 2023, 6:09 PM
7
points
2
comments
3
min read
LW
link
On the Cost of Thriving Index
Zvi
Jun 26, 2023, 3:30 PM
33
points
6
comments
9
min read
LW
link
(thezvi.wordpress.com)
“Safety Culture for AI” is important, but isn’t going to be easy
Davidmanheim
Jun 26, 2023, 12:52 PM
47
points
2
comments
2
min read
LW
link
(forum.effectivealtruism.org)
Direct Preference Optimization in One Minute
lukemarks
Jun 26, 2023, 11:52 AM
22
points
3
comments
2
min read
LW
link
Self-experiment: A supraphysiological dosage of testosterone.
shapeshifter
Jun 26, 2023, 10:26 AM
8
points
3
comments
1
min read
LW
link
Confused Attractiveness
Vlad Loweren
Jun 26, 2023, 9:33 AM
8
points
5
comments
6
min read
LW
link
60+ Possible Futures
Bart Bussmann
Jun 26, 2023, 9:16 AM
93
points
18
comments
11
min read
LW
link
Bounded surprise exam paradox
cousin_it
Jun 26, 2023, 8:37 AM
29
points
5
comments
2
min read
LW
link
Model, Care, Execution
Ricki Heicklen
and
AvitalM
Jun 26, 2023, 4:05 AM
112
points
10
comments
12
min read
LW
link
1
review
(bayesshammai.substack.com)
The Fall of Rationality—The Senate of Admins
Ace Delgado
Jun 26, 2023, 1:49 AM
−10
points
0
comments
4
min read
LW
link
Another medical miracle
Dentin
Jun 25, 2023, 8:43 PM
187
points
48
comments
3
min read
LW
link
Did Bengio and Tegmark lose a debate about AI x-risk against LeCun and Mitchell?
Karl von Wendt
Jun 25, 2023, 4:59 PM
106
points
53
comments
7
min read
LW
link
AI-Plans.com—a contributable compendium
Iknownothing
Jun 25, 2023, 2:40 PM
39
points
7
comments
4
min read
LW
link
(ai-plans.com)
Map of maps of interesting fields
MaxG
Jun 25, 2023, 2:02 PM
24
points
0
comments
1
min read
LW
link
(glozematrix.substack.com)
Why am I Me?
dadadarren
Jun 25, 2023, 12:07 PM
45
points
46
comments
3
min read
LW
link
Will the growing deer prion epidemic spread to humans? Why not?
eukaryote
Jun 25, 2023, 4:31 AM
170
points
33
comments
13
min read
LW
link
(eukaryotewritesblog.com)
Crystal Healing — or the Origins of Expected Utility Maximizers
Alexander Gietelink Oldenziel
,
RP
and
Kaarel
Jun 25, 2023, 3:18 AM
50
points
11
comments
5
min read
LW
link
What’s in it for AI?
archeon
Jun 25, 2023, 1:17 AM
−20
points
0
comments
1
min read
LW
link
Lessons Learned: Properly Publicizing a Regional Meetup Event (also, last call to apply!)
Willa
Jun 25, 2023, 12:58 AM
9
points
2
comments
4
min read
LW
link
San Francisco ACX Meetup “First Saturday” July 1, 1 pm
guenael
Jun 24, 2023, 10:40 PM
2
points
0
comments
1
min read
LW
link
Correctly Calibrated Trust
habryka
Jun 24, 2023, 7:48 PM
38
points
3
comments
11
min read
LW
link
(forum.effectivealtruism.org)
Democratic AI Constitution: Round-Robin Debate and Synthesis
scottviteri
Jun 24, 2023, 7:31 PM
10
points
4
comments
5
min read
LW
link
(scottviteri.com)
DSLT 4. Phase Transitions in Neural Networks
Liam Carroll
Jun 24, 2023, 5:22 PM
30
points
3
comments
16
min read
LW
link
[Question]
Donate Now vs Donate Later—Relative Value of Donations to AI Alignment
AlignmentOptimizer
Jun 24, 2023, 5:20 PM
4
points
4
comments
1
min read
LW
link
ACX/EA Meetup Bremen
RasmusHB
Jun 24, 2023, 4:23 PM
3
points
0
comments
1
min read
LW
link
How to prevent Re-Traumatization on Meditation Retreats
EternallyBlissful
Jun 24, 2023, 2:16 PM
20
points
1
comment
5
min read
LW
link
[Question]
Can you prevent negative long-term effects of bad trips with sleep deprivation?
EternallyBlissful
Jun 24, 2023, 2:05 PM
15
points
5
comments
1
min read
LW
link
We ran a reading group on The Scout Mindset
Neil Crawford
and
andreamurillo
Jun 24, 2023, 10:10 AM
7
points
0
comments
2
min read
LW
link
Crisis Boot Camp: lessons learned and implications for EA
Nicole Ross
Jun 24, 2023, 6:28 AM
26
points
0
comments
LW
link
I just watched don’t look up.
ATheCoder
Jun 23, 2023, 9:22 PM
0
points
5
comments
2
min read
LW
link
Automatic Rate Limiting on LessWrong
Raemon
Jun 23, 2023, 8:19 PM
83
points
34
comments
8
min read
LW
link
Catastrophic Risks from AI #3: AI Race
Dan H
,
Mantas Mazeika
and
TW123
Jun 23, 2023, 7:21 PM
18
points
9
comments
29
min read
LW
link
(arxiv.org)
Write the Worst Post on LessWrong!
Johannes C. Mayer
Jun 23, 2023, 7:17 PM
−10
points
5
comments
4
min read
LW
link
Slaying the Hydra: toward a new game board for AI
Prometheus
Jun 23, 2023, 5:04 PM
0
points
5
comments
6
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel