Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Let’s See You Write That Corrigibility Tag
Eliezer Yudkowsky
Jun 19, 2022, 9:11 PM
125
points
70
comments
1
min read
LW
link
Half-baked alignment idea: training to generalize
Aaron Bergman
Jun 19, 2022, 8:16 PM
10
points
2
comments
4
min read
LW
link
Where I agree and disagree with Eliezer
paulfchristiano
Jun 19, 2022, 7:15 PM
901
points
224
comments
18
min read
LW
link
2
reviews
[Question]
AI misalignment risk from GPT-like systems?
fiso64
Jun 19, 2022, 5:35 PM
10
points
8
comments
1
min read
LW
link
[Link-post] On Deference and Yudkowsky’s AI Risk Estimates
bmg
Jun 19, 2022, 5:25 PM
29
points
8
comments
1
min read
LW
link
Hebbian Learning Is More Common Than You Think
Aleksi Liimatainen
Jun 19, 2022, 3:57 PM
8
points
2
comments
1
min read
LW
link
The Malthusian Trap: An Extremely Short Introduction
Davis Kedrosky
Jun 19, 2022, 3:25 PM
5
points
0
comments
6
min read
LW
link
(daviskedrosky.substack.com)
Parliaments without the Parties
Yair Halberstadt
Jun 19, 2022, 2:06 PM
18
points
18
comments
2
min read
LW
link
Lamda is not an LLM
Kevin
Jun 19, 2022, 11:13 AM
7
points
10
comments
1
min read
LW
link
(www.wired.com)
Getting stuck in local minima
louis030195
Jun 19, 2022, 8:50 AM
3
points
1
comment
1
min read
LW
link
(brain.louis030195.com)
[Linkpost] The importance of stupidity in scientific research
Pattern
Jun 19, 2022, 5:17 AM
17
points
1
comment
1
min read
LW
link
(journals.biologists.com)
ETH is probably undervalued right now
mukashi
Jun 19, 2022, 2:20 AM
−7
points
22
comments
1
min read
LW
link
Juneberry Cake
jefftk
Jun 19, 2022, 1:40 AM
29
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Agent level parallelism
Johannes C. Mayer
Jun 18, 2022, 8:56 PM
5
points
5
comments
1
min read
LW
link
What are our outs to play to?
Hastings
Jun 18, 2022, 7:32 PM
7
points
0
comments
2
min read
LW
link
[Question]
What’s the information value of government hearings?
Kenny
Jun 18, 2022, 5:13 PM
6
points
4
comments
2
min read
LW
link
The best ‘free solo’ (rock climbing) video
Kenny
Jun 18, 2022, 3:29 PM
14
points
4
comments
2
min read
LW
link
[Question]
What’s the name of this fallacy/reasoning antipattern?
David Gross
Jun 18, 2022, 2:04 PM
9
points
6
comments
1
min read
LW
link
“Brain enthusiasts” in AI Safety
Jan
and
Samuel Nellessen
Jun 18, 2022, 9:59 AM
63
points
5
comments
10
min read
LW
link
(universalprior.substack.com)
To what extent have ideas and scientific discoveries gotten harder to find?
lsusr
Jun 18, 2022, 7:15 AM
33
points
10
comments
6
min read
LW
link
[Question]
What’s the goal in life?
Konstantin Weitz
Jun 18, 2022, 6:09 AM
5
points
6
comments
1
min read
LW
link
Can DALL-E understand simple geometry?
Isaac King
Jun 18, 2022, 4:37 AM
25
points
2
comments
1
min read
LW
link
Scott Aaronson is joining OpenAI to work on AI safety
peterbarnett
Jun 18, 2022, 4:06 AM
117
points
31
comments
1
min read
LW
link
(scottaaronson.blog)
[Question]
Why don’t we think we’re in the simplest universe with intelligent life?
ADifferentAnonymous
Jun 18, 2022, 3:05 AM
30
points
33
comments
1
min read
LW
link
Do yourself a FAVAR: security mindset
lemonhope
Jun 18, 2022, 2:08 AM
20
points
2
comments
2
min read
LW
link
Forecasting Fusion Power
Daniel Kokotajlo
Jun 18, 2022, 12:04 AM
29
points
8
comments
1
min read
LW
link
(astralcodexten.substack.com)
Pivotal outcomes and pivotal processes
Andrew_Critch
Jun 17, 2022, 11:43 PM
97
points
31
comments
4
min read
LW
link
Quantifying General Intelligence
JasonBrown
Jun 17, 2022, 9:57 PM
9
points
6
comments
13
min read
LW
link
Apply for Productivity Coaching and AI Alignment Mentorship
Nick
Jun 17, 2022, 9:36 PM
12
points
1
comment
1
min read
LW
link
Things That Make Me Enjoy Giving Career Advice
Neel Nanda
Jun 17, 2022, 8:49 PM
16
points
0
comments
9
min read
LW
link
(www.neelnanda.io)
The Unified Theory of Normative Ethics
Thane Ruthenis
Jun 17, 2022, 7:55 PM
8
points
0
comments
6
min read
LW
link
1689: Uncovering the World New Institutionalism Created
Davis Kedrosky
Jun 17, 2022, 7:32 PM
7
points
0
comments
9
min read
LW
link
(daviskedrosky.substack.com)
[Question]
Is there an unified way to make sense of ai failure modes?
walking_mushroom
Jun 17, 2022, 6:00 PM
3
points
1
comment
1
min read
LW
link
In defense of flailing, with foreword by Bill Burr
lc
Jun 17, 2022, 4:40 PM
88
points
6
comments
4
min read
LW
link
An Approach to Land Value Taxation
harsimony
Jun 17, 2022, 3:53 PM
4
points
12
comments
4
min read
LW
link
(harsimony.wordpress.com)
Value extrapolation vs Wireheading
Stuart_Armstrong
Jun 17, 2022, 3:02 PM
16
points
1
comment
1
min read
LW
link
#SAT with Tensor Networks
Adam Jermyn
Jun 17, 2022, 1:20 PM
4
points
0
comments
2
min read
LW
link
Announcing the Clearer Thinking Regrants program
spencerg
Jun 17, 2022, 1:14 PM
36
points
1
comment
1
min read
LW
link
Singapore—Small casual dinner in Chinatown #3: DALL-E 2 edition
Joe Rocca
Jun 17, 2022, 8:32 AM
2
points
2
comments
1
min read
LW
link
[Question]
Is civilizational alignment on the table?
Aleksi Liimatainen
Jun 17, 2022, 8:27 AM
5
points
1
comment
1
min read
LW
link
Apply to the Machine Learning For Good bootcamp in France
Alexandre Variengien
Jun 17, 2022, 7:32 AM
10
points
0
comments
1
min read
LW
link
What’s it like to have sex with Duncan?
Duncan Sabien (Inactive)
Jun 17, 2022, 2:32 AM
52
points
19
comments
17
min read
LW
link
wrapper-minds are the enemy
nostalgebraist
Jun 17, 2022, 1:58 AM
105
points
43
comments
8
min read
LW
link
A Litany Missing from the Canon
benwr
17 Jun 2022 1:39 UTC
39
points
3
comments
1
min read
LW
link
(www.benwr.net)
[Question]
Why did Russia invade Ukraine?
bohaska
17 Jun 2022 1:36 UTC
0
points
5
comments
1
min read
LW
link
A transparency and interpretability tech tree
evhub
16 Jun 2022 23:44 UTC
163
points
11
comments
18
min read
LW
link
1
review
BBC Future covers progress studies
jasoncrawford
16 Jun 2022 22:44 UTC
21
points
6
comments
3
min read
LW
link
(rootsofprogress.org)
Humans are very reliable agents
alyssavance
16 Jun 2022 22:02 UTC
269
points
35
comments
3
min read
LW
link
Towards Gears-Level Understanding of Agency
Thane Ruthenis
16 Jun 2022 22:00 UTC
25
points
4
comments
18
min read
LW
link
A possible AI-inoculation due to early “robot uprising”
Shmi
16 Jun 2022 21:21 UTC
16
points
2
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel