Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
My Overview of the AI Alignment Landscape: Threat Models
Neel Nanda
Dec 25, 2021, 11:07 PM
53
points
3
comments
28
min read
LW
link
[Question]
What is a probabilistic physical theory?
Ege Erdil
Dec 25, 2021, 4:30 PM
15
points
36
comments
2
min read
LW
link
Belief-conditional things—things that only exist when you believe in them
Jan
Dec 25, 2021, 10:49 AM
7
points
3
comments
5
min read
LW
link
(universalprior.substack.com)
Tough Choices and Disappointment
maralorn
Dec 24, 2021, 9:59 PM
2
points
6
comments
1
min read
LW
link
Converging toward a Million Worlds
Joe Kwon
Dec 24, 2021, 9:33 PM
11
points
1
comment
3
min read
LW
link
Understanding the tensor product formulation in Transformer Circuits
Tom Lieberum
Dec 24, 2021, 6:05 PM
16
points
2
comments
3
min read
LW
link
[Question]
How to select a long-term goal and align my mind towards it?
Alexander
Dec 24, 2021, 11:40 AM
19
points
8
comments
2
min read
LW
link
Prerequisite Skills
lsusr
Dec 24, 2021, 10:11 AM
17
points
3
comments
1
min read
LW
link
Mechanistic Interpretability for the MLP Layers (rough early thoughts)
MadHatter
Dec 24, 2021, 7:24 AM
12
points
3
comments
1
min read
LW
link
(www.youtube.com)
Risks from AI persuasion
Beth Barnes
Dec 24, 2021, 1:48 AM
76
points
15
comments
31
min read
LW
link
Prioritizing Information
jsteinhardt
Dec 24, 2021, 12:00 AM
18
points
0
comments
7
min read
LW
link
(bounded-regret.ghost.io)
Omicron Post #9
Zvi
Dec 23, 2021, 9:50 PM
89
points
11
comments
19
min read
LW
link
(thezvi.wordpress.com)
Reply to Eliezer on Biological Anchors
HoldenKarnofsky
Dec 23, 2021, 4:15 PM
149
points
46
comments
15
min read
LW
link
Get Set, Also Go
Zvi
Dec 23, 2021, 3:00 PM
62
points
21
comments
16
min read
LW
link
(thezvi.wordpress.com)
2021 AI Alignment Literature Review and Charity Comparison
Larks
Dec 23, 2021, 2:06 PM
168
points
28
comments
73
min read
LW
link
Testing, Testing, Hopefully
Zvi
Dec 23, 2021, 12:30 PM
41
points
8
comments
4
min read
LW
link
(thezvi.wordpress.com)
Physics Erotica
lsusr
Dec 23, 2021, 11:01 AM
7
points
12
comments
1
min read
LW
link
[Book Review] “The Most Powerful Idea in the World” by William Rosen
lsusr
Dec 23, 2021, 8:27 AM
41
points
4
comments
8
min read
LW
link
Specialization
DirectedEvolution
Dec 23, 2021, 3:23 AM
15
points
1
comment
5
min read
LW
link
Worst-case thinking in AI alignment
Buck
Dec 23, 2021, 1:29 AM
167
points
18
comments
6
min read
LW
link
2
reviews
[Question]
Hedging the Possibility of Russia invading Ukraine
Annapurna
Dec 23, 2021, 1:13 AM
27
points
8
comments
1
min read
LW
link
Gifts
George3d6
Dec 22, 2021, 11:50 PM
13
points
1
comment
9
min read
LW
link
(www.epistem.ink)
A spreadsheet/template for doing an annual review
peterslattery
Dec 22, 2021, 11:29 PM
12
points
1
comment
2
min read
LW
link
[Question]
What time in your life were you the most productive at learning and/or thinking and why?
Jack R
Dec 22, 2021, 10:56 PM
11
points
2
comments
1
min read
LW
link
Transformer Circuits
evhub
Dec 22, 2021, 9:09 PM
144
points
4
comments
3
min read
LW
link
(transformer-circuits.pub)
[Question]
Help figuring out my sexuality?
Centhart
Dec 22, 2021, 8:28 PM
13
points
13
comments
2
min read
LW
link
DnD.Sci GURPS Evaluation and Ruleset
J Bostock
Dec 22, 2021, 7:05 PM
17
points
2
comments
6
min read
LW
link
Potential gears level explanations of smooth progress
ryan_greenblatt
Dec 22, 2021, 6:05 PM
4
points
2
comments
2
min read
LW
link
Random facts can come back to bite you
tailcalled
Dec 22, 2021, 5:33 PM
70
points
7
comments
2
min read
LW
link
1
review
What’s Up With the CDC Nowcast?
Zvi
Dec 22, 2021, 1:00 PM
61
points
4
comments
5
min read
LW
link
(thezvi.wordpress.com)
Morality and constrained maximization, part 1
Joe Carlsmith
Dec 22, 2021, 8:47 AM
20
points
5
comments
11
min read
LW
link
Six Specializations Makes You World-Class
lsusr
Dec 22, 2021, 8:03 AM
53
points
23
comments
1
min read
LW
link
Worldbuilding exercise: The Highwayverse.
Yair Halberstadt
Dec 22, 2021, 6:47 AM
13
points
13
comments
11
min read
LW
link
Two (very different) kinds of donors
Duncan Sabien (Inactive)
Dec 22, 2021, 1:43 AM
102
points
19
comments
3
min read
LW
link
[Question]
Confusion about Sequences and Review Sequences
Alex_Altair
Dec 21, 2021, 6:13 PM
14
points
3
comments
1
min read
LW
link
Working through D&D.Sci, problem 1 (solution)
Pablo Repetto
Dec 21, 2021, 5:42 PM
9
points
2
comments
1
min read
LW
link
(pabloernesto.github.io)
Demanding and Designing Aligned Cognitive Architectures
Koen.Holtman
Dec 21, 2021, 5:32 PM
8
points
5
comments
5
min read
LW
link
Experiences raising children in shared housing
juliawise
Dec 21, 2021, 5:09 PM
117
points
5
comments
6
min read
LW
link
[Question]
What questions do you have about doing work on AI safety?
peterbarnett
Dec 21, 2021, 4:36 PM
13
points
8
comments
1
min read
LW
link
Perpetual Dickensian Poverty?
jefftk
Dec 21, 2021, 1:30 PM
120
points
18
comments
1
min read
LW
link
(www.jefftk.com)
On (Not) Reading Papers
Jan
Dec 21, 2021, 9:57 AM
53
points
10
comments
7
min read
LW
link
(universalprior.substack.com)
Quick Poll: Booster Reactions
Elizabeth
Dec 21, 2021, 7:40 AM
40
points
2
comments
2
min read
LW
link
(acesounderglass.com)
Book Launch: The Engines of Cognition
Ben Pace
Dec 21, 2021, 7:24 AM
174
points
56
comments
5
min read
LW
link
Researcher incentives cause smoother progress on benchmarks
ryan_greenblatt
Dec 21, 2021, 4:13 AM
20
points
4
comments
1
min read
LW
link
Omicron Post #8
Zvi
Dec 20, 2021, 11:10 PM
96
points
33
comments
16
min read
LW
link
(thezvi.wordpress.com)
[Question]
Good complete views on motivation
Valdes
Dec 20, 2021, 10:10 PM
6
points
4
comments
1
min read
LW
link
Prizes for last year’s 2019 Review
Raemon
Dec 20, 2021, 9:58 PM
40
points
0
comments
3
min read
LW
link
Omicron Paths
jefftk
Dec 20, 2021, 6:30 PM
14
points
8
comments
2
min read
LW
link
(www.jefftk.com)
[Question]
Is there a term / better way of phrasing the general case where an intervention helps certain individuals do better at zero-sum games but doesn’t provide any external value?
freedomandutility
Dec 20, 2021, 5:35 PM
4
points
8
comments
1
min read
LW
link
Bayesian Dharani, Great Dharani for Conserving Evidence
Gordon Seidoh Worley
Dec 20, 2021, 4:32 PM
9
points
5
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel