Distributed Strategic Epistemology
StrivingForLegibility · Dec 28, 2023, 10:12 PM · 11 points · 0 comments · 3 min read
Building Trust in Strategic Settings
StrivingForLegibility · Dec 28, 2023, 10:12 PM · 24 points · 0 comments · 7 min read
An Ontology for Strategic Epistemology
StrivingForLegibility · Dec 28, 2023, 10:11 PM · 9 points · 0 comments · 5 min read
AI Institution Design Hackathon (EAG Bay Area Satellite Event)
beatrice@foresight.org and Allison Duettmann · Dec 28, 2023, 7:46 PM · 1 point · 0 comments · 1 min read
Psychology of AI doomers and AI optimists
Igor Ivanov · Dec 28, 2023, 5:55 PM · 3 points · 0 comments · 22 min read
Evening meal or drinks followed by techno rave
yakimoff · Dec 28, 2023, 3:08 PM · 3 points · 0 comments · 1 min read
AI #44: Copyright Confrontation
Zvi · Dec 28, 2023, 2:30 PM · 54 points · 13 comments · 43 min read · (thezvi.wordpress.com)
How to develop a photographic memory 1/3
PhilosophicalSoul · Dec 28, 2023, 1:26 PM · 34 points · 6 comments · 14 min read
Gunpowder as metaphor for AI
Nathan Helm-Burger · Dec 28, 2023, 4:31 AM · 14 points · 0 comments · 2 min read
E.T. Jaynes Probability Theory: The logic of Science I
Jan Christian Refsgaard and dentalperson · Dec 27, 2023, 11:47 PM · 63 points · 20 comments · 21 min read
Free agents
Michele Campolo · Dec 27, 2023, 8:20 PM · 6 points · 19 comments · 13 min read
Merry Christmas Everyone!
johnlawrenceaspden · Dec 27, 2023, 7:49 PM · 14 points · 1 comment · 1 min read
Natural Latents: The Math
johnswentworth and David Lorell · Dec 27, 2023, 7:03 PM · 129 points · 41 comments · 12 min read · 2 reviews
NYT is suing OpenAI&Microsoft for alleged copyright infringement; some quick thoughts
Mikhail Samin · Dec 27, 2023, 6:44 PM · 42 points · 17 comments · 1 min read
Extropy magazine review
Peter lawless · Dec 27, 2023, 6:37 PM · 3 points · 0 comments · 1 min read
The Progress Paradox
Ben Turtel · Dec 27, 2023, 6:26 PM · 3 points · 3 comments · 4 min read · (bturtel.substack.com)
The virtuous circle: twelve conjectures about female reproductive agency and cultural self-determination
Miles Saltiel · Dec 27, 2023, 6:25 PM · 0 points · 2 comments · 14 min read
MSP Article Discussion Meetup: The EMH, Long-Term Investing, and Leveraged ETFs
25Hour · Dec 27, 2023, 4:50 PM · 3 points · 1 comment · 1 min read
In Defense of Epistemic Empathy
Kevin Dorst · Dec 27, 2023, 4:27 PM · 60 points · 19 comments · 6 min read · (kevindorst.substack.com)
Critical review of Christiano’s disagreements with Yudkowsky
Vanessa Kosoy · Dec 27, 2023, 4:02 PM · 176 points · 40 comments · 15 min read
AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them
Roman Leventov · Dec 27, 2023, 2:51 PM · 33 points · 9 comments · 4 min read
5. Moral Value for Sentient Animals? Alas, Not Yet
RogerDearnaley · Dec 27, 2023, 6:42 AM · 33 points · 41 comments · 23 min read
Differential Optimization Reframes and Generalizes Utility-Maximization
J Bostock · Dec 27, 2023, 1:54 AM · 30 points · 2 comments · 3 min read
More Thoughts on the Human-AGI War
Seth Ahrenbach · Dec 27, 2023, 1:03 AM · −3 points · 4 comments · 7 min read
METR is hiring!
Beth Barnes · Dec 26, 2023, 9:00 PM · 65 points · 1 comment · 1 min read
Environmental allergies are curable? (Sublingual immunotherapy)
Chipmonk · Dec 26, 2023, 7:05 PM · 47 points · 10 comments · 1 min read
Picasso in the Gallery of Babel
samhealy · Dec 26, 2023, 4:25 PM · 12 points · 12 comments · 4 min read
Flagging Potentially Unfair Parenting
jefftk · Dec 26, 2023, 12:40 PM · 69 points · 1 comment · 1 min read · (www.jefftk.com)
Link Collection: Impact Markets
Saul Munn · Dec 26, 2023, 9:01 AM · 27 points · 0 comments · 2 min read · (www.brasstacks.blog)
How Emergency Medicine Solves the Alignment Problem
StrivingForLegibility · Dec 26, 2023, 5:24 AM · 41 points · 4 comments · 6 min read
Rationality outreach vs. rationality teaching
Lenmar · Dec 26, 2023, 12:37 AM · 7 points · 2 comments · 1 min read
Exploring the Residual Stream of Transformers for Mechanistic Interpretability — Explained
Zeping Yu · Dec 26, 2023, 12:36 AM · 7 points · 1 comment · 11 min read
[Question] Anki setup best practices?
Sinclair Chen · Dec 25, 2023, 10:34 PM · 11 points · 4 comments · 1 min read
[Question] Why does expected utility matter?
Marco Discendenti · Dec 25, 2023, 2:47 PM · 18 points · 21 comments · 4 min read
Freeze Dried Raspberry Truffles
jefftk · Dec 25, 2023, 2:10 PM · 14 points · 0 comments · 1 min read · (www.jefftk.com)
Pornographic and semi-pornographic ads on mainstream websites as an instance of the AI alignment problem?
greenrd · Dec 25, 2023, 1:19 PM · −1 points · 5 comments · 12 min read
Defense Against The Dark Arts: An Introduction
Lyrongolem · Dec 25, 2023, 6:36 AM · 24 points · 36 comments · 20 min read
Occlusions of Moral Knowledge
herschel · Dec 25, 2023, 5:55 AM · −1 points · 0 comments · 2 min read · (brothernin.substack.com)
[Question] Would you have a baby in 2024?
martinkunev · Dec 25, 2023, 1:52 AM · 24 points · 76 comments · 1 min read
align your latent spaces
bhauth · Dec 24, 2023, 4:30 PM · 27 points · 8 comments · 2 min read · (www.bhauth.com)
Viral Guessing Game
jefftk · Dec 24, 2023, 1:10 PM · 19 points · 0 comments · 1 min read · (www.jefftk.com)
The Sugar Alignment Problem
Adam Zerner · Dec 24, 2023, 1:35 AM · 5 points · 3 comments · 7 min read
A Crisper Explanation of Simulacrum Levels
Thane Ruthenis · Dec 23, 2023, 10:13 PM · 92 points · 13 comments · 13 min read
Hyperbolic Discounting and Pascal’s Mugging
Andrew Keenan Richardson · Dec 23, 2023, 9:55 PM · 9 points · 0 comments · 7 min read
AISN #28: Center for AI Safety 2023 Year in Review
Dan H · 23 Dec 2023 21:31 UTC · 30 points · 1 comment · 5 min read · (newsletter.safe.ai)
“Inftoxicity” and other new words to describe malicious information and communication thereof
Jáchym Fibír · 23 Dec 2023 18:15 UTC · −1 points · 6 comments · 3 min read
AI’s impact on biology research: Part I, today
octopocta · 23 Dec 2023 16:29 UTC · 31 points · 6 comments · 2 min read
AI Girlfriends Won’t Matter Much
Maxwell Tabarrok · 23 Dec 2023 15:58 UTC · 42 points · 22 comments · 2 min read · (maximumprogress.substack.com)
The Next Right Token
jefftk · 23 Dec 2023 3:20 UTC · 14 points · 0 comments · 1 min read · (www.jefftk.com)
Fact Finding: Do Early Layers Specialise in Local Processing? (Post 5)
Neel Nanda, Senthooran Rajamanoharan, János Kramár and Rohin Shah · 23 Dec 2023 2:46 UTC · 18 points · 0 comments · 4 min read