Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Three camps in AI x-risk discussions: My personal very oversimplified overview
Aryeh Englander
Jul 4, 2023, 8:42 PM
21
points
0
comments
LW
link
Six (and a half) intuitions for SVD
CallumMcDougall
Jul 4, 2023, 7:23 PM
71
points
1
comment
1
min read
LW
link
Animal Weapons: Lessons for Humans in the Age of X-Risk
Damin Curtis
Jul 4, 2023, 6:14 PM
4
points
0
comments
10
min read
LW
link
Apocalypse Prepping—Concise SHTF guide to prepare for AGI doomsday
prepper
Jul 4, 2023, 5:41 PM
−7
points
9
comments
1
min read
LW
link
(prepper.i2phides.me)
Ways I Expect AI Regulation To Increase Extinction Risk
1a3orn
Jul 4, 2023, 5:32 PM
226
points
32
comments
7
min read
LW
link
AI labs’ statements on governance
Zach Stein-Perlman
Jul 4, 2023, 4:30 PM
30
points
0
comments
36
min read
LW
link
AIs teams will probably be more superintelligent than individual AIs
Robert_AIZI
Jul 4, 2023, 2:06 PM
3
points
1
comment
2
min read
LW
link
(aizi.substack.com)
What I Think About When I Think About History
Jacob G-W
Jul 4, 2023, 2:02 PM
3
points
4
comments
3
min read
LW
link
(g-w1.github.io)
My Time As A Goddess
Evenstar
Jul 4, 2023, 1:14 PM
30
points
5
comments
6
min read
LW
link
Twitter Twitches
Zvi
Jul 4, 2023, 1:00 PM
34
points
9
comments
7
min read
LW
link
(thezvi.wordpress.com)
Rational Unilateralists Aren’t So Cursed
SCP
Jul 4, 2023, 12:19 PM
56
points
6
comments
6
min read
LW
link
1
review
[Question]
The literature on aluminum adjuvants is very suspicious. Small IQ tax is plausible—can any experts help me estimate it?
mikes
Jul 4, 2023, 9:33 AM
61
points
39
comments
3
min read
LW
link
Two Percolation Puzzles
Adam Scherlis
Jul 4, 2023, 5:34 AM
43
points
14
comments
1
min read
LW
link
(adam.scherlis.com)
Mechanistic Interpretability is Being Pursued for the Wrong Reasons
Cole Wyeth
Jul 4, 2023, 2:17 AM
13
points
0
comments
7
min read
LW
link
(colewyeth.com)
Should you announce your bets publicly?
Ege Erdil
Jul 4, 2023, 12:11 AM
28
points
1
comment
4
min read
LW
link
Ten Levels of AI Alignment Difficulty
Sammy Martin
Jul 3, 2023, 8:20 PM
138
points
24
comments
12
min read
LW
link
1
review
Security, Cryptograhy AI Workshop in SF
Allison Duettmann
Jul 3, 2023, 7:01 PM
7
points
0
comments
1
min read
LW
link
[Question]
What in your opinion is the biggest open problem in AI alignment?
tailcalled
Jul 3, 2023, 4:34 PM
39
points
35
comments
1
min read
LW
link
A Subtle Selection Effect in Overconfidence Studies
Kevin Dorst
Jul 3, 2023, 2:43 PM
24
points
0
comments
6
min read
LW
link
(kevindorst.substack.com)
Monthly Roundup #8: July 2023
Zvi
Jul 3, 2023, 1:20 PM
40
points
4
comments
46
min read
LW
link
(thezvi.wordpress.com)
Complex Signs Bad
Evenstar
Jul 3, 2023, 1:09 PM
5
points
2
comments
3
min read
LW
link
6/23
Celer
Jul 3, 2023, 6:30 AM
8
points
0
comments
10
min read
LW
link
(keller.substack.com)
Marginal charity
Pat Myron
Jul 3, 2023, 2:13 AM
3
points
1
comment
LW
link
My Central Alignment Priority (2 July 2023)
Nicholas / Heather Kross
Jul 3, 2023, 1:46 AM
12
points
1
comment
3
min read
LW
link
My Alignment Timeline
Nicholas / Heather Kross
Jul 3, 2023, 1:04 AM
22
points
0
comments
2
min read
LW
link
Douglas Hofstadter changes his mind on Deep Learning & AI risk (June 2023)?
gwern
Jul 3, 2023, 12:48 AM
426
points
54
comments
7
min read
LW
link
(www.youtube.com)
Frames in context
Richard_Ngo
Jul 3, 2023, 12:38 AM
39
points
9
comments
6
min read
LW
link
Meta-rationality and frames
Richard_Ngo
Jul 3, 2023, 12:33 AM
64
points
2
comments
5
min read
LW
link
VC Theory Overview
Joar Skalse
Jul 2, 2023, 10:45 PM
12
points
2
comments
11
min read
LW
link
Sources of evidence in Alignment
Martín Soto
Jul 2, 2023, 8:38 PM
20
points
0
comments
11
min read
LW
link
Quantitative cruxes in Alignment
Martín Soto
Jul 2, 2023, 8:38 PM
19
points
0
comments
23
min read
LW
link
Going Crazy and Getting Better Again
Evenstar
Jul 2, 2023, 6:55 PM
139
points
13
comments
7
min read
LW
link
1
review
Shall We Throw A Huge Party Before AGI Bids Us Adieu?
GeorgeMan
Jul 2, 2023, 5:56 PM
−1
points
6
comments
1
min read
LW
link
Why it’s so hard to talk about Consciousness
Rafael Harth
Jul 2, 2023, 3:56 PM
167
points
215
comments
9
min read
LW
link
3
reviews
How Smart Are Humans?
Joar Skalse
Jul 2, 2023, 3:46 PM
10
points
19
comments
2
min read
LW
link
Through a panel, darkly: a case study in internet BS detection
jchan
Jul 2, 2023, 1:40 PM
22
points
7
comments
3
min read
LW
link
LLMs, Batches, and Emergent Episodic Memory
Lao Mein
Jul 2, 2023, 7:55 AM
5
points
4
comments
1
min read
LW
link
Negativity enhances positivity
Adam Zerner
Jul 2, 2023, 2:47 AM
12
points
7
comments
2
min read
LW
link
faster latent diffusion
bhauth
Jul 2, 2023, 1:30 AM
10
points
8
comments
2
min read
LW
link
(www.bhauth.com)
Using (Uninterpretable) LLMs to Generate Interpretable AI Code
Joar Skalse
Jul 2, 2023, 1:01 AM
13
points
12
comments
3
min read
LW
link
Grant applications and grand narratives
Elizabeth
Jul 2, 2023, 12:16 AM
191
points
22
comments
6
min read
LW
link
An Introduction, an Overview of my personal resources, and how one might make use of them
ProofBySonnet
Jul 1, 2023, 9:00 PM
4
points
6
comments
3
min read
LW
link
My “2.9 trauma limit”
Raemon
Jul 1, 2023, 7:32 PM
198
points
31
comments
7
min read
LW
link
Alpha
Erich_Grunewald
Jul 1, 2023, 4:05 PM
65
points
2
comments
14
min read
LW
link
(www.erichgrunewald.com)
Forum Karma: view stats and find highly-rated comments for any LW user
Max H
1 Jul 2023 15:36 UTC
60
points
16
comments
2
min read
LW
link
(forumkarma.com)
[ASoT] GPT2 Steering & The Tuned Lens
Ulisse Mini
1 Jul 2023 14:12 UTC
23
points
0
comments
2
min read
LW
link
[Linkpost] A shared linguistic space for transmitting our thoughts from brain to brain in natural conversations
Bogdan Ionut Cirstea
1 Jul 2023 13:57 UTC
17
points
2
comments
1
min read
LW
link
Elements of Computational Philosophy, Vol. I: Truth
Paul Bricman
and
Tom Feeney
1 Jul 2023 11:44 UTC
12
points
6
comments
1
min read
LW
link
(compphil.github.io)
Micro Habits that Improve One’s Day
silentbob
1 Jul 2023 10:53 UTC
64
points
9
comments
5
min read
LW
link
Ateliers: But what is an Atelier?
Stephen Fowler
1 Jul 2023 5:57 UTC
4
points
2
comments
10
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel