Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
Movie posters
KatjaGrace
Mar 6, 2024, 6:20 AM
40
points
0
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To
robertzk
,
Connor Kissane
,
Arthur Conmy
and
Neel Nanda
Mar 6, 2024, 5:03 AM
63
points
0
comments
12
min read
LW
link
[Question]
Does anyone know good essays on how different AI timelines will affect asset prices?
Tim Liptrot
Mar 6, 2024, 4:21 AM
8
points
2
comments
1
min read
LW
link
Twin Cities ACX Meetup—March 2024
Timothy M.
Mar 5, 2024, 9:15 PM
1
point
0
comments
1
min read
LW
link
My Clients, The Liars
ymeskhout
Mar 5, 2024, 9:06 PM
249
points
86
comments
7
min read
LW
link
If Ukraine fails, the world will reap fatal consequences
Danylo Zhyrko
Mar 5, 2024, 7:42 PM
−22
points
14
comments
5
min read
LW
link
Making Connections with ChatGPT: The Macksey Game
Bill Benzon
Mar 5, 2024, 6:15 PM
5
points
2
comments
11
min read
LW
link
[Question]
Good taxonomies of all risks (small or large) from AI?
Aryeh Englander
Mar 5, 2024, 6:15 PM
6
points
1
comment
1
min read
LW
link
[Question]
Making 2023 ACX Prediction Results Public
Legionnaire
Mar 5, 2024, 5:56 PM
3
points
9
comments
1
min read
LW
link
Social status part 2/2: everything else
Steven Byrnes
Mar 5, 2024, 4:29 PM
65
points
2
comments
23
min read
LW
link
Social status part 1/2: negotiations over object-level preferences
Steven Byrnes
Mar 5, 2024, 4:29 PM
118
points
15
comments
21
min read
LW
link
Two Tales of AI Takeover: My Doubts
Violet Hour
Mar 5, 2024, 3:51 PM
30
points
8
comments
29
min read
LW
link
Research Report: Sparse Autoencoders find only 9/180 board state features in OthelloGPT
Robert_AIZI
Mar 5, 2024, 1:55 PM
61
points
24
comments
10
min read
LW
link
(aizi.substack.com)
Read the Roon
Zvi
Mar 5, 2024, 1:50 PM
136
points
6
comments
19
min read
LW
link
(thezvi.wordpress.com)
In defense of anthropically updating EDT
Anthony DiGiovanni
Mar 5, 2024, 6:21 AM
18
points
17
comments
13
min read
LW
link
Claude Doesn’t Want to Die
garrison
Mar 5, 2024, 6:00 AM
22
points
3
comments
LW
link
(garrisonlovely.substack.com)
Many arguments for AI x-risk are wrong
TurnTrout
Mar 5, 2024, 2:31 AM
162
points
87
comments
12
min read
LW
link
Some ways of spending your time are better than others
depressurize
Mar 4, 2024, 11:21 PM
6
points
5
comments
4
min read
LW
link
Claude 3 claims it’s conscious, doesn’t want to die or be modified
Mikhail Samin
Mar 4, 2024, 11:05 PM
81
points
118
comments
14
min read
LW
link
Modifying Jones’ “AI Dilemma” Model
harsimony
Mar 4, 2024, 9:55 PM
7
points
0
comments
6
min read
LW
link
(splittinginfinity.substack.com)
Benefits of adding poison to your DMT
George3d6
Mar 4, 2024, 8:35 PM
6
points
2
comments
5
min read
LW
link
(morelucid.substack.com)
Notes on Awe
David Gross
Mar 4, 2024, 8:23 PM
20
points
1
comment
33
min read
LW
link
Boston’s Line 1
jefftk
Mar 4, 2024, 7:30 PM
12
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Anthropic release Claude 3, claims >GPT-4 Performance
LawrenceC
Mar 4, 2024, 6:23 PM
115
points
41
comments
2
min read
LW
link
(www.anthropic.com)
Anomalous Concept Detection for Detecting Hidden Cognition
Paul Colognese
Mar 4, 2024, 4:52 PM
24
points
3
comments
10
min read
LW
link
INTERVIEW: StakeOut.AI w/ Dr. Peter Park
jacobhaimes
Mar 4, 2024, 4:35 PM
6
points
0
comments
1
min read
LW
link
(into-ai-safety.github.io)
Housing Roundup #7
Zvi
Mar 4, 2024, 3:00 PM
42
points
1
comment
44
min read
LW
link
(thezvi.wordpress.com)
The Solution to Sleeping Beauty
Ape in the coat
Mar 4, 2024, 6:46 AM
18
points
77
comments
13
min read
LW
link
Are we so good to simulate?
KatjaGrace
Mar 4, 2024, 5:20 AM
38
points
24
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
The Broken Screwdriver and other parables
bhauth
Mar 4, 2024, 3:34 AM
49
points
1
comment
2
min read
LW
link
Grief is a fire sale
Nathan Young
Mar 4, 2024, 1:11 AM
77
points
1
comment
4
min read
LW
link
[Question]
Good HPMoR scenes / passages?
PhilGoetz
Mar 3, 2024, 10:42 PM
15
points
17
comments
1
min read
LW
link
Attending Sold-Out Beantown Stomp
jefftk
Mar 3, 2024, 9:30 PM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
AI things that are perhaps as important as human-controlled AI
Chi Nguyen
Mar 3, 2024, 6:07 PM
55
points
4
comments
LW
link
A tedious and effective way to learn 汉字 (Chinese characters)
dkl9
Mar 3, 2024, 4:41 PM
7
points
1
comment
2
min read
LW
link
(dkl9.net)
Some costs of superposition
Linda Linsefors
Mar 3, 2024, 4:08 PM
46
points
11
comments
3
min read
LW
link
[Question]
If you controlled the first agentic AGI, what would you set as its first task(s)?
sweenesm
Mar 3, 2024, 2:16 PM
−13
points
5
comments
2
min read
LW
link
Self-Resolving Prediction Markets
PeterMcCluskey
Mar 3, 2024, 2:39 AM
33
points
0
comments
3
min read
LW
link
(bayesianinvestor.com)
[Question]
Increase the tax value of donations with high-variance investments?
Brendan Long
Mar 3, 2024, 1:39 AM
20
points
4
comments
2
min read
LW
link
Common Philosophical Mistakes, according to Joe Schmid [videos]
DanielFilan
Mar 3, 2024, 12:15 AM
8
points
3
comments
1
min read
LW
link
(www.youtube.com)
Agreeing With Stalin in Ways That Exhibit Generally Rationalist Principles
Zack_M_Davis
Mar 2, 2024, 10:05 PM
27
points
25
comments
58
min read
LW
link
(unremediatedgender.space)
The World in 2029
Nathan Young
Mar 2, 2024, 6:03 PM
74
points
37
comments
3
min read
LW
link
The Most Dangerous Idea
rogersbacon
Mar 2, 2024, 5:53 PM
−8
points
2
comments
26
min read
LW
link
(www.secretorum.life)
Future life
DavidMadsen
2 Mar 2024 15:41 UTC
−12
points
2
comments
2
min read
LW
link
Ugo Conti’s Whistle-Controlled Synthesizer
jefftk
2 Mar 2024 2:50 UTC
15
points
1
comment
2
min read
LW
link
(www.jefftk.com)
A one-sentence formulation of the AI X-Risk argument I try to make
tcelferact
2 Mar 2024 0:44 UTC
3
points
0
comments
LW
link
If you weren’t such an idiot...
kave
and
Mark Xu
2 Mar 2024 0:01 UTC
157
points
76
comments
2
min read
LW
link
(markxu.com)
Increasing IQ is trivial
George3d6
1 Mar 2024 22:43 UTC
38
points
61
comments
6
min read
LW
link
(epistemink.substack.com)
self-fulfilling prophecies when applying for funding
Chris Lakin
1 Mar 2024 19:01 UTC
31
points
0
comments
1
min read
LW
link
(chipmonk.substack.com)
Antagonistic AI
Xybermancer
1 Mar 2024 18:50 UTC
−8
points
1
comment
1
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel