introduction to thermal conductivity and noise management
bhauth
Mar 6, 2024, 11:14 PM
31
points
1
comment
4
min read
LW
link
(www.bhauth.com)
Essaying Other Plans
Screwtape
Mar 6, 2024, 10:59 PM
29
points
4
comments
7
min read
LW
link
Invest in ACX Grants projects!
Saul Munn
Mar 6, 2024, 8:27 PM
23
points
1
comment
LW
link
Vote on Anthropic Topics to Discuss
Ben Pace
Mar 6, 2024, 7:43 PM
75
points
55
comments
1
min read
LW
link
Simple Kelly betting in prediction markets
jessicata
Mar 6, 2024, 6:59 PM
38
points
3
comments
3
min read
LW
link
(unstablerontology.substack.com)
On Claude 3.0
Zvi
Mar 6, 2024, 6:50 PM
76
points
5
comments
31
min read
LW
link
(thezvi.wordpress.com)
[Question]
Why correlation, though?
numpyNaN
Mar 6, 2024, 4:53 PM
22
points
7
comments
1
min read
LW
link
Using axis lines for good or evil
dynomight
Mar 6, 2024, 2:47 PM
151
points
39
comments
4
min read
LW
link
(dynomight.net)
Let’s build definitely-not-conscious AI
lemonhope
Mar 6, 2024, 7:50 AM
4
points
18
comments
1
min read
LW
link
Movie posters
KatjaGrace
Mar 6, 2024, 6:20 AM
40
points
0
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To
robertzk, Connor Kissane, Arthur Conmy and Neel Nanda
Mar 6, 2024, 5:03 AM
63
points
0
comments
12
min read
LW
link
[Question]
Does anyone know good essays on how different AI timelines will affect asset prices?
Tim Liptrot
Mar 6, 2024, 4:21 AM
8
points
2
comments
1
min read
LW
link
Twin Cities ACX Meetup—March 2024
Timothy M.
Mar 5, 2024, 9:15 PM
1
point
0
comments
1
min read
LW
link
My Clients, The Liars
ymeskhout
Mar 5, 2024, 9:06 PM
247
points
86
comments
7
min read
LW
link
If Ukraine fails, the world will reap fatal consequences
Danylo Zhyrko
Mar 5, 2024, 7:42 PM
−22
points
14
comments
5
min read
LW
link
Making Connections with ChatGPT: The Macksey Game
Bill Benzon
Mar 5, 2024, 6:15 PM
5
points
2
comments
11
min read
LW
link
[Question]
Good taxonomies of all risks (small or large) from AI?
Aryeh Englander
Mar 5, 2024, 6:15 PM
6
points
1
comment
1
min read
LW
link
[Question]
Making 2023 ACX Prediction Results Public
Legionnaire
Mar 5, 2024, 5:56 PM
3
points
9
comments
1
min read
LW
link
Social status part 2/2: everything else
Steven Byrnes
Mar 5, 2024, 4:29 PM
65
points
2
comments
23
min read
LW
link
Social status part 1/2: negotiations over object-level preferences
Steven Byrnes
Mar 5, 2024, 4:29 PM
118
points
15
comments
21
min read
LW
link
Two Tales of AI Takeover: My Doubts
Violet Hour
Mar 5, 2024, 3:51 PM
30
points
8
comments
29
min read
LW
link
Research Report: Sparse Autoencoders find only 9/180 board state features in OthelloGPT
Robert_AIZI
Mar 5, 2024, 1:55 PM
61
points
24
comments
10
min read
LW
link
(aizi.substack.com)
Read the Roon
Zvi
Mar 5, 2024, 1:50 PM
136
points
6
comments
19
min read
LW
link
(thezvi.wordpress.com)
In defense of anthropically updating EDT
Anthony DiGiovanni
Mar 5, 2024, 6:21 AM
18
points
17
comments
13
min read
LW
link
Claude Doesn’t Want to Die
garrison
Mar 5, 2024, 6:00 AM
22
points
3
comments
LW
link
(garrisonlovely.substack.com)
Many arguments for AI x-risk are wrong
TurnTrout
Mar 5, 2024, 2:31 AM
162
points
87
comments
12
min read
LW
link
Some ways of spending your time are better than others
depressurize
Mar 4, 2024, 11:21 PM
6
points
5
comments
4
min read
LW
link
Claude 3 claims it’s conscious, doesn’t want to die or be modified
Mikhail Samin
Mar 4, 2024, 11:05 PM
80
points
117
comments
14
min read
LW
link
Modifying Jones’ “AI Dilemma” Model
harsimony
Mar 4, 2024, 9:55 PM
7
points
0
comments
6
min read
LW
link
(splittinginfinity.substack.com)
Benefits of adding poison to your DMT
George3d6
Mar 4, 2024, 8:35 PM
6
points
2
comments
5
min read
LW
link
(morelucid.substack.com)
Notes on Awe
David Gross
Mar 4, 2024, 8:23 PM
20
points
1
comment
33
min read
LW
link
Boston’s Line 1
jefftk
Mar 4, 2024, 7:30 PM
12
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Anthropic release Claude 3, claims >GPT-4 Performance
LawrenceC
Mar 4, 2024, 6:23 PM
115
points
41
comments
2
min read
LW
link
(www.anthropic.com)
Anomalous Concept Detection for Detecting Hidden Cognition
Paul Colognese
Mar 4, 2024, 4:52 PM
24
points
3
comments
10
min read
LW
link
INTERVIEW: StakeOut.AI w/ Dr. Peter Park
jacobhaimes
Mar 4, 2024, 4:35 PM
6
points
0
comments
1
min read
LW
link
(into-ai-safety.github.io)
Housing Roundup #7
Zvi
Mar 4, 2024, 3:00 PM
42
points
1
comment
44
min read
LW
link
(thezvi.wordpress.com)
The Solution to Sleeping Beauty
Ape in the coat
Mar 4, 2024, 6:46 AM
18
points
77
comments
13
min read
LW
link
Are we so good to simulate?
KatjaGrace
Mar 4, 2024, 5:20 AM
38
points
24
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
The Broken Screwdriver and other parables
bhauth
Mar 4, 2024, 3:34 AM
49
points
1
comment
2
min read
LW
link
Grief is a fire sale
Nathan Young
Mar 4, 2024, 1:11 AM
77
points
1
comment
4
min read
LW
link
[Question]
Good HPMoR scenes / passages?
PhilGoetz
Mar 3, 2024, 10:42 PM
15
points
17
comments
1
min read
LW
link
Attending Sold-Out Beantown Stomp
jefftk
Mar 3, 2024, 9:30 PM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
AI things that are perhaps as important as human-controlled AI
Chi Nguyen
Mar 3, 2024, 6:07 PM
55
points
4
comments
LW
link
A tedious and effective way to learn 汉字 (Chinese characters)
dkl9
Mar 3, 2024, 4:41 PM
7
points
1
comment
2
min read
LW
link
(dkl9.net)
Some costs of superposition
Linda Linsefors
Mar 3, 2024, 4:08 PM
46
points
11
comments
3
min read
LW
link
[Question]
If you controlled the first agentic AGI, what would you set as its first task(s)?
sweenesm
Mar 3, 2024, 2:16 PM
−13
points
5
comments
2
min read
LW
link
Self-Resolving Prediction Markets
PeterMcCluskey
Mar 3, 2024, 2:39 AM
33
points
0
comments
3
min read
LW
link
(bayesianinvestor.com)
[Question]
Increase the tax value of donations with high-variance investments?
Brendan Long
Mar 3, 2024, 1:39 AM
20
points
4
comments
2
min read
LW
link
Common Philosophical Mistakes, according to Joe Schmid [videos]
DanielFilan
Mar 3, 2024, 12:15 AM
8
points
3
comments
1
min read
LW
link
(www.youtube.com)
Agreeing With Stalin in Ways That Exhibit Generally Rationalist Principles
Zack_M_Davis
Mar 2, 2024, 10:05 PM
30
points
25
comments
58
min read
LW
link
(unremediatedgender.space)