Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
Page
1
Rationalist Storytelling (French)
Camille Berger
Feb 19, 2024, 10:25 PM
3
points
0
comments
1
min read
LW
link
Abs-E (or, speak only in the positive)
dkl9
Feb 19, 2024, 9:14 PM
29
points
24
comments
2
min read
LW
link
(dkl9.net)
Retirement Accounts and Short Timelines
jefftk
Feb 19, 2024, 6:50 PM
83
points
35
comments
2
min read
LW
link
(www.jefftk.com)
How Technical AI Safety Researchers Can Help Implement Punitive Damages to Mitigate Catastrophic AI Risk
Gabriel Weil
Feb 19, 2024, 6:00 PM
18
points
0
comments
4
min read
LW
link
Protocol evaluations: good analogies vs control
Fabien Roger
Feb 19, 2024, 6:00 PM
42
points
10
comments
11
min read
LW
link
When Should Copyright Get Shorter?
Maxwell Tabarrok
Feb 19, 2024, 4:03 PM
11
points
14
comments
4
min read
LW
link
(www.maximum-progress.com)
Auto-matching hidden layers in Pytorch LLMs
chanind
Feb 19, 2024, 12:40 PM
2
points
0
comments
3
min read
LW
link
I’d also take $7 trillion
bhauth
Feb 19, 2024, 3:31 AM
47
points
12
comments
10
min read
LW
link
(www.bhauth.com)
On coincidences and Bayesian reasoning, as applied to the origins of COVID-19
viking_math
Feb 19, 2024, 1:14 AM
62
points
28
comments
14
min read
LW
link
Solution to the two envelopes problem for moral weights
MichaelStJules
Feb 19, 2024, 12:15 AM
9
points
1
comment
LW
link
Conspiracy Investigation Done Right
ymeskhout
Feb 19, 2024, 12:09 AM
24
points
0
comments
6
min read
LW
link
Scientific Method
Andrij “Androniq” Ghorbunov
Feb 18, 2024, 9:06 PM
24
points
4
comments
30
min read
LW
link
[Question]
Weighing reputational and moral consequences of leaving Russia or staying
spza
Feb 18, 2024, 7:36 PM
29
points
24
comments
1
min read
LW
link
Things I’ve Grieved
Raemon
Feb 18, 2024, 7:32 PM
125
points
6
comments
2
min read
LW
link
Senses of “knowing” a person
dkl9
Feb 18, 2024, 7:13 PM
3
points
0
comments
1
min read
LW
link
(dkl9.net)
The Jolly Green Giant Chronicles [ChatGPT]
Bill Benzon
Feb 18, 2024, 5:28 PM
4
points
0
comments
8
min read
LW
link
Intuition for 1 + 2 + 3 + … = −1/12
Shankar Sivarajan
Feb 18, 2024, 4:46 PM
18
points
28
comments
3
min read
LW
link
No Clickbait—Misalignment Database
Kabir Kumar
Feb 18, 2024, 5:35 AM
6
points
10
comments
1
min read
LW
link
Idea: NV⁻ Centers for Brain Interpretability
James Camacho
Feb 18, 2024, 5:28 AM
6
points
1
comment
3
min read
LW
link
Celiacs don’t need to live in fear
Jarrah
Feb 18, 2024, 2:30 AM
16
points
4
comments
4
min read
LW
link
“What if we could redesign society from scratch? The promise of charter cities.” [Rational Animations video]
Jackson Wagner
Feb 18, 2024, 12:57 AM
40
points
7
comments
LW
link
(www.youtube.com)
Evaluating Solar
jefftk
Feb 17, 2024, 9:50 PM
26
points
5
comments
2
min read
LW
link
(www.jefftk.com)
Opinions survey 2 (with rationalism score at the end)
tailcalled
Feb 17, 2024, 12:03 PM
2
points
11
comments
1
min read
LW
link
(docs.google.com)
Achieving AI Alignment through Deliberate Uncertainty in Multiagent Systems
Florian_Dietz
Feb 17, 2024, 8:45 AM
4
points
0
comments
13
min read
LW
link
Communication, consciousness, and belief strength measures
Jakub Smékal
Feb 17, 2024, 5:45 AM
1
point
0
comments
3
min read
LW
link
San Fernando Valley Rationality: February 22, 2024
Thomas Broadley
Feb 17, 2024, 1:58 AM
3
points
0
comments
1
min read
LW
link
Self-Awareness: Taxonomy and eval suite proposal
Daniel Kokotajlo
Feb 17, 2024, 1:47 AM
65
points
2
comments
11
min read
LW
link
Opinions survey (with rationalism score at the end)
tailcalled
Feb 17, 2024, 12:41 AM
8
points
14
comments
1
min read
LW
link
(docs.google.com)
Phallocentricity in GPT-J’s bizarre stratified ontology
mwatkins
Feb 17, 2024, 12:16 AM
56
points
37
comments
9
min read
LW
link
FUTARCHY NOW BABY
sapphire
Feb 17, 2024, 12:03 AM
−1
points
7
comments
1
min read
LW
link
Making the “stance” explicit
NicholasKees
Feb 16, 2024, 11:57 PM
23
points
3
comments
2
min read
LW
link
2023 Survey Results
Screwtape
Feb 16, 2024, 10:24 PM
150
points
26
comments
44
min read
LW
link
Physics-based early warning signal shows that AMOC is on tipping course
Annapurna
Feb 16, 2024, 10:07 PM
19
points
3
comments
1
min read
LW
link
(www.science.org)
Kingfisher Winter Tour 2024
jefftk
Feb 16, 2024, 9:40 PM
8
points
0
comments
1
min read
LW
link
(www.jefftk.com)
The Pointer Resolution Problem
Jozdien
Feb 16, 2024, 9:25 PM
41
points
20
comments
3
min read
LW
link
Every “Every Bay Area House Party” Bay Area House Party
Richard_Ngo
Feb 16, 2024, 6:53 PM
181
points
6
comments
4
min read
LW
link
“No-one in my org puts money in their pension”
Tobes
Feb 16, 2024, 6:33 PM
272
points
16
comments
9
min read
LW
link
(seekingtobejolly.substack.com)
Addressing Feature Suppression in SAEs
Benjamin Wright
and
Lee Sharkey
Feb 16, 2024, 6:32 PM
86
points
4
comments
10
min read
LW
link
Retrospective: PIBBSS Fellowship 2023
DusanDNesic
and
Nora_Ammann
Feb 16, 2024, 5:48 PM
31
points
1
comment
8
min read
LW
link
Fatebook for Chrome: Make and embed forecasts anywhere on the web
Adam B
and
Sage Future
Feb 16, 2024, 4:08 PM
14
points
3
comments
1
min read
LW
link
“Arctic Instincts? The universal principles of Arctic psychological adaptation and the origins of East Asian psychology”—Call for Reviewers (Seeds of Science)
rogersbacon
Feb 16, 2024, 3:02 PM
0
points
0
comments
2
min read
LW
link
The Altman Technocracy
PhilosophicalSoul
Feb 16, 2024, 1:27 PM
5
points
31
comments
2
min read
LW
link
Discord space for people with FTX clawbacks/claims request
kotrfa
Feb 16, 2024, 9:04 AM
1
point
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
OpenAI’s Sora is an agent
Caleb Biddulph
Feb 16, 2024, 7:35 AM
97
points
25
comments
4
min read
LW
link
Massapequa (Long Island), New York – ACX/SSC Meetup
Gabriel Weil
Feb 16, 2024, 1:24 AM
4
points
0
comments
1
min read
LW
link
Offering AI safety support calls for ML professionals
Vael Gates
Feb 15, 2024, 11:48 PM
61
points
1
comment
LW
link
7. Evolution and Ethics
RogerDearnaley
Feb 15, 2024, 11:38 PM
3
points
7
comments
6
min read
LW
link
Mapping the semantic void III: Exploring neighbourhoods
mwatkins
Feb 15, 2024, 11:01 PM
13
points
0
comments
10
min read
LW
link
Mapping the semantic void II: Above, below and between token embeddings
mwatkins
Feb 15, 2024, 11:00 PM
31
points
4
comments
10
min read
LW
link
Raising children on the eve of AI
juliawise
Feb 15, 2024, 9:28 PM
275
points
47
comments
5
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel