Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
What is Morality?
Zero Contradictions
Jul 29, 2024, 7:19 PM
−1
points
0
comments
1
min read
LW
link
(thewaywardaxolotl.blogspot.com)
Arch-anarchism and immortality
Peter lawless
Jul 29, 2024, 6:10 PM
−5
points
1
comment
2
min read
LW
link
AI Safety Newsletter #39: Implications of a Trump Administration for AI Policy Plus, Safety Engineering
Corin Katzke
,
Alexa Pan
,
Julius
and
Dan H
Jul 29, 2024, 5:50 PM
17
points
1
comment
6
min read
LW
link
(newsletter.safe.ai)
New Blog Post Against AI Doom
Noah Birnbaum
Jul 29, 2024, 5:21 PM
1
point
5
comments
1
min read
LW
link
(substack.com)
An Interpretability Illusion from Population Statistics in Causal Analysis
Daniel Tan
Jul 29, 2024, 2:50 PM
9
points
3
comments
1
min read
LW
link
[Question]
How tokenization influences prompting?
Boris Kashirin
Jul 29, 2024, 10:28 AM
9
points
4
comments
1
min read
LW
link
Understanding Positional Features in Layer 0 SAEs
bilalchughtai
and
Yeu-Tong Lau
Jul 29, 2024, 9:36 AM
43
points
0
comments
5
min read
LW
link
Prediction Markets Explained
Benjamin_Sturisky
Jul 29, 2024, 8:02 AM
8
points
0
comments
9
min read
LW
link
Relativity Theory for What the Future ‘You’ Is and Isn’t
FlorianH
Jul 29, 2024, 2:01 AM
7
points
49
comments
4
min read
LW
link
Wittgenstein and Word2vec: Capturing Relational Meaning in Language and Thought
cleanwhiteroom
Jul 28, 2024, 7:55 PM
2
points
2
comments
2
min read
LW
link
Making Beliefs Pay Rent
Screwtape
and
NoSignalNoNoise
Jul 28, 2024, 5:59 PM
7
points
2
comments
1
min read
LW
link
This is already your second chance
Malmesbury
Jul 28, 2024, 5:13 PM
185
points
13
comments
8
min read
LW
link
[Question]
Has Eliezer publicly and satisfactorily responded to attempted rebuttals of the analogy to evolution?
kaler
Jul 28, 2024, 12:23 PM
10
points
14
comments
1
min read
LW
link
Family and Society
Zero Contradictions
Jul 28, 2024, 7:05 AM
1
point
0
comments
1
min read
LW
link
(thewaywardaxolotl.blogspot.com)
[Question]
What is AI Safety’s line of retreat?
Remmelt
Jul 28, 2024, 5:43 AM
12
points
12
comments
LW
link
AXRP Episode 34 - AI Evaluations with Beth Barnes
DanielFilan
Jul 28, 2024, 3:30 AM
23
points
0
comments
69
min read
LW
link
Rats, Back a Candidate
Blake
Jul 28, 2024, 3:19 AM
−40
points
19
comments
1
min read
LW
link
AI existential risk probabilities are too unreliable to inform policy
Oleg Trott
Jul 28, 2024, 12:59 AM
18
points
5
comments
1
min read
LW
link
(www.aisnakeoil.com)
Idle Speculations on Pipeline Parallelism
DaemonicSigil
Jul 27, 2024, 10:40 PM
1
point
0
comments
4
min read
LW
link
(pbement.com)
Re: Anthropic’s suggested SB-1047 amendments
RobertM
Jul 27, 2024, 10:32 PM
87
points
13
comments
9
min read
LW
link
(www.documentcloud.org)
The problem with psychology is that it has no theory.
Nicholas D.
Jul 27, 2024, 7:36 PM
2
points
7
comments
4
min read
LW
link
(nicholasdecker.substack.com)
Bryan Johnson and a search for healthy longevity
NancyLebovitz
Jul 27, 2024, 3:28 PM
18
points
17
comments
1
min read
LW
link
What are matching markets?
ohmurphy
Jul 27, 2024, 3:05 PM
12
points
0
comments
8
min read
LW
link
(ohmurphy.substack.com)
Safety consultations for AI lab employees
Zach Stein-Perlman
Jul 27, 2024, 3:00 PM
181
points
4
comments
1
min read
LW
link
The Case Against UBI
Zero Contradictions
Jul 27, 2024, 6:36 AM
−1
points
2
comments
2
min read
LW
link
(thewaywardaxolotl.blogspot.com)
Unlocking Solutions—By Understanding Coordination Problems
James Stephen Brown
Jul 27, 2024, 4:52 AM
56
points
4
comments
5
min read
LW
link
(nonzerosum.games)
Utilitarianism and the replaceability of desires and attachments
MichaelStJules
Jul 27, 2024, 1:57 AM
5
points
2
comments
LW
link
Inspired by: Failures in Kindness
X4vier
Jul 27, 2024, 1:21 AM
60
points
2
comments
3
min read
LW
link
My Experience Using Gamification
Wyatt S
Jul 26, 2024, 11:06 PM
13
points
4
comments
4
min read
LW
link
How the AI safety technical landscape has changed in the last year, according to some practitioners
tlevin
Jul 26, 2024, 7:06 PM
57
points
6
comments
2
min read
LW
link
A Visual Task that’s Hard for GPT-4o, but Doable for Primary Schoolers
Lennart Finke
Jul 26, 2024, 5:51 PM
25
points
6
comments
2
min read
LW
link
Unaligned AI is coming regardless.
verbalshadow
Jul 26, 2024, 4:41 PM
−15
points
3
comments
2
min read
LW
link
Index of rationalist groups in the Bay Area June 2025
Lucie Philippon
,
Czynski
and
Screwtape
Jul 26, 2024, 4:32 PM
39
points
14
comments
2
min read
LW
link
End Single Family Zoning by Overturning Euclid V Ambler
Maxwell Tabarrok
Jul 26, 2024, 2:08 PM
32
points
1
comment
7
min read
LW
link
(www.maximum-progress.com)
Common Uses of “Acceptance”
Yi-Yang
Jul 26, 2024, 11:18 AM
14
points
5
comments
24
min read
LW
link
Universal Basic Income and Poverty
Eliezer Yudkowsky
Jul 26, 2024, 7:23 AM
328
points
141
comments
9
min read
LW
link
A Solomonoff Inductor Walks Into a Bar: Schelling Points for Communication
johnswentworth
and
David Lorell
Jul 26, 2024, 12:33 AM
95
points
2
comments
13
min read
LW
link
What does a Gambler’s Verity world look like?
ErioirE
Jul 25, 2024, 10:03 PM
7
points
6
comments
1
min read
LW
link
Pacing Outside the Box: RNNs Learn to Plan in Sokoban
Adrià Garriga-alonso
,
taufeeque
,
AdamGleave
and
ChengCheng
Jul 25, 2024, 10:00 PM
59
points
8
comments
2
min read
LW
link
(arxiv.org)
Sex, Death, and Complexity
Zero Contradictions
Jul 25, 2024, 9:22 PM
0
points
0
comments
1
min read
LW
link
(thewaywardaxolotl.blogspot.com)
Does robustness improve with scale?
ChengCheng
,
niki.h
,
Ian McKenzie
,
Oskar Hollinsworth
,
Tom Tseng
and
AdamGleave
Jul 25, 2024, 8:55 PM
14
points
0
comments
1
min read
LW
link
(far.ai)
Organisation for Program Equilibrium reading group
Smaug123
25 Jul 2024 19:11 UTC
11
points
14
comments
1
min read
LW
link
In Text
Valerii Kremnev
25 Jul 2024 18:22 UTC
−3
points
0
comments
5
min read
LW
link
“AI achieves silver-medal standard solving International Mathematical Olympiad problems”
gjm
25 Jul 2024 15:58 UTC
133
points
38
comments
2
min read
LW
link
(deepmind.google)
[Talk transcript] What “structure” is and why it matters
Alex_Altair
25 Jul 2024 15:49 UTC
23
points
0
comments
5
min read
LW
link
(www.youtube.com)
AI #74: GPT-4o Mini Me and Llama 3
Zvi
25 Jul 2024 13:50 UTC
30
points
6
comments
36
min read
LW
link
(thezvi.wordpress.com)
AI Constitutions are a tool to reduce societal scale risk
Sammy Martin
25 Jul 2024 11:18 UTC
30
points
2
comments
18
min read
LW
link
Determining the power of investors over Frontier AI Labs is strategically important to reduce x-risk
Lucie Philippon
25 Jul 2024 1:12 UTC
18
points
7
comments
2
min read
LW
link
FLI is hiring across Comms and Ops
beisenpress
25 Jul 2024 0:06 UTC
1
point
0
comments
1
min read
LW
link
A framework for thinking about AI power-seeking
Joe Carlsmith
24 Jul 2024 22:41 UTC
62
points
15
comments
16
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel