A model of the final phase: the current frontier AIs as de facto CEOs of their own companies
Mitchell_Porter
Mar 8, 2025, 10:15 PM
23
points
2
comments
1
min read
Harry Potter and the Methods of Rationality 10 Year Anniversary Party!
Robert Cousineau
Mar 8, 2025, 9:29 PM
6
points
0
comments
1
min read
A case for peer-reviewed conspiracy theories
Sam G
Mar 8, 2025, 8:41 PM
13
points
2
comments
4
min read
The machine has no mouth and it must scream
zef
Mar 8, 2025, 4:40 PM
77
points
1
comment
7
min read
(zephyyr.substack.com)
How Do We Fix the Education Crisis?
James Camacho
Mar 8, 2025, 2:59 AM
12
points
4
comments
8
min read
GPT-4.5 Can Play Losing Chess
GoteNoSente
Mar 8, 2025, 12:58 AM
9
points
0
comments
1
min read
(chatgpt.com)
[Question]
are “almost-p-zombies” possible?
KvmanThinking
Mar 7, 2025, 10:58 PM
4
points
3
comments
1
min read
Sufficiently Decentralized Intelligence is Indistinguishable from Synchronicity
Sahil
Mar 7, 2025, 9:50 PM
27
points
0
comments
19
min read
Amplifying the Computational No-Coincidence Conjecture
glauberdebona
Mar 7, 2025, 9:29 PM
8
points
6
comments
7
min read
[ages 16-21] Apply to PAIR & ESPR, Summer AI & Rationality Programs
Anna Gajdova
Mar 7, 2025, 7:49 PM
4
points
0
comments
1
min read
What if consciousness emerges from a predictive loop?
JohnMarkNorman
Mar 7, 2025, 7:46 PM
2
points
0
comments
1
min read
Forecasting newsletter #3/2025: Long march through the institutions
NunoSempere
Mar 7, 2025, 6:17 PM
8
points
0
comments
1
min read
(forecasting.substack.com)
Childhood and Education #9: School is Hell
Zvi
Mar 7, 2025, 12:40 PM
52
points
36
comments
37
min read
(thezvi.wordpress.com)
The Insanity Detector and Writing
Johannes C. Mayer
Mar 7, 2025, 11:19 AM
20
points
3
comments
1
min read
So how well is Claude playing Pokémon?
Julian Bradshaw
Mar 7, 2025, 5:54 AM
171
points
74
comments
5
min read
Of Loving Grace
Charlie Sanders
Mar 7, 2025, 4:48 AM
−3
points
0
comments
3
min read
(www.dailymicrofiction.com)
In-Context Scheming: A Run is Worth a Thousand Words
noise-field
Mar 7, 2025, 2:47 AM
10
points
0
comments
1
min read
(github.com)
AI for Music, A Tool for Manipulation or Expression?
Sunny Huiseon Lee
Mar 7, 2025, 2:47 AM
1
point
0
comments
1
min read
Are recent LLMs better at reasoning or better at memorizing?
Jude Khouja
,
harrymayne
,
ryanothnielkearns
and
karolinakorgul
Mar 7, 2025, 2:44 AM
11
points
0
comments
4
min read
The Dead Planet Theory
arealsociety
Mar 7, 2025, 2:43 AM
17
points
0
comments
1
min read
(open.substack.com)
The end of state
Peter lawless
Mar 7, 2025, 12:17 AM
−21
points
1
comment
1
min read
How Can Average People Contribute to AI Safety?
Stephen McAleese
Mar 6, 2025, 10:50 PM
16
points
4
comments
8
min read
Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan
UnofficialLinkpostBot
Mar 6, 2025, 10:38 PM
11
points
2
comments
2
min read
(www.anthropic.com)
Lots of brief thoughts on Software Engineering
Yair Halberstadt
Mar 6, 2025, 7:50 PM
47
points
17
comments
10
min read
What the Headlines Miss About the Latest Decision in the Musk vs. OpenAI Lawsuit
garrison
Mar 6, 2025, 7:49 PM
98
points
0
comments
(garrisonlovely.substack.com)
The optimizer won’t just guess your intended semantics
Thomas Kehrenberg
Mar 6, 2025, 7:42 PM
20
points
1
comment
6
min read
AISN #49: Superintelligence Strategy
Corin Katzke
and
Dan H
Mar 6, 2025, 5:46 PM
6
points
1
comment
5
min read
(newsletter.safe.ai)
Decision-Relevance of worlds and ADT implementations
Maxime Riché
Mar 6, 2025, 4:57 PM
9
points
0
comments
15
min read
AI #106: Not so Fast
Zvi
Mar 6, 2025, 3:40 PM
34
points
5
comments
38
min read
(thezvi.wordpress.com)
Can a finite physical device be Turing equivalent?
Noosphere89
Mar 6, 2025, 3:02 PM
0
points
10
comments
2
min read
(lifeiscomputation.com)
We should start looking for scheming “in the wild”
Marius Hobbhahn
Mar 6, 2025, 1:49 PM
89
points
4
comments
5
min read
Bounded AI might be viable
Mateusz Bagiński
and
JustinShovelain
Mar 6, 2025, 12:55 PM
24
points
4
comments
20
min read
Publish your genomic data
samuelshadrach
Mar 6, 2025, 12:39 PM
1
point
0
comments
1
min read
Which meat to eat: CO₂ vs Animal suffering
B Jacobs
Mar 6, 2025, 12:37 PM
2
points
2
comments
3
min read
(bobjacobs.substack.com)
Musings on Scenario Forecasting and AI
Alvin Ånestrand
Mar 6, 2025, 12:28 PM
10
points
0
comments
11
min read
(forecastingaifutures.substack.com)
Minor interpretability exploration #2: Extending superposition to different activation functions
Rareș Baron
Mar 6, 2025, 11:22 AM
1
point
0
comments
4
min read
What is Lock-In?
alamerton
Mar 6, 2025, 11:09 AM
5
points
0
comments
9
min read
ASI Game Theory: The Cosmic Dark Forest Deterrent
tavurth
Mar 6, 2025, 10:28 AM
1
point
4
comments
1
min read
The Hidden Cost of Our Lies to AI
Nicholas Andresen
Mar 6, 2025, 5:03 AM
144
points
18
comments
7
min read
(substack.com)
Camps Should List Bands
jefftk
Mar 6, 2025, 3:00 AM
7
points
0
comments
1
min read
(www.jefftk.com)
Give Neo a Chance
ank
Mar 6, 2025, 1:48 AM
3
points
7
comments
7
min read
[Question]
Sparks of Original Thought?
Annapurna
Mar 6, 2025, 12:53 AM
6
points
4
comments
1
min read
Social Dilemmas — public goods, free riders, and exploitation
James Stephen Brown
Mar 5, 2025, 11:31 PM
6
points
0
comments
3
min read
(nonzerosum.games)
Introducing MASK: A Benchmark for Measuring Honesty in AI Systems
Richard Ren
,
Mantas Mazeika
and
Dan H
5 Mar 2025 22:56 UTC
35
points
5
comments
2
min read
(www.mask-benchmark.ai)
The Hardware-Software Framework: A New Perspective on Economic Growth with AI
Jakub Growiec
5 Mar 2025 19:59 UTC
3
points
0
comments
3
min read
NY State Has a New Frontier Model Bill (+quick takes)
henryj
5 Mar 2025 19:29 UTC
9
points
0
comments
1
min read
(www.henryjosephson.com)
The old memories tree
Yair Halberstadt
5 Mar 2025 19:03 UTC
7
points
1
comment
1
min read
Reply to Vitalik on d/acc
samuelshadrach
5 Mar 2025 18:55 UTC
8
points
0
comments
3
min read
(samuelshadrach.com)
A Bear Case: My Predictions Regarding AI Progress
Thane Ruthenis
5 Mar 2025 16:41 UTC
362
points
157
comments
9
min read
On the Rationality of Deterring ASI
Dan H
5 Mar 2025 16:11 UTC
166
points
34
comments
4
min read
(nationalsecurity.ai)