Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
On safety of being a moral patient of ASI
Yaroslav Granowski
May 24, 2025, 9:24 PM
3
points
8
comments
1
min read
LW
link
We Need a Baseline for LLM-Aided Experiments
J Bostock
May 24, 2025, 8:52 PM
11
points
1
comment
1
min read
LW
link
Lie Detectors. Technical solutions to the cooperation problem.
Window Frame
May 24, 2025, 8:05 PM
6
points
0
comments
10
min read
LW
link
It’s hard to make scheming evals look realistic for LLMs
Igor Ivanov
and
Danil Kadochnikov
May 24, 2025, 7:17 PM
141
points
27
comments
5
min read
LW
link
Launch of the New Horizons Podcast
Nezir Alic
May 24, 2025, 5:50 PM
5
points
0
comments
1
min read
LW
link
Priming effects are fake, but framing effects are real
Matrice Jacobine
May 24, 2025, 10:54 AM
32
points
0
comments
1
min read
LW
link
(xphi.net)
The Cosmic Lottery
James Stephen Brown
May 24, 2025, 4:05 AM
5
points
0
comments
5
min read
LW
link
(nonzerosum.games)
Some Considerations on Prediction Markets
belos
May 24, 2025, 3:24 AM
2
points
0
comments
9
min read
LW
link
The Paradox of Low Fertility
Zero Contradictions
May 24, 2025, 12:59 AM
−1
points
6
comments
1
min read
LW
link
(expandingrationality.substack.com)
That’s Not How Epigenetic Modifications Work
johnswentworth
May 24, 2025, 12:15 AM
67
points
12
comments
2
min read
LW
link
[Question]
To what extent is AI safety work trying to get AI to reliably and safely do what the user asks vs. do what is best in some ultimate sense?
Jordan Arel
May 23, 2025, 9:05 PM
14
points
3
comments
1
min read
LW
link
Default history is dead wrong
kilgoar
May 23, 2025, 4:31 PM
−20
points
11
comments
1
min read
LW
link
Notes on Claude 4 System Card
Dentosal
May 23, 2025, 3:23 PM
19
points
2
comments
6
min read
LW
link
What is emptiness?
Vadim Golub
May 23, 2025, 12:06 PM
−4
points
11
comments
9
min read
LW
link
Idiohobbies
dkl9
May 23, 2025, 6:38 AM
11
points
2
comments
1
min read
LW
link
(dkl9.net)
Qualitative Fit Testing
jefftk
May 23, 2025, 2:50 AM
10
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Anthropic is Quietly Backpedalling on its Safety Commitments
garrison
May 23, 2025, 2:26 AM
79
points
7
comments
LW
link
(www.obsolete.pub)
Learning (more) from horse employment history
Tim H
May 23, 2025, 2:11 AM
68
points
13
comments
5
min read
LW
link
Schizobench: Documenting Magical-Thinking Behavior in Claude 4 Opus
viemccoy
May 23, 2025, 1:31 AM
22
points
0
comments
1
min read
LW
link
(metanomicon.ink)
Post-Manifest coworking at Mox
Rachel Shu
and
Austin Chen
May 23, 2025, 12:20 AM
4
points
1
comment
1
min read
LW
link
Claude 4, Opportunistic Blackmail, and “Pleas”
Stephen Martin
May 22, 2025, 7:59 PM
24
points
1
comment
2
min read
LW
link
Problems in AI Alignment: A Scale Model
Mickey Muldoon
May 22, 2025, 7:22 PM
−1
points
3
comments
2
min read
LW
link
(muldoon.cloud)
Art Is Art: AI Is the Next Erotica
Charlie Edwards
May 22, 2025, 6:04 PM
0
points
1
comment
14
min read
LW
link
Reward button alignment
Steven Byrnes
May 22, 2025, 5:36 PM
50
points
15
comments
12
min read
LW
link
We’re Not Advertising Enough (Post 3 of 6 on AI Governance)
Mass_Driver
May 22, 2025, 5:05 PM
109
points
10
comments
28
min read
LW
link
Claude 4
Zach Stein-Perlman
May 22, 2025, 5:00 PM
71
points
24
comments
1
min read
LW
link
(www.anthropic.com)
Video and transcript of talk on AI welfare
Joe Carlsmith
May 22, 2025, 4:15 PM
24
points
1
comment
28
min read
LW
link
(joecarlsmith.substack.com)
What we can learn from afterlife myths
jchan
May 22, 2025, 3:49 PM
5
points
0
comments
15
min read
LW
link
Policy recommendations regarding reproductive technology
TsviBT
May 22, 2025, 2:49 PM
76
points
2
comments
3
min read
LW
link
AI #117: OpenAI Buys Device Maker IO
Zvi
May 22, 2025, 1:40 PM
37
points
9
comments
62
min read
LW
link
(thezvi.wordpress.com)
Does BPC-157 work for healing and tissue repair?
ChristianKl
May 22, 2025, 1:18 PM
17
points
0
comments
5
min read
LW
link
(somaticsignals.jollyjoyjourney.com)
[Question]
How load-bearing is KL divergence from a known-good base model in modern RL?
faul_sname
May 22, 2025, 12:08 PM
12
points
3
comments
4
min read
LW
link
Christianity vs. Tantra vs. Sex – one spiritual path?
pchvykov
May 22, 2025, 11:15 AM
−2
points
0
comments
24
min read
LW
link
Mirror Organisms Are Not Immune to Predation
Matthias Dellago
May 22, 2025, 11:10 AM
27
points
5
comments
1
min read
LW
link
How 2025 AI Forecasts Fared So Far
Adam B
,
romeo
and
elifland
May 22, 2025, 9:42 AM
11
points
2
comments
8
min read
LW
link
(theaidigest.org)
Contain and verify: The endgame of US-China AI competition
sjadler
May 22, 2025, 8:13 AM
5
points
6
comments
2
min read
LW
link
(open.substack.com)
Laugencroissant
Martin Sustrik
May 22, 2025, 6:30 AM
13
points
0
comments
3
min read
LW
link
(250bpm.substack.com)
Google I/O Day
Zvi
May 21, 2025, 10:00 PM
49
points
0
comments
20
min read
LW
link
(thezvi.wordpress.com)
Podcast: How not to waste a billion dollars (on your clinical trial), with Meri Beckwith on Development & Research
rossry
May 21, 2025, 9:27 PM
25
points
0
comments
3
min read
LW
link
(developmentandresearch.bio)
Podcast: From molecule to medicine, with Ross Rheingans-Yoo on Complex Systems
rossry
May 21, 2025, 9:08 PM
15
points
0
comments
5
min read
LW
link
(www.complexsystemspodcast.com)
The stakes of AI moral status
Joe Carlsmith
May 21, 2025, 6:20 PM
78
points
62
comments
14
min read
LW
link
(joecarlsmith.substack.com)
[Question]
Which AI Safety techniques will be ineffective against diffusion models?
Allen Thomas
May 21, 2025, 6:13 PM
4
points
1
comment
1
min read
LW
link
Through The Looking Glasses: Issues & Solutions for Augmented Reality
claywren
21 May 2025 18:11 UTC
1
point
0
comments
22
min read
LW
link
Rooting for Moments, Not Jerseys. Another Approach to Enjoying Sports
Ahmed Elsayyad
21 May 2025 18:11 UTC
1
point
0
comments
3
min read
LW
link
Unexploitable search: blocking malicious use of free parameters
Jacob Pfau
and
Geoffrey Irving
21 May 2025 17:23 UTC
34
points
16
comments
6
min read
LW
link
The Real AI Safety Risk Is a Conceptual Exploit: Anthropomorphism
Anthony Fox
21 May 2025 16:29 UTC
−2
points
0
comments
2
min read
LW
link
You Can’t Skip Exploration: Why understanding experimentation and taste is key to understanding AI
Oliver Sourbut
21 May 2025 16:08 UTC
18
points
0
comments
11
min read
LW
link
(www.oliversourbut.net)
The Problem and Opportunity of Scale
belos
21 May 2025 15:52 UTC
1
point
0
comments
5
min read
LW
link
(bestofagreatlot.substack.com)
Sleep need reduction therapies
harsimony
21 May 2025 15:22 UTC
76
points
18
comments
10
min read
LW
link
(splittinginfinity.substack.com)
Parental Guidance: Framing Superintelligence
ejk64
21 May 2025 15:01 UTC
10
points
0
comments
3
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel