[Question]
Colonialism in space: Does a collection of minds have exactly two attractors?
StanislavKrym
May 27, 2025, 11:35 PM
3
points
5
comments
1
min read
LW
link
[Question]
What are the best arguments you’ve seen for the Litany of Gendlin?
flowerfeatherfocus
May 27, 2025, 9:19 PM
5
points
3
comments
1
min read
LW
link
What We Learned from Briefing 70+ Lawmakers on the Threat from AI
leticiagarcia
May 27, 2025, 6:23 PM
452
points
14
comments
16
min read
LW
link
(substack.com)
My script for organizing OBNYC meetups
Orioth
May 27, 2025, 6:14 PM
3
points
0
comments
4
min read
LW
link
Untrusted AIs can exploit feedback in control protocols
Mia Hopman, BionicD0LPH1N and Tyler Tracy
May 27, 2025, 4:41 PM
26
points
0
comments
16
min read
LW
link
Requiem for the hopes of a pre-AI world
Mitchell_Porter
May 27, 2025, 2:47 PM
68
points
0
comments
3
min read
LW
link
The Best of All Possible Worlds
Jakub Growiec
May 27, 2025, 1:16 PM
11
points
7
comments
49
min read
LW
link
Dating Roundup #5: Opening Day
Zvi
May 27, 2025, 1:10 PM
26
points
8
comments
27
min read
LW
link
(thezvi.wordpress.com)
Season Recap of the Village: Agents raise $2,000
Shoshannah Tekofsky
May 27, 2025, 12:34 PM
126
points
14
comments
6
min read
LW
link
(theaidigest.org)
Beware the Moral Homophone
ymeskhout
May 27, 2025, 12:06 PM
63
points
4
comments
9
min read
LW
link
(www.ymeskhout.com)
Association taxes are collusion subsidies
KatjaGrace
May 27, 2025, 6:50 AM
102
points
7
comments
1
min read
LW
link
(worldspiritsockpuppet.com)
Creating My Own Winter Solstice Celebration—Southern Hemisphere Edition
joshuamerriam
May 27, 2025, 2:11 AM
5
points
0
comments
2
min read
LW
link
U.S. Government Seeks Input on National AI R&D Strategic Plan—Deadline May 29
mbrooks
May 27, 2025, 1:57 AM
17
points
0
comments
1
min read
LW
link
All Rationalists hate & sabotage Strategy without having any awareness of it.
Oxidize
May 26, 2025, 10:09 PM
−27
points
8
comments
7
min read
LW
link
Personal Ruminations on AI’s Missing Variable Problem
Thehumanproject.ai
May 26, 2025, 9:11 PM
1
point
0
comments
3
min read
LW
link
Poetic Methods II: Rhyme as a Focusing Device
adamShimi
May 26, 2025, 6:29 PM
24
points
1
comment
17
min read
LW
link
(formethods.substack.com)
Is Building Good Note-Taking Software an AGI-Complete Problem?
Thane Ruthenis
May 26, 2025, 6:26 PM
25
points
13
comments
7
min read
LW
link
Principal-Agent Problems and the Structure of Governance
belos
May 26, 2025, 6:23 PM
1
point
0
comments
8
min read
LW
link
(bestofagreatlot.substack.com)
[Question]
Does the Universal Geometry of Embeddings paper have big implications for interpretability?
Evan R. Murphy
May 26, 2025, 6:20 PM
42
points
3
comments
1
min read
LW
link
Socratic Persuasion: Giving Opinionated Yet Truth-Seeking Advice
Neel Nanda
May 26, 2025, 5:38 PM
56
points
13
comments
21
min read
LW
link
(www.neelnanda.io)
[Beneath Psychology] Case study on chronic pain: First insights, and the remaining challenge
jimmy
May 26, 2025, 5:29 PM
8
points
0
comments
11
min read
LW
link
An observation on self-play
jonrxu
May 26, 2025, 5:22 PM
14
points
1
comment
3
min read
LW
link
New website analyzing AI companies’ model evals
Zach Stein-Perlman
May 26, 2025, 4:00 PM
58
points
0
comments
4
min read
LW
link
New scorecard evaluating AI companies on safety
Zach Stein-Perlman
May 26, 2025, 4:00 PM
72
points
8
comments
1
min read
LW
link
[Question]
Asking for AI Safety Career Advice
infinibot27
May 26, 2025, 3:26 PM
3
points
1
comment
1
min read
LW
link
Nerve Blisters: A Stoic Response
Jonathan Moregård
May 26, 2025, 3:07 PM
8
points
2
comments
1
min read
LW
link
(honestliving.substack.com)
On ‘On Caring’
atharva
May 26, 2025, 1:39 PM
8
points
4
comments
3
min read
LW
link
Claude 4 You: The Quest for Mundane Utility
Zvi
May 26, 2025, 1:01 PM
36
points
0
comments
17
min read
LW
link
(thezvi.wordpress.com)
Formalizing Embeddedness Failures in Universal Artificial Intelligence
Cole Wyeth
May 26, 2025, 12:36 PM
39
points
0
comments
1
min read
LW
link
(arxiv.org)
Techies Wanted: How STEM Backgrounds Can Advance Safe AI Policy
Daniel_Eth
May 26, 2025, 11:29 AM
16
points
0
comments
29
min read
LW
link
D&D.Sci: The Choosing Ones [Answerkey and Ruleset]
abstractapplic
May 26, 2025, 9:43 AM
19
points
2
comments
3
min read
LW
link
The Sundog Alignment Theorem: A Proposal for Embodied Alignment via Indirect Inference
Malice
May 26, 2025, 7:26 AM
−9
points
0
comments
3
min read
LW
link
Superposition Without Compression: Why Entangled Representations Are the Default
James Butterworth
May 26, 2025, 5:26 AM
3
points
2
comments
1
min read
LW
link
(drive.google.com)
Seeking Feedback: Toy Model of Deceptive Alignment (Game Theory)
Alex Boche
May 26, 2025, 5:23 AM
5
points
4
comments
5
min read
LW
link
Long-form data bottlenecks might stall AI progress for years
Michelle_Ma
May 26, 2025, 4:36 AM
19
points
0
comments
13
min read
LW
link
Example of Splitting a PR
jefftk
May 26, 2025, 2:20 AM
28
points
0
comments
2
min read
LW
link
(www.jefftk.com)
How I’m telling my friends about AI Safety
k64
May 25, 2025, 10:43 PM
1
point
7
comments
7
min read
LW
link
Good Writing
Adam Zerner
May 25, 2025, 9:52 PM
11
points
0
comments
2
min read
LW
link
(paulgraham.com)
Consider buying voting shares
Hruss
May 25, 2025, 6:01 PM
2
points
3
comments
1
min read
LW
link
[Question]
Can you donate to AI advocacy?
k64
May 25, 2025, 5:54 PM
17
points
4
comments
1
min read
LW
link
Rant: the extreme wastefulness of high rent prices
Knight Lee
May 25, 2025, 5:04 PM
−2
points
0
comments
2
min read
LW
link
Beyond Democracy: A System Where Citizens Vote with Their Taxes
Brendan Golledge
May 25, 2025, 5:00 PM
−1
points
3
comments
7
min read
LW
link
Claude 4 You: Safety and Alignment
Zvi
25 May 2025 14:00 UTC
86
points
8
comments
63
min read
LW
link
(thezvi.wordpress.com)
Alignment Proposal: Adversarially Robust Augmentation and Distillation
Cole Wyeth and abramdemski
25 May 2025 12:58 UTC
54
points
47
comments
13
min read
LW
link
An open job application to AI labs
Hruss
25 May 2025 12:57 UTC
15
points
0
comments
1
min read
LW
link
Meditations on Doge
Martin Sustrik
25 May 2025 12:00 UTC
129
points
44
comments
9
min read
LW
link
(250bpm.substack.com)
Case Studies in Simulators and Agents
WillPetillo, Sean Herrington, Spencer Ames, Adebayo Mubarak and Cancus
25 May 2025 5:40 UTC
11
points
8
comments
6
min read
LW
link
On safety of being a moral patient of ASI
Yaroslav Granowski
24 May 2025 21:24 UTC
3
points
8
comments
1
min read
LW
link
We Need a Baseline for LLM-Aided Experiments
J Bostock
24 May 2025 20:52 UTC
11
points
1
comment
1
min read
LW
link
Lie Detectors. Technical solutions to the cooperation problem.
Window Frame
24 May 2025 20:05 UTC
6
points
0
comments
10
min read
LW
link