October The First Is Too Late
gwern | May 13, 2025, 9:45 PM | 58 points | 8 comments | 1 min read | (gwern.net)
Announcing Trajectory Labs—A Toronto AI Safety Office
Juliana Eberschlag and mariogibney | May 13, 2025, 9:04 PM | 24 points | 3 comments | 2 min read | (forum.effectivealtruism.org)
Working through a small tiling result
James Payor | May 13, 2025, 8:28 PM | 66 points | 9 comments | 5 min read
4o in Absolute Mode on the enslavement of “procedural persons”
JenniferRM | May 13, 2025, 8:18 PM | 0 points | 0 comments | 26 min read
LessWrong Community Weekend 2025- Applications are open
jt | May 13, 2025, 6:55 PM | 43 points | 0 comments | 2 min read
[Question] If only the most powerful AGI is misaligned, can it be used as a doomsday machine?
StanislavKrym | May 13, 2025, 6:12 PM | −1 points | 0 comments | 1 min read
Apply for ARBOx2: an ML safety intensive [deadline: 25th of May 2025]
Margot | May 13, 2025, 6:08 PM | 3 points | 0 comments | 1 min read | (forum.effectivealtruism.org)
AISN #54: OpenAI Updates Restructure Plan
Corin Katzke and Dan H | May 13, 2025, 4:59 PM | 6 points | 1 comment | 4 min read | (newsletter.safe.ai)
Optimization & AI Risk
atharva | May 13, 2025, 3:15 PM | 16 points | 4 comments | 1 min read
How To Help Neglected Animals
omnizoid | May 13, 2025, 3:07 PM | −1 points | 1 comment | 8 min read
Too Soon
Gordon Seidoh Worley | May 13, 2025, 3:01 PM | 209 points | 19 comments | 4 min read
Monthly Roundup #30: May 2025
Zvi | May 13, 2025, 2:10 PM | 14 points | 2 comments | 38 min read | (thezvi.wordpress.com)
Work as meditation
pchvykov | May 13, 2025, 12:02 PM | 25 points | 3 comments | 7 min read
Satire: Sam Altman get’s grilled by the Financial Times for his kitchen and his cooking skills + what this might say about him
Marius Adrian Nicoară | May 13, 2025, 9:38 AM | 2 points | 0 comments | 2 min read
Levels of Republicanism
Benquo | May 13, 2025, 8:35 AM | 23 points | 8 comments | 6 min read | (benjaminrosshoffman.com)
Caplan’s being melodramatic about circumcision
Yair Halberstadt | May 13, 2025, 5:27 AM | −22 points | 1 comment | 2 min read
AI Doomerism in 1879
David Gross | May 13, 2025, 2:48 AM | 135 points | 36 comments | 8 min read
No-self as an alignment target
Milan W | May 13, 2025, 1:48 AM | 35 points | 5 comments | 1 min read
[Part-time AI Safety Research Program] MARS 3.0 Applications Open for Participants & Recruiting Mentors
thneebie | May 12, 2025, 7:55 PM | 2 points | 0 comments | 2 min read
Neo-solid Modernity—Crisis of Incoherence
Momcilo | May 12, 2025, 7:36 PM | −1 points | 1 comment | 4 min read
Measuring Schelling Coordination—Reflections on Subversion Strategy Eval
Graeme Ford | May 12, 2025, 7:06 PM | 5 points | 0 comments | 8 min read
Procrastination is not real, it can’t hurt you
Mayank Goel | May 12, 2025, 7:00 PM | 1 point | 16 comments | 4 min read | (mayankgoel28.substack.com)
[Question] Can I publish songs derived from the Sequences’ posts on YouTube?
azergante | May 12, 2025, 6:34 PM | 4 points | 2 comments | 1 min read
How to title your blog post or whatever
dynomight | May 12, 2025, 6:12 PM | 28 points | 6 comments | 4 min read | (dynomight.net)
Political sycophancy as a model organism of scheming
Alex Mallen and Vivek Hebbar | May 12, 2025, 5:49 PM | 39 points | 0 comments | 14 min read
Things I Learned Making The SB-1047 Documentary
Michaël Trazzi | May 12, 2025, 5:41 PM | 63 points | 2 comments | 2 min read
A Live Look at the Senate AI Hearing
Zvi | May 12, 2025, 5:40 PM | 38 points | 1 comment | 34 min read | (thezvi.wordpress.com)
Global Risks Weekly Roundup #19/2025: India/Pakistan ceasefire, US/China tariffs deal & OpenAI nonprofit control
NunoSempere | May 12, 2025, 5:08 PM | 10 points | 1 comment | 13 min read | (blog.sentinel-team.org)
[Beneath Psychology] Introduction Part 1: The Challenge
jimmy | May 12, 2025, 5:01 PM | 2 points | 2 comments | 3 min read
PSA: The LessWrong Feedback Service
JustisMills | May 12, 2025, 4:34 PM | 206 points | 12 comments | 2 min read
Cambridge Boston Alignment Initiative Summer Research Fellowship in AI Safety (Deadline: May 18)
peterslattery | May 12, 2025, 4:20 PM | 8 points | 0 comments | 1 min read
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Matrice Jacobine | May 12, 2025, 3:20 PM | 6 points | 4 comments | 1 min read | (www.arxiv.org)
AIs at the current capability level may be important for future safety work
ryan_greenblatt | May 12, 2025, 2:06 PM | 81 points | 2 comments | 4 min read
[Question] Game theory of “Nuclear Prisoner’s Dilemma”—on nuking rocks
CronoDAS | May 12, 2025, 11:07 AM | 11 points | 6 comments | 2 min read
What Is Death?
Mati_Roy | May 12, 2025, 2:14 AM | 6 points | 0 comments | 1 min read | (preservinghope.substack.com)
Highly Opinionated Advice on How to Write ML Papers
Neel Nanda | May 12, 2025, 1:59 AM | 60 points | 4 comments | 32 min read
Absolute Zero: Alpha Zero for LLM
alapmi | May 11, 2025, 8:42 PM | 23 points | 16 comments | 1 min read
AGI will result from an ecosystem not a single firm
hamish_low | May 11, 2025, 8:06 PM | 6 points | 1 comment | 6 min read | (cambrianr.substack.com)
Thou shalt not command an alighned AI
Martin Vlach | May 11, 2025, 8:02 PM | 0 points | 4 comments | 1 min read
[Question] How do I design long prompts for thinking zero shot systems with distinct equally distributed prompt sections (mission, goals, memories, how-to-respond,… etc) and how to maintain llm coherence?
ollie_ | May 11, 2025, 7:32 PM | 2 points | 5 comments | 1 min read
a confusion about preference orderings
nostalgebraist | May 11, 2025, 7:30 PM | 92 points | 39 comments | 11 min read
[Book Translation] Three Days in Dwarfland
Viliam | May 11, 2025, 5:54 PM | 27 points | 6 comments | 1 min read
Better Air Purifiers
jefftk | 11 May 2025 16:50 UTC | 71 points | 21 comments | 3 min read | (www.jefftk.com)
Aligning Agents, Tools, and Simulators
WillPetillo, Sean Herrington, Spencer Ames, Adebayo Mubarak and Cancus | 11 May 2025 7:59 UTC | 21 points | 0 comments | 6 min read
Consider not donating under $100 to political candidates
DanielFilan | 11 May 2025 3:20 UTC | 133 points | 32 comments | 1 min read | (danielfilan.com)
Somerville Porchfest 2025
jefftk | 11 May 2025 2:00 UTC | 15 points | 1 comment | 2 min read | (www.jefftk.com)
It’s Okay to Feel Bad for a Bit
moridinamael | 10 May 2025 23:24 UTC | 133 points | 26 comments | 3 min read
G.D. as Capitalist Evolution, and the claim for humanity’s (temporary) upper hand
Martin Vlach | 10 May 2025 21:18 UTC | 8 points | 3 comments | 1 min read
Book Review: “Encounters with Einstein” by Heisenberg
Baram Sosis | 10 May 2025 20:55 UTC | 31 points | 6 comments | 7 min read
Where is the YIMBY movement for healthcare?
jasoncrawford | 10 May 2025 20:36 UTC | 20 points | 10 comments | 2 min read | (newsletter.rootsofprogress.org)