Making progress bars for Alignment
Kabir Kumar
Jan 3, 2025, 9:25 PM
2
points
0
comments
1
min read
LW
link
(lu.ma)
The Intelligence Curse
lukedrago
Jan 3, 2025, 7:07 PM
133
points
27
comments
18
min read
LW
link
(lukedrago.substack.com)
The case for pay-on-results coaching
Chipmonk
Jan 3, 2025, 6:40 PM
16
points
3
comments
1
min read
LW
link
Introducing Squiggle AI
ozziegooen
Jan 3, 2025, 5:53 PM
92
points
15
comments
LW
link
Human study on AI spear phishing campaigns
Simon Lermen, Fred Heiding and Andrew Kao
Jan 3, 2025, 3:11 PM
79
points
8
comments
5
min read
LW
link
The subset parity learning problem: much more than you wanted to know
Dmitry Vaintrob
Jan 3, 2025, 9:13 AM
94
points
18
comments
11
min read
LW
link
Building AI safety benchmark environments on themes of universal human values
Roland Pihlakas
Jan 3, 2025, 4:24 AM
18
points
3
comments
8
min read
LW
link
(docs.google.com)
Emotional Superrationality
nullproxy
Jan 2, 2025, 10:54 PM
−6
points
4
comments
11
min read
LW
link
Playing with Otamatones
jefftk
Jan 2, 2025, 7:50 PM
12
points
0
comments
1
min read
LW
link
(www.jefftk.com)
7. Iterate the Game: Racing Where?
Allison Duettmann
Jan 2, 2025, 7:06 PM
11
points
0
comments
9
min read
LW
link
6. Increase Intelligence: Welcome AI Players
Allison Duettmann
Jan 2, 2025, 7:06 PM
6
points
1
comment
19
min read
LW
link
5. Uphold Voluntarism: Digital Defense
Allison Duettmann
Jan 2, 2025, 7:05 PM
3
points
0
comments
18
min read
LW
link
4. Uphold Voluntarism: Physical Defense
Allison Duettmann
Jan 2, 2025, 7:04 PM
6
points
2
comments
23
min read
LW
link
3. Improve Cooperation: Better Technologies
Allison Duettmann
Jan 2, 2025, 7:03 PM
4
points
2
comments
23
min read
LW
link
2. Skim the Manual: Intelligent Voluntary Cooperation
Allison Duettmann
Jan 2, 2025, 7:02 PM
13
points
3
comments
18
min read
LW
link
1. Meet the Players: Value Diversity
Allison Duettmann
Jan 2, 2025, 7:00 PM
32
points
2
comments
11
min read
LW
link
Preface
Allison Duettmann
Jan 2, 2025, 6:59 PM
26
points
2
comments
7
min read
LW
link
The AI Agent Revolution: Beyond the Hype of 2025
DimaG
Jan 2, 2025, 6:55 PM
−7
points
1
comment
28
min read
LW
link
On False Dichotomies
nullproxy
Jan 2, 2025, 6:54 PM
−3
points
0
comments
5
min read
LW
link
Preference Inversion
Benquo
Jan 2, 2025, 6:15 PM
51
points
48
comments
4
min read
LW
link
(benjaminrosshoffman.com)
Alignment Is Not All You Need
Adam Jones
Jan 2, 2025, 5:50 PM
43
points
10
comments
6
min read
LW
link
(adamjones.me)
What’s the short timeline plan?
Marius Hobbhahn
Jan 2, 2025, 2:59 PM
353
points
49
comments
23
min read
LW
link
AI #97: 4
Zvi
Jan 2, 2025, 2:10 PM
45
points
4
comments
40
min read
LW
link
(thezvi.wordpress.com)
[Question] Can private companies test LVTs?
Yair Halberstadt
Jan 2, 2025, 11:08 AM
7
points
0
comments
1
min read
LW
link
Grammars, subgrammars, and combinatorics of generalization in transformers
Dmitry Vaintrob
Jan 2, 2025, 9:37 AM
36
points
0
comments
17
min read
LW
link
[Question] 2025 Alignment Predictions
anaguma
Jan 2, 2025, 5:37 AM
3
points
3
comments
1
min read
LW
link
Grading my 2024 AI predictions
Nikola Jurkovic
Jan 2, 2025, 5:01 AM
19
points
1
comment
3
min read
LW
link
Practicing Bayesian Epistemology with “Two Boys” Probability Puzzles
Liron
Jan 2, 2025, 4:42 AM
43
points
14
comments
6
min read
LW
link
Implications of Moral Realism on AI Safety
Myles H
Jan 2, 2025, 2:58 AM
7
points
1
comment
3
min read
LW
link
Read The Sequences As If They Were Written Today
Peter Berggren
Jan 2, 2025, 2:51 AM
63
points
7
comments
4
min read
LW
link
A Collection of Empirical Frames about Language Models
Daniel Tan
Jan 2, 2025, 2:49 AM
27
points
0
comments
3
min read
LW
link
My January alignment theory Nanowrimo
Dmitry Vaintrob
Jan 2, 2025, 12:07 AM
42
points
2
comments
2
min read
LW
link
Intranasal mRNA Vaccines?
J Bostock
Jan 1, 2025, 11:46 PM
26
points
2
comments
3
min read
LW
link
Example of GPU-accelerated scientific computing with PyTorch
Tahp
Jan 1, 2025, 11:01 PM
6
points
0
comments
6
min read
LW
link
(passwordpaper.com)
Economic Post-ASI Transition
Joel Burget
Jan 1, 2025, 10:37 PM
20
points
11
comments
1
min read
LW
link
2024 in AI predictions
jessicata
Jan 1, 2025, 8:29 PM
117
points
3
comments
8
min read
LW
link
Approaches to Group Singing
jefftk
Jan 1, 2025, 12:50 PM
12
points
1
comment
3
min read
LW
link
(www.jefftk.com)
Alienable (not Inalienable) Right to Buy
FlorianH
Jan 1, 2025, 12:19 PM
7
points
6
comments
4
min read
LW
link
AGI is what generates evolutionarily fit and novel information
onur
Jan 1, 2025, 9:22 AM
1
point
0
comments
6
min read
LW
link
(solmaz.io)
The OODA Loop—Observe, Orient, Decide, Act
Davis_Kingsley
Jan 1, 2025, 8:00 AM
53
points
2
comments
11
min read
LW
link
Comment on “Death and the Gorgon”
Zack_M_Davis
Jan 1, 2025, 5:47 AM
103
points
33
comments
8
min read
LW
link
Fireplace and Candle Smoke
jefftk
Jan 1, 2025, 1:50 AM
36
points
4
comments
1
min read
LW
link
(www.jefftk.com)
Merry Sciencemas: A Rat Solstice Retrospective
leebriskCyrano
Jan 1, 2025, 1:08 AM
−8
points
0
comments
1
min read
LW
link
(leebriskcyrano.com)
Riffing on Machines of Loving Grace
an1lam
Jan 1, 2025, 1:06 AM
9
points
0
comments
1
min read
LW
link
(an1lam.substack.com)
new chinese stealth aircraft
bhauth
Jan 1, 2025, 12:19 AM
58
points
3
comments
6
min read
LW
link
(bhauth.com)
The Roots of Progress 2024 in review
jasoncrawford
1 Jan 2025 0:02 UTC
27
points
0
comments
11
min read
LW
link
(newsletter.rootsofprogress.org)
Genesis
PeterMcCluskey
31 Dec 2024 22:01 UTC
18
points
0
comments
2
min read
LW
link
(bayesianinvestor.com)
Favorite colors of some LLMs.
Canaletto
31 Dec 2024 21:22 UTC
10
points
3
comments
7
min read
LW
link
My AGI safety research—2024 review, ’25 plans
Steven Byrnes
31 Dec 2024 21:05 UTC
109
points
4
comments
8
min read
LW
link
How Business Solved (?) the Human Alignment Problem
Gianluca Calcagni
31 Dec 2024 20:39 UTC
−2
points
1
comment
8
min read
LW
link