Luna Lovegood and the Chamber of Secrets, Part 1
Kongo Landwalker and lsusr
May 26, 2024, 10:17 PM
24
points
0
comments
3
min read
LW
link
If you are also the worst at politics
lemonhope
May 26, 2024, 8:07 PM
32
points
8
comments
1
min read
LW
link
Review: Conor Moreton’s “Civilization & Cooperation”
Duncan Sabien (Inactive)
May 26, 2024, 7:32 PM
80
points
8
comments
38
min read
LW
link
The necessity of “Guardian AI” and two conditions for its achievement
Proica
May 26, 2024, 5:39 PM
−2
points
0
comments
15
min read
LW
link
Notifications Received in 30 Minutes of Class
tanagrabeast
May 26, 2024, 5:02 PM
358
points
16
comments
8
min read
LW
link
Show LW: HackerNews but for research papers
sleno
May 26, 2024, 3:14 PM
6
points
1
comment
1
min read
LW
link
Disproving and partially fixing a fully homomorphic encryption scheme with perfect secrecy
Lysandre Terrisse
May 26, 2024, 2:56 PM
16
points
1
comment
18
min read
LW
link
The AI Revolution in Biology
Roman Leventov
May 26, 2024, 9:30 AM
13
points
0
comments
1
min read
LW
link
(www.cognitiverevolution.ai)
[Question]
Who does the artwork for LessWrong?
Edwin Evans
May 26, 2024, 5:55 AM
10
points
1
comment
1
min read
LW
link
[Question]
Is there an idiom for bonding over shared trials/trauma?
CstineSublime
May 26, 2024, 1:18 AM
2
points
1
comment
1
min read
LW
link
Moloch—An Illustrated Primer
James Stephen Brown
May 26, 2024, 1:04 AM
5
points
0
comments
7
min read
LW
link
(nonzerosum.games)
[Question]
Is CDT with precommitment enough?
martinkunev
May 25, 2024, 9:40 PM
10
points
17
comments
1
min read
LW
link
Complex systems theory in human performance. New model for conceptualizing training, adaptation and long-term development
Matěj Nekoranec
May 25, 2024, 8:17 PM
1
point
0
comments
7
min read
LW
link
Blindspot in Sport’s Data-Driven Age
Matěj Nekoranec
May 25, 2024, 8:17 PM
2
points
0
comments
7
min read
LW
link
LMSR subsidy parameter is the price of information
Abhimanyu Pallavi Sudhir
May 25, 2024, 6:05 PM
5
points
0
comments
1
min read
LW
link
Low Fertility is a Degrowth Paradise
Maxwell Tabarrok
May 25, 2024, 5:35 PM
7
points
2
comments
3
min read
LW
link
(www.maximum-progress.com)
Episode: Austin vs Linch on OpenAI
Austin Chen
May 25, 2024, 4:15 PM
20
points
25
comments
LW
link
(manifund.substack.com)
Training-time domain authorization could be helpful for safety
domenicrosati, Jan Wehner and David Atanasov
May 25, 2024, 3:10 PM
15
points
4
comments
7
min read
LW
link
Level up your spreadsheeting
angelinahli
May 25, 2024, 2:57 PM
44
points
11
comments
3
min read
LW
link
(docs.google.com)
“Successful language model evals” by Jason Wei
Arjun Panickssery
May 25, 2024, 9:34 AM
7
points
0
comments
1
min read
LW
link
(www.jasonwei.net)
Beta Tester Request: Rallypoint Bounties
lukemarks
May 25, 2024, 9:11 AM
25
points
4
comments
1
min read
LW
link
[Question]
What should the norms around AI voices be?
ChristianKl
May 25, 2024, 6:29 AM
17
points
6
comments
1
min read
LW
link
Secret US natsec project with intel revealed
Nathan Helm-Burger
May 25, 2024, 4:22 AM
27
points
1
comment
1
min read
LW
link
(www.politico.com)
Launch & Grow Your University Group: Apply now to OSP & FSP!
agucova
May 25, 2024, 1:03 AM
3
points
0
comments
LW
link
Computational Mechanics Hackathon (June 1 & 2)
Adam Shai
May 24, 2024, 10:18 PM
34
points
5
comments
1
min read
LW
link
[Question]
Request for comments/opinions/ideas on safety/ethics for use of tool AI in a large healthcare system.
bokov
May 24, 2024, 8:53 PM
5
points
2
comments
1
min read
LW
link
NYU Code Debates Update/Postmortem
David Rein
May 24, 2024, 4:08 PM
27
points
4
comments
10
min read
LW
link
AI companies aren’t really using external evaluators
Zach Stein-Perlman
May 24, 2024, 4:01 PM
242
points
15
comments
4
min read
LW
link
The Schumer Report on AI (RTFB)
Zvi
May 24, 2024, 3:10 PM
34
points
3
comments
36
min read
LW
link
(thezvi.wordpress.com)
minutes from a human-alignment meeting
bhauth
May 24, 2024, 5:01 AM
67
points
4
comments
2
min read
LW
link
Talent Needs of Technical AI Safety Teams
yams, Carson Jones, McKennaFitzgerald and Ryan Kidd
May 24, 2024, 12:36 AM
118
points
65
comments
14
min read
LW
link
How to Give Coming AGI’s the Best Chance of Figuring Out Ethics for Us
sweenesm
May 23, 2024, 7:44 PM
1
point
2
comments
10
min read
LW
link
Mentorship in AGI Safety (MAGIS) call for mentors
Valentin2026 and Joe Rogero
May 23, 2024, 6:28 PM
31
points
3
comments
2
min read
LW
link
Quick Thoughts on Scaling Monosemanticity
Joel Burget
May 23, 2024, 4:22 PM
28
points
1
comment
4
min read
LW
link
(transformer-circuits.pub)
The case for stopping AI safety research
catubc
May 23, 2024, 3:55 PM
53
points
38
comments
1
min read
LW
link
[Question]
SAE sparse feature graph using only residual layers
Jaehyuk Lim
May 23, 2024, 1:32 PM
0
points
3
comments
1
min read
LW
link
[Question]
Are most people deeply confused about “love”, or am I missing a human universal?
SpectrumDT
May 23, 2024, 1:22 PM
13
points
28
comments
3
min read
LW
link
Executive Dysfunction 101
DaystarEld
May 23, 2024, 12:43 PM
28
points
1
comment
3
min read
LW
link
(daystareld.com)
AI #65: I Spy With My AI
Zvi
May 23, 2024, 12:40 PM
28
points
7
comments
43
min read
LW
link
(thezvi.wordpress.com)
What mistakes has the AI safety movement made?
EuanMcLean
May 23, 2024, 11:19 AM
64
points
29
comments
12
min read
LW
link
What should AI safety be trying to achieve?
EuanMcLean
May 23, 2024, 11:17 AM
17
points
1
comment
13
min read
LW
link
What will the first human-level AI look like, and how might things go wrong?
EuanMcLean
May 23, 2024, 11:17 AM
20
points
2
comments
15
min read
LW
link
Big Picture AI Safety: Introduction
EuanMcLean
May 23, 2024, 11:15 AM
46
points
7
comments
5
min read
LW
link
Paper in Science: Managing extreme AI risks amid rapid progress
JanB
May 23, 2024, 8:40 AM
50
points
2
comments
1
min read
LW
link
Power Law Policy
Ben Turtel
May 23, 2024, 5:28 AM
4
points
7
comments
6
min read
LW
link
(bturtel.substack.com)
Why entropy means you might not have to worry as much about superintelligent AI
Ron J
May 23, 2024, 3:52 AM
−26
points
1
comment
2
min read
LW
link
Quick Thoughts on Our First Sampling Run
jefftk
May 23, 2024, 12:20 AM
29
points
3
comments
2
min read
LW
link
(www.jefftk.com)
AI Safety proposal—Influencing the superintelligence explosion
Morgan
22 May 2024 23:31 UTC
0
points
2
comments
7
min read
LW
link
Implementing Asimov’s Laws of Robotics—How I imagine alignment working.
Joshua Clancy
22 May 2024 23:15 UTC
2
points
0
comments
11
min read
LW
link
Higher-Order Forecasts
ozziegooen
22 May 2024 21:49 UTC
45
points
1
comment
LW
link