Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Situational awareness (Section 2.1 of “Scheming AIs”)
Joe Carlsmith
Nov 26, 2023, 11:00 PM
10
points
5
comments
8
min read
LW
link
AXRP Episode 26 - AI Governance with Elizabeth Seger
DanielFilan
Nov 26, 2023, 11:00 PM
14
points
0
comments
66
min read
LW
link
Solving Two-Sided Adverse Selection with Prediction Market Matchmaking
Saul Munn
Nov 26, 2023, 8:10 PM
16
points
7
comments
4
min read
LW
link
(www.brasstacks.blog)
Wikipedia is not so great, and what can be done about it.
euserx
Nov 26, 2023, 7:13 PM
0
points
27
comments
16
min read
LW
link
(forum.effectivealtruism.org)
[Question]
Help me solve this problem: The basilisk isn’t real, but people are
canary_itm
Nov 26, 2023, 5:44 PM
−19
points
4
comments
1
min read
LW
link
Twin Cities ACX Meetup—December 2023
Timothy M.
Nov 26, 2023, 5:32 PM
1
point
1
comment
1
min read
LW
link
Spaced repetition for teaching two-year olds how to read (Interview)
Chipmonk
Nov 26, 2023, 4:52 PM
48
points
9
comments
5
min read
LW
link
(chipmonk.substack.com)
Paper out now on creatine and cognitive performance
Fabienne
Nov 26, 2023, 10:58 AM
59
points
2
comments
1
min read
LW
link
Why Q*, if real, might be a game changer
Shmi
Nov 26, 2023, 6:12 AM
5
points
6
comments
1
min read
LW
link
Moral Reality Check (a short story)
jessicata
Nov 26, 2023, 5:03 AM
149
points
45
comments
21
min read
LW
link
1
review
(unstableontology.com)
Accounting for Foregone Pay
jefftk
Nov 26, 2023, 3:30 AM
11
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Corrigibility or DWIM is an attractive primary goal for AGI
Seth Herd
Nov 25, 2023, 7:37 PM
19
points
4
comments
1
min read
LW
link
On “slack” in training (Section 1.5 of “Scheming AIs”)
Joe Carlsmith
Nov 25, 2023, 5:51 PM
1
point
0
comments
5
min read
LW
link
Announcing New Beginner-friendly Book on AI Safety and Risk
Darren McKee
Nov 25, 2023, 3:57 PM
74
points
3
comments
LW
link
Fertility as Metascience
Maxwell Tabarrok
Nov 25, 2023, 3:42 PM
20
points
1
comment
3
min read
LW
link
(maximumprogress.substack.com)
Reaction to “Empowerment is (almost) All We Need” : an open-ended alternative
Ryo
Nov 25, 2023, 3:35 PM
9
points
3
comments
5
min read
LW
link
How Microsoft’s ruthless employee evaluation system annihilated team collaboration.
positivesum
Nov 25, 2023, 1:25 PM
3
points
2
comments
1
min read
LW
link
(tryingtruly.substack.com)
What are the results of more parental supervision and less outdoor play?
juliawise
Nov 25, 2023, 12:52 PM
228
points
31
comments
5
min read
LW
link
A simple treacherous turn demonstration
Nikola Jurkovic
Nov 25, 2023, 4:51 AM
22
points
5
comments
3
min read
LW
link
The two paragraph argument for AI risk
CronoDAS
Nov 25, 2023, 2:01 AM
19
points
8
comments
1
min read
LW
link
Goodhart’s Law Example: Training Verifiers to Solve Math Word Problems
Chris_Leong
Nov 25, 2023, 12:53 AM
27
points
2
comments
1
min read
LW
link
(arxiv.org)
Some thoughts on CBDC
PixelatedPenguin
Nov 25, 2023, 12:32 AM
−1
points
1
comment
1
min read
LW
link
Testing for consequence-blindness in LLMs using the HI-ADS unit test.
David Scott Krueger (formerly: capybaralet)
Nov 24, 2023, 11:35 PM
25
points
2
comments
2
min read
LW
link
Epoch is hiring an ML Distributed Systems Senior Researcher
merilalama
and
Jaime Sevilla Molina
Nov 24, 2023, 10:33 PM
2
points
0
comments
4
min read
LW
link
(careers.rethinkpriorities.org)
Article Discussion And Free Pizza—St Paul
25Hour
Nov 24, 2023, 9:02 PM
1
point
0
comments
1
min read
LW
link
Why focus on schemers in particular (Sections 1.3 and 1.4 of “Scheming AIs”)
Joe Carlsmith
Nov 24, 2023, 7:18 PM
8
points
0
comments
22
min read
LW
link
Surviving and Shaping Long-Term Competitions: Lessons from Net Assessment
Gentzel
and
ihavenoahidea
Nov 24, 2023, 6:18 PM
5
points
0
comments
13
min read
LW
link
Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense
So8res
Nov 24, 2023, 5:37 PM
197
points
84
comments
5
min read
LW
link
1
review
The Limitations of GPT-4
p.b.
Nov 24, 2023, 3:30 PM
27
points
12
comments
4
min read
LW
link
Progress links digest, 2023-11-24: Bottlenecks of aging, Starship launches, and much more
jasoncrawford
Nov 24, 2023, 3:25 PM
40
points
1
comment
14
min read
LW
link
(rootsofprogress.org)
[Question]
What’s the evidence that LLMs will scale up efficiently beyond GPT4? i.e. couldn’t GPT5, etc., be very inefficient?
M. Y. Zuo
Nov 24, 2023, 3:22 PM
9
points
6
comments
1
min read
LW
link
Sapience, understanding, and “AGI”
Seth Herd
Nov 24, 2023, 3:13 PM
15
points
3
comments
6
min read
LW
link
Insulate your ideas
Logan Kieller
Nov 24, 2023, 2:08 PM
18
points
5
comments
2
min read
LW
link
(logankieller.substack.com)
Bordeaux, Gironde, France – irregular ACX Meetup 2023-12-09
vi21maobk9vp
Nov 24, 2023, 11:17 AM
5
points
1
comment
1
min read
LW
link
[Question]
A Question For People Who Believe In God
yanni kyriacos
Nov 24, 2023, 5:22 AM
3
points
38
comments
1
min read
LW
link
[Question]
First and Last Questions for GPT-5*
Mitchell_Porter
Nov 24, 2023, 5:03 AM
15
points
5
comments
1
min read
LW
link
4. A Moral Case for Evolved-Sapience-Chauvinism
RogerDearnaley
Nov 24, 2023, 4:56 AM
10
points
0
comments
4
min read
LW
link
Detecting What’s Been Seen
jefftk
Nov 24, 2023, 3:30 AM
23
points
0
comments
2
min read
LW
link
(www.jefftk.com)
[Question]
Help to find a blog I don’t remember the name of
JavierCC
Nov 23, 2023, 10:49 PM
3
points
2
comments
1
min read
LW
link
[Question]
What did you change your mind about in the last year?
mike_hawke
Nov 23, 2023, 8:53 PM
41
points
16
comments
1
min read
LW
link
A few Superhuman examples of Superaligned Superintelligence from Google Bard (Thanksgiving 2023)
bionicles
and
bionalexhoward
Nov 23, 2023, 7:06 PM
−9
points
1
comment
17
min read
LW
link
Prepsgiving, A Convergently Instrumental Human Practice
JenniferRM
Nov 23, 2023, 5:24 PM
39
points
0
comments
8
min read
LW
link
AI #39: The Week of OpenAI
Zvi
Nov 23, 2023, 3:10 PM
67
points
8
comments
28
min read
LW
link
(thezvi.wordpress.com)
3. Uploading
RogerDearnaley
Nov 23, 2023, 7:39 AM
21
points
5
comments
8
min read
LW
link
2. AIs as Economic Agents
RogerDearnaley
Nov 23, 2023, 7:07 AM
9
points
2
comments
6
min read
LW
link
Thomas Kwa’s research journal
Thomas Kwa
and
Adrià Garriga-alonso
Nov 23, 2023, 5:11 AM
79
points
1
comment
6
min read
LW
link
Never Drop A Ball
Screwtape
Nov 23, 2023, 4:15 AM
101
points
8
comments
6
min read
LW
link
1
review
Possible OpenAI’s Q* breakthrough and DeepMind’s AlphaGo-type systems plus LLMs
Burny
Nov 23, 2023, 3:16 AM
37
points
25
comments
2
min read
LW
link
Boston Secular Solstice: Call for Singers and Musicans
jefftk
Nov 23, 2023, 2:40 AM
16
points
2
comments
1
min read
LW
link
(www.jefftk.com)
My Mental Model of Infohazards
MadHatter
Nov 23, 2023, 2:37 AM
8
points
34
comments
2
min read
LW
link
1
review
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel