My techno-optimism [By Vitalik Buterin]
habryka
Nov 27, 2023, 11:53 PM
107
points
17
comments
2
min read
LW
link
(www.lesswrong.com)
[Question]
Could Germany have won World War I with high probability given the benefit of hindsight?
Roko
Nov 27, 2023, 10:52 PM
10
points
18
comments
1
min read
LW
link
[Question]
Could World War I have been prevented given the benefit of hindsight?
Roko
Nov 27, 2023, 10:39 PM
16
points
8
comments
1
min read
LW
link
AISC 2024 - Project Summaries
NickyP
Nov 27, 2023, 10:32 PM
48
points
3
comments
18
min read
LW
link
“Epistemic range of motion” and LessWrong moderation
habryka
and
Gabriel Alfour
Nov 27, 2023, 9:58 PM
65
points
3
comments
12
min read
LW
link
Apply to the Conceptual Boundaries Workshop for AI Safety
Chipmonk
Nov 27, 2023, 9:04 PM
50
points
0
comments
3
min read
LW
link
There is no IQ for AI
Gabriel Alfour
Nov 27, 2023, 6:21 PM
30
points
10
comments
9
min read
LW
link
(cognition.cafe)
Two concepts of an “episode” (Section 2.2.1 of “Scheming AIs”)
Joe Carlsmith
Nov 27, 2023, 6:01 PM
19
points
1
comment
13
min read
LW
link
[Linkpost] George Mack’s Razors
trevor
Nov 27, 2023, 5:53 PM
38
points
8
comments
3
min read
LW
link
(twitter.com)
On possible cross-fertilization between AI and neuroscience [Creativity]
Bill Benzon
Nov 27, 2023, 4:50 PM
15
points
22
comments
7
min read
LW
link
Ethicophysics I
MadHatter
Nov 27, 2023, 3:44 PM
−1
points
16
comments
1
min read
LW
link
(open.substack.com)
Sentience Institute 2023 End of Year Summary
michael_dello
Nov 27, 2023, 12:11 PM
11
points
0
comments
5
min read
LW
link
(www.sentienceinstitute.org)
[Question]
A Question about Corrigibility (2015)
A.H.
Nov 27, 2023, 12:05 PM
4
points
2
comments
1
min read
LW
link
Appendices to the live agendas
technicalities
and
Stag
Nov 27, 2023, 11:10 AM
16
points
4
comments
1
min read
LW
link
Shallow review of live agendas in alignment & safety
technicalities
and
Stag
Nov 27, 2023, 11:10 AM
348
points
73
comments
29
min read
LW
link
1
review
Napoleon stole the Roman Inquisition archives and investigated the Galileo case
Meow P
Nov 27, 2023, 9:41 AM
−3
points
0
comments
1
min read
LW
link
(www.cricetuscricetus.co.uk)
Found Paper: “FDT in an evolutionary environment”
the gears to ascension
Nov 27, 2023, 5:27 AM
30
points
47
comments
1
min read
LW
link
(arxiv.org)
[Question]
why did OpenAI employees sign
bhauth
Nov 27, 2023, 5:21 AM
49
points
23
comments
1
min read
LW
link
Unknown Probabilities
transhumanist_atom_understander
Nov 27, 2023, 2:30 AM
22
points
1
comment
4
min read
LW
link
Justification for Induction
Krantz
Nov 27, 2023, 2:05 AM
2
points
25
comments
5
min read
LW
link
Situational awareness (Section 2.1 of “Scheming AIs”)
Joe Carlsmith
Nov 26, 2023, 11:00 PM
10
points
5
comments
8
min read
LW
link
AXRP Episode 26 - AI Governance with Elizabeth Seger
DanielFilan
Nov 26, 2023, 11:00 PM
14
points
0
comments
66
min read
LW
link
Solving Two-Sided Adverse Selection with Prediction Market Matchmaking
Saul Munn
Nov 26, 2023, 8:10 PM
16
points
7
comments
4
min read
LW
link
(www.brasstacks.blog)
Wikipedia is not so great, and what can be done about it.
euserx
Nov 26, 2023, 7:13 PM
0
points
27
comments
16
min read
LW
link
(forum.effectivealtruism.org)
[Question]
Help me solve this problem: The basilisk isn’t real, but people are
canary_itm
Nov 26, 2023, 5:44 PM
−19
points
4
comments
1
min read
LW
link
Twin Cities ACX Meetup—December 2023
Timothy M.
Nov 26, 2023, 5:32 PM
1
point
1
comment
1
min read
LW
link
Spaced repetition for teaching two-year olds how to read (Interview)
Chipmonk
Nov 26, 2023, 4:52 PM
48
points
9
comments
5
min read
LW
link
(chipmonk.substack.com)
Paper out now on creatine and cognitive performance
Fabienne
Nov 26, 2023, 10:58 AM
59
points
2
comments
1
min read
LW
link
Why Q*, if real, might be a game changer
Shmi
Nov 26, 2023, 6:12 AM
5
points
6
comments
1
min read
LW
link
Moral Reality Check (a short story)
jessicata
Nov 26, 2023, 5:03 AM
149
points
45
comments
21
min read
LW
link
1
review
(unstableontology.com)
Accounting for Foregone Pay
jefftk
Nov 26, 2023, 3:30 AM
11
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Corrigibility or DWIM is an attractive primary goal for AGI
Seth Herd
Nov 25, 2023, 7:37 PM
19
points
4
comments
1
min read
LW
link
On “slack” in training (Section 1.5 of “Scheming AIs”)
Joe Carlsmith
Nov 25, 2023, 5:51 PM
1
point
0
comments
5
min read
LW
link
Announcing New Beginner-friendly Book on AI Safety and Risk
Darren McKee
Nov 25, 2023, 3:57 PM
74
points
3
comments
LW
link
Fertility as Metascience
Maxwell Tabarrok
Nov 25, 2023, 3:42 PM
20
points
1
comment
3
min read
LW
link
(maximumprogress.substack.com)
Reaction to “Empowerment is (almost) All We Need” : an open-ended alternative
Ryo
Nov 25, 2023, 3:35 PM
9
points
3
comments
5
min read
LW
link
How Microsoft’s ruthless employee evaluation system annihilated team collaboration.
positivesum
Nov 25, 2023, 1:25 PM
3
points
2
comments
1
min read
LW
link
(tryingtruly.substack.com)
What are the results of more parental supervision and less outdoor play?
juliawise
Nov 25, 2023, 12:52 PM
228
points
31
comments
5
min read
LW
link
A simple treacherous turn demonstration
Nikola Jurkovic
Nov 25, 2023, 4:51 AM
22
points
5
comments
3
min read
LW
link
The two paragraph argument for AI risk
CronoDAS
Nov 25, 2023, 2:01 AM
19
points
8
comments
1
min read
LW
link
Goodhart’s Law Example: Training Verifiers to Solve Math Word Problems
Chris_Leong
Nov 25, 2023, 12:53 AM
27
points
2
comments
1
min read
LW
link
(arxiv.org)
Some thoughts on CBDC
PixelatedPenguin
Nov 25, 2023, 12:32 AM
−1
points
1
comment
1
min read
LW
link
Testing for consequence-blindness in LLMs using the HI-ADS unit test.
David Scott Krueger (formerly: capybaralet)
Nov 24, 2023, 11:35 PM
25
points
2
comments
2
min read
LW
link
Epoch is hiring an ML Distributed Systems Senior Researcher
merilalama
and
Jaime Sevilla Molina
Nov 24, 2023, 10:33 PM
2
points
0
comments
4
min read
LW
link
(careers.rethinkpriorities.org)
Article Discussion And Free Pizza—St Paul
25Hour
Nov 24, 2023, 9:02 PM
1
point
0
comments
1
min read
LW
link
Why focus on schemers in particular (Sections 1.3 and 1.4 of “Scheming AIs”)
Joe Carlsmith
Nov 24, 2023, 7:18 PM
8
points
0
comments
22
min read
LW
link
Surviving and Shaping Long-Term Competitions: Lessons from Net Assessment
Gentzel
and
ihavenoahidea
Nov 24, 2023, 6:18 PM
5
points
0
comments
13
min read
LW
link
Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense
So8res
Nov 24, 2023, 5:37 PM
197
points
84
comments
5
min read
LW
link
1
review
The Limitations of GPT-4
p.b.
Nov 24, 2023, 3:30 PM UTC
27
points
12
comments
4
min read
LW
link
Progress links digest, 2023-11-24: Bottlenecks of aging, Starship launches, and much more
jasoncrawford
Nov 24, 2023, 3:25 PM UTC
40
points
1
comment
14
min read
LW
link
(rootsofprogress.org)