[Question] Are quantum indeterminacy and normal uncertainty meaningfully distinct?
eapi | Mar 30, 2023, 11:48 PM | 8 points | 11 comments | 2 min read
Deference on AI timelines: survey results
Sam Clarke and mccaffary | Mar 30, 2023, 11:03 PM | 25 points | 4 comments | 2 min read
AI #5: Level One Bard
Zvi | Mar 30, 2023, 11:00 PM | 95 points | 9 comments | 47 min read | (thezvi.wordpress.com)
Eliezer’s Videos and More
Johannes C. Mayer | Mar 30, 2023, 10:16 PM | 15 points | 5 comments | 1 min read
We might need to rethink the Hard Reset, aka the AI Pause.
Jonas Kgomo | Mar 30, 2023, 9:38 PM | 2 points | 0 comments | 1 min read
AI-assisted alignment proposals require specific decomposition of capabilities
RobertM | Mar 30, 2023, 9:31 PM | 16 points | 2 comments | 6 min read
The Healing Code of Joan
jdcampolargo | Mar 30, 2023, 9:09 PM | 6 points | 0 comments | 12 min read
[Event] Join Metaculus Tomorrow, March 31st, for Forecast Friday!
ChristianWilliams | Mar 30, 2023, 8:58 PM | 18 points | 2 comments
How To Get Startup Ideas: A Brief Lit Review and Analysis
Adam Zerner | Mar 30, 2023, 8:33 PM | 30 points | 10 comments | 43 min read
Shannon’s Surprising Discovery
johnswentworth | Mar 30, 2023, 8:15 PM | 57 points | 7 comments | 8 min read
Early Results: Do LLMs complete false equations with false equations?
Robert_AIZI | Mar 30, 2023, 8:14 PM | 14 points | 0 comments | 4 min read | (aizi.substack.com)
Arguing all sides with ChatGPT
Richard_Kennaway | Mar 30, 2023, 7:50 PM | 16 points | 1 comment | 8 min read
AGI: Hire Software Engineers—All of Them, Right Now
MGow | Mar 30, 2023, 6:40 PM | −18 points | 3 comments | 1 min read
Burlington, VT—Spring ACX Meetup
Forrest Csuy | Mar 30, 2023, 6:15 PM | 1 point | 1 comment | 1 min read
The 0.2 OOMs/year target
Cleo Nardo | Mar 30, 2023, 6:15 PM | 84 points | 24 comments | 5 min read
ACX Everywhere—Punta Cana (DR)
nsokolsky | Mar 30, 2023, 4:03 PM | 3 points | 0 comments | 1 min read
On the FLI Open Letter
Zvi | Mar 30, 2023, 4:00 PM | 102 points | 11 comments | 22 min read | (thezvi.wordpress.com)
“Dangers of AI and the End of Human Civilization” Yudkowsky on Lex Fridman
DragonGod | Mar 30, 2023, 3:43 PM | 38 points | 33 comments | 1 min read | (www.youtube.com)
How is AI governed and regulated, around the world?
Mitchell_Porter | Mar 30, 2023, 3:36 PM | 15 points | 6 comments | 2 min read
Role Architectures: Applying LLMs to consequential tasks
Eric Drexler | Mar 30, 2023, 3:00 PM | 60 points | 7 comments | 9 min read
Alignment—Path to AI as ally, not slave nor foe
ozb | Mar 30, 2023, 2:54 PM | 10 points | 3 comments | 2 min read
What if our Galaxy isn’t full of AI because we’re in a Neutral Zone between them?
Erlja Jkdf. | Mar 30, 2023, 2:31 PM | −3 points | 0 comments | 1 min read
Imitation Learning from Language Feedback
Jérémy Scheurer, Tomek Korbak and Ethan Perez | Mar 30, 2023, 2:11 PM | 71 points | 3 comments | 10 min read
~100 Interesting Questions
RohanS | Mar 30, 2023, 1:57 PM | 53 points | 18 comments | 9 min read
The AI Shutdown Problem Solution through Commitment to Archiving and Periodic Restoration
avturchin | Mar 30, 2023, 1:17 PM | 16 points | 7 comments | 1 min read
AI and Evolution
Dan H | Mar 30, 2023, 12:56 PM | 27 points | 4 comments | 2 min read | (arxiv.org)
Meme or Die: Modern Societies are Dependent on Emotionally Rich Memes to Rapidly Evolve
monkymind | Mar 30, 2023, 8:59 AM | 11 points | 1 comment | 5 min read
Stop Using Discord as an Archive
Nicholas / Heather Kross | Mar 30, 2023, 2:15 AM | 10 points | 2 comments | 1 min read | (www.reddit.com)
AI Doom Is Not (Only) Disjunctive
NickGabs | Mar 30, 2023, 1:42 AM | 12 points | 0 comments | 5 min read
You Can’t Predict a Game of Pinball
Jeffrey Heninger | Mar 30, 2023, 12:40 AM | 61 points | 13 comments | 6 min read | 1 review | (aiimpacts.org)
Pausing AI Developments Isn’t Enough. We Need to Shut it All Down by Eliezer Yudkowsky
jacquesthibs | Mar 29, 2023, 11:16 PM | 291 points | 297 comments | 3 min read | (time.com)
Othello-GPT: Reflections on the Research Process
Neel Nanda | Mar 29, 2023, 10:13 PM | 38 points | 0 comments | 15 min read | (neelnanda.io)
Othello-GPT: Future Work I Am Excited About
Neel Nanda | Mar 29, 2023, 10:13 PM | 48 points | 2 comments | 33 min read | (neelnanda.io)
Actually, Othello-GPT Has A Linear Emergent World Representation
Neel Nanda | Mar 29, 2023, 10:13 PM | 211 points | 26 comments | 19 min read | (neelnanda.io)
Draft: Detecting optimization
Alex_Altair | Mar 29, 2023, 8:17 PM | 23 points | 2 comments | 6 min read
“Sorcerer’s Apprentice” from Fantasia as an analogy for alignment
awg | Mar 29, 2023, 6:21 PM | 9 points | 4 comments | 1 min read | (video.disney.com)
The Changing Face of Twitter
Zvi | Mar 29, 2023, 5:50 PM | 23 points | 8 comments | 26 min read | (thezvi.wordpress.com)
Nobody’s on the ball on AGI alignment
leopold | Mar 29, 2023, 5:40 PM | 94 points | 38 comments | 9 min read | (www.forourposterity.com)
Want to win the AGI race? Solve alignment.
leopold | Mar 29, 2023, 5:40 PM | 21 points | 3 comments | 5 min read | (www.forourposterity.com)
ChatGPT and Bing Chat can’t play Botticelli
Asha Saavoss | Mar 29, 2023, 5:39 PM | 11 points | 0 comments | 6 min read
The Rationalist Guide to Hinduism
Harsha G. | Mar 29, 2023, 5:03 PM | 25 points | 12 comments | 9 min read | (somestrangeloops.substack.com)
“Unintentional AI safety research”: Why not systematically mine AI technical research for safety purposes?
Jemal Young | Mar 29, 2023, 3:56 PM | 27 points | 3 comments | 6 min read
The open letter
kornai | Mar 29, 2023, 3:09 PM | −21 points | 2 comments | 1 min read
I made AI Risk Propaganda
monkymind | Mar 29, 2023, 2:26 PM | −3 points | 0 comments | 1 min read
Strong Cheap Signals
trevor | Mar 29, 2023, 2:18 PM | 29 points | 3 comments | 2 min read | (betonit.substack.com)
Missing forecasting tools: from catalogs to a new kind of prediction market
MichaelLatowicki | Mar 29, 2023, 9:55 AM | 14 points | 3 comments | 5 min read
Spreadsheet for 200 Concrete Problems In Interpretability
Jay Bailey | Mar 29, 2023, 6:51 AM | 13 points | 0 comments | 1 min read
[Question] Which parts of the existing internet are already likely to be in (GPT-5/other soon-to-be-trained LLMs)’s training corpus?
AnnaSalamon | Mar 29, 2023, 5:17 AM | 49 points | 2 comments | 1 min read
[Question] Are there specific books that it might slightly help alignment to have on the internet?
AnnaSalamon | Mar 29, 2023, 5:08 AM | 77 points | 25 comments | 1 min read
FLI open letter: Pause giant AI experiments
Zach Stein-Perlman | Mar 29, 2023, 4:04 AM | 126 points | 123 comments | 2 min read | (futureoflife.org)