Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Why I’m Optimistic About Near-Term AI Risk
harsimony
15 May 2022 23:05 UTC
57
points
27
comments
1
min read
LW
link
[Question]
Definition Practice: Applied Rationality
ChristianKl
15 May 2022 20:44 UTC
13
points
2
comments
1
min read
LW
link
Surviving Automation In The 21st Century—Part 1
George3d6
15 May 2022 19:16 UTC
28
points
16
comments
8
min read
LW
link
(www.epistem.ink)
[Question]
Should we buy Google stock?
Random Trader
15 May 2022 18:38 UTC
0
points
26
comments
1
min read
LW
link
The AI Countdown Clock
River Lewis
15 May 2022 18:37 UTC
42
points
30
comments
2
min read
LW
link
(heytraveler.substack.com)
Is AI Progress Impossible To Predict?
alyssavance
15 May 2022 18:30 UTC
277
points
39
comments
2
min read
LW
link
My Morality
UtilityMonster
15 May 2022 16:34 UTC
3
points
7
comments
3
min read
LW
link
[Question]
Is it possible to implement switching between sequences from its pages?
EniScien
15 May 2022 9:19 UTC
5
points
2
comments
1
min read
LW
link
Gato as the Dawn of Early AGI
David Udell
15 May 2022 6:52 UTC
85
points
29
comments
12
min read
LW
link
[Question]
What’s up with the font size in the Markdown text editor?
Ege Erdil
14 May 2022 21:12 UTC
7
points
1
comment
1
min read
LW
link
[Link post] Promising Paths to Alignment—Connor Leahy | Talk
frances_lorenz
14 May 2022 16:01 UTC
34
points
0
comments
1
min read
LW
link
Inequality is inseparable from markets
NathanBarnard
14 May 2022 13:39 UTC
22
points
7
comments
3
min read
LW
link
Predicting the Elections with Deep Learning—Part 1 - Results
Quentin Chenevier
14 May 2022 12:54 UTC
0
points
0
comments
1
min read
LW
link
Clarifying the confusion around inner alignment
Rauno Arike
13 May 2022 23:05 UTC
29
points
0
comments
11
min read
LW
link
Costs and benefits of amniocentesis for normal pregnancies
braces
13 May 2022 22:47 UTC
13
points
4
comments
3
min read
LW
link
Frame for Take-Off Speeds to inform compute governance & scaling alignment
Logan Riggs
13 May 2022 22:23 UTC
15
points
2
comments
2
min read
LW
link
Alignment as Constraints
Logan Riggs
13 May 2022 22:07 UTC
10
points
0
comments
2
min read
LW
link
How close to nuclear war did we get over Cuba?
NathanBarnard
13 May 2022 19:58 UTC
13
points
0
comments
10
min read
LW
link
Against Time in Agent Models
johnswentworth
13 May 2022 19:55 UTC
62
points
13
comments
3
min read
LW
link
Agency As a Natural Abstraction
Thane Ruthenis
13 May 2022 18:02 UTC
55
points
9
comments
13
min read
LW
link
Fermi estimation of the impact you might have working on AI safety
Fabien Roger
13 May 2022 17:49 UTC
6
points
0
comments
1
min read
LW
link
“Tech company singularities”, and steering them to reduce x-risk
Andrew_Critch
13 May 2022 17:24 UTC
75
points
11
comments
4
min read
LW
link
An observation about Hubinger et al.’s framework for learned optimization
Spencer Becker-Kahn
13 May 2022 16:20 UTC
34
points
9
comments
8
min read
LW
link
[Question]
The Economics of a New Energy Source
hatta_afiq
13 May 2022 14:08 UTC
2
points
13
comments
1
min read
LW
link
[Question]
Still possible to change username?
gabrielrecc
13 May 2022 13:41 UTC
7
points
4
comments
1
min read
LW
link
[Rough notes, BAIS] Human values and cyclical preferences
pranomostro
,
Jayjay
and
Lucie Philippon
13 May 2022 13:28 UTC
5
points
0
comments
4
min read
LW
link
[Question]
Can moderators fix old sequences posts?
EniScien
13 May 2022 12:30 UTC
10
points
1
comment
1
min read
LW
link
DeepMind is hiring for the Scalable Alignment and Alignment Teams
Rohin Shah
and
Geoffrey Irving
13 May 2022 12:17 UTC
150
points
34
comments
9
min read
LW
link
Thoughts on AI Safety Camp
Charlie Steiner
13 May 2022 7:16 UTC
32
points
8
comments
7
min read
LW
link
Deferring
owencb
12 May 2022 23:56 UTC
18
points
2
comments
11
min read
LW
link
RLHF
Ansh Radhakrishnan
12 May 2022 21:18 UTC
18
points
5
comments
5
min read
LW
link
[Question]
What to do when starting a business in an imminent-AGI world?
ryan_b
12 May 2022 21:07 UTC
25
points
7
comments
1
min read
LW
link
Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios
Evan R. Murphy
12 May 2022 20:01 UTC
53
points
0
comments
59
min read
LW
link
Introduction to the sequence: Interpretability Research for the Most Important Century
Evan R. Murphy
12 May 2022 19:59 UTC
16
points
0
comments
8
min read
LW
link
A tentative dialogue with a Friendly-boxed-super-AGI on brain uploads
Ramiro P.
12 May 2022 19:40 UTC
1
point
12
comments
4
min read
LW
link
The Last Paperclip
Logan Zoellner
12 May 2022 19:25 UTC
61
points
15
comments
17
min read
LW
link
Deepmind’s Gato: Generalist Agent
Daniel Kokotajlo
12 May 2022 16:01 UTC
165
points
62
comments
1
min read
LW
link
“A Generalist Agent”: New DeepMind Publication
1a3orn
12 May 2022 15:30 UTC
79
points
43
comments
1
min read
LW
link
Covid 5/12/22: Other Priorities
Zvi
12 May 2022 13:30 UTC
31
points
4
comments
15
min read
LW
link
(thezvi.wordpress.com)
[Question]
How would public media outlets need to be governed to cover all political views?
ChristianKl
12 May 2022 12:55 UTC
13
points
14
comments
1
min read
LW
link
[Question]
What’s keeping concerned capabilities gain researchers from leaving the field?
sovran
12 May 2022 12:16 UTC
19
points
4
comments
1
min read
LW
link
Positive outcomes under an unaligned AGI takeover
Yitz
12 May 2022 7:45 UTC
19
points
10
comments
3
min read
LW
link
[Question]
What are your recommendations for technical AI alignment podcasts?
Evan_Gaensbauer
11 May 2022 21:52 UTC
5
points
4
comments
1
min read
LW
link
Gracefully correcting uncalibrated shame
AF2022
11 May 2022 19:51 UTC
−31
points
34
comments
5
min read
LW
link
[Intro to brain-like-AGI safety] 14. Controlled AGI
Steven Byrnes
11 May 2022 13:17 UTC
41
points
25
comments
19
min read
LW
link
ProjectLawful.com: Eliezer’s latest story, past 1M words
Eliezer Yudkowsky
11 May 2022 6:18 UTC
213
points
112
comments
1
min read
LW
link
4
reviews
An Inside View of AI Alignment
Ansh Radhakrishnan
11 May 2022 2:16 UTC
32
points
2
comments
2
min read
LW
link
Fighting in various places for a really long time
KatjaGrace
11 May 2022 1:50 UTC
36
points
12
comments
4
min read
LW
link
(worldspiritsockpuppet.com)
Stuff I might do if I had covid
KatjaGrace
11 May 2022 0:00 UTC
39
points
9
comments
1
min read
LW
link
(worldspiritsockpuppet.com)
Crises Don’t Need Your Software
GabrielExists
10 May 2022 21:06 UTC
59
points
18
comments
6
min read
LW
link
Back to top
Next