Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Hide coronavirus posts
RSS
New
Hot
Active
Old
Page
1
All the posts I will never write
Self-Embedded Agent
14 Aug 2022 18:29 UTC
41
points
7
comments
8
min read
LW
link
AI Transparency: Why it’s critical and how to obtain it.
Zohar Jackson
14 Aug 2022 10:31 UTC
6
points
1
comment
5
min read
LW
link
A brief note on Simplicity Bias
Spencer Becker-Kahn
14 Aug 2022 2:05 UTC
6
points
0
comments
4
min read
LW
link
Against Relying on Evolution to Forecast AI Outcomes (Part 1)
Quintin Pope
13 Aug 2022 22:15 UTC
35
points
4
comments
8
min read
LW
link
Cultivating Valiance
Shos Tekofsky
13 Aug 2022 18:47 UTC
29
points
4
comments
4
min read
LW
link
An extended rocket alignment analogy
remember
13 Aug 2022 18:22 UTC
25
points
3
comments
4
min read
LW
link
[Question]
What is an agent in reductionist materialism?
Valentine
13 Aug 2022 15:39 UTC
18
points
15
comments
1
min read
LW
link
Refine’s First Blog Post Day
adamShimi
13 Aug 2022 10:23 UTC
46
points
3
comments
1
min read
LW
link
The Dumbest Possible Gets There First
Artaxerxes
13 Aug 2022 10:20 UTC
33
points
4
comments
2
min read
LW
link
I missed the crux of the alignment problem the whole time
zeshen
13 Aug 2022 10:11 UTC
47
points
5
comments
3
min read
LW
link
goal-program bricks
carado
13 Aug 2022 10:08 UTC
25
points
2
comments
2
min read
LW
link
(carado.moe)
Shapes of Mind and Pluralism in Alignment
adamShimi
13 Aug 2022 10:01 UTC
26
points
1
comment
2
min read
LW
link
How I think about alignment
Linda Linsefors
13 Aug 2022 10:01 UTC
22
points
8
comments
5
min read
LW
link
Steelmining via Analogy
Paul Bricman
13 Aug 2022 9:59 UTC
24
points
0
comments
2
min read
LW
link
(paulbricman.com)
the Insulated Goal-Program idea
carado
13 Aug 2022 9:57 UTC
24
points
2
comments
2
min read
LW
link
(carado.moe)
Appendix: Jargon Dictionary
CFAR!Duncan
13 Aug 2022 8:09 UTC
18
points
4
comments
21
min read
LW
link
Appendix: Hamming Questions
CFAR!Duncan
13 Aug 2022 8:07 UTC
25
points
0
comments
2
min read
LW
link
Appendix: Building a Bugs List prompts
CFAR!Duncan
13 Aug 2022 8:00 UTC
30
points
0
comments
2
min read
LW
link
Gradient descent doesn’t select for inner search
Ivan Vendrov
13 Aug 2022 4:15 UTC
24
points
9
comments
4
min read
LW
link
[Question]
How to bet against civilizational adequacy?
Wei_Dai
12 Aug 2022 23:33 UTC
48
points
13
comments
1
min read
LW
link
Back to top
Next