Tabooing ‘Agent’ for Prosaic Alignment
Hjalmar_Wijk | 23 Aug 2019 2:55 UTC | 57 points | 10 comments | 6 min read
Vaniver’s View on Factored Cognition
Vaniver | 23 Aug 2019 2:54 UTC | 48 points | 4 comments | 8 min read
Redefining Fast Takeoff
VojtaKovarik | 23 Aug 2019 2:15 UTC | 10 points | 1 comment | 1 min read
[Question] Does Agent-like Behavior Imply Agent-like Architecture?
Scott Garrabrant | 23 Aug 2019 2:01 UTC | 69 points | 8 comments | 1 min read
The Commitment Races problem
Daniel Kokotajlo | 23 Aug 2019 1:58 UTC | 159 points | 56 comments | 5 min read
Analysis of a Secret Hitler Scenario
jaek | 23 Aug 2019 1:24 UTC | 16 points | 6 comments | 4 min read
Thoughts from a Two Boxer
jaek | 23 Aug 2019 0:24 UTC | 18 points | 11 comments | 5 min read
Deconfuse Yourself about Agency
VojtaKovarik | 23 Aug 2019 0:21 UTC | 15 points | 9 comments | 4 min read
Logical Optimizers
Donald Hobson | 22 Aug 2019 23:54 UTC | 11 points | 4 comments | 3 min read
Towards a mechanistic understanding of corrigibility
evhub | 22 Aug 2019 23:20 UTC | 47 points | 26 comments | 4 min read
Response to Glen Weyl on Technocracy and the Rationalist Community
John_Maxwell | 22 Aug 2019 23:14 UTC | 66 points | 9 comments | 10 min read
[Question] Why so much variance in human intelligence?
Ben Pace | 22 Aug 2019 22:36 UTC | 65 points | 28 comments | 4 min read
Logical Counterfactuals and Proposition graphs, Part 1
Donald Hobson | 22 Aug 2019 22:06 UTC | 20 points | 0 comments | 3 min read
Time Travel, AI and Transparent Newcomb
johnswentworth | 22 Aug 2019 22:04 UTC | 11 points | 7 comments | 1 min read
Embedded Naive Bayes
johnswentworth | 22 Aug 2019 21:40 UTC | 17 points | 6 comments | 3 min read
Intentional Bucket Errors
Scott Garrabrant | 22 Aug 2019 20:02 UTC | 55 points | 6 comments | 3 min read
Computational Model: Causal Diagrams with Symmetry
johnswentworth | 22 Aug 2019 17:54 UTC | 53 points | 29 comments | 4 min read
[AN #62] Are adversarial examples caused by real but imperceptible features? (mailchi.mp)
Rohin Shah | 22 Aug 2019 17:10 UTC | 28 points | 10 comments | 9 min read
Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jsevillamol and PabloAMC | 22 Aug 2019 10:33 UTC | 24 points | 3 comments | 13 min read
Body Alignment & Balance. Our Midline Anatomy & the Median Plane.
leggi | 22 Aug 2019 10:24 UTC | 15 points | 6 comments | 4 min read
[Question] Simulation Argument: Why aren’t ancestor simulations outnumbered by transhumans?
maximkazhenkov | 22 Aug 2019 9:07 UTC | 9 points | 11 comments | 1 min read
Markets are Universal for Logical Induction
johnswentworth | 22 Aug 2019 6:44 UTC | 77 points | 2 comments | 5 min read
Announcement: Writing Day Today (Thursday)
Ben Pace | 22 Aug 2019 4:48 UTC | 29 points | 5 comments | 1 min read
Western Massachusetts SSC meetup #15
a_lieb | 22 Aug 2019 0:53 UTC | 1 point | 0 comments | 1 min read
Call for contributors to the Alignment Newsletter
Rohin Shah | 21 Aug 2019 18:21 UTC | 39 points | 0 comments | 4 min read
Two senses of “optimizer”
Joar Skalse | 21 Aug 2019 16:02 UTC | 35 points | 41 comments | 3 min read
Paradoxical Advice Thread
Hazard | 21 Aug 2019 14:50 UTC | 13 points | 10 comments | 1 min read
Three Levels of Motivation
DragonGod | 21 Aug 2019 9:24 UTC | 15 points | 1 comment | 7 min read
Odds are not easier
MrMind | 21 Aug 2019 8:34 UTC | 9 points | 6 comments | 1 min read
GPT-2: 6-Month Follow-Up
lifelonglearner | 21 Aug 2019 5:06 UTC | 28 points | 1 comment | 1 min read
Lana Wachowski is doing a new Matrix movie
mako yass | 21 Aug 2019 0:47 UTC | 5 points | 3 comments | 1 min read
[Question] What authors consistently give accurate pictures of complex topics they discuss?
seez | 21 Aug 2019 0:09 UTC | 34 points | 3 comments | 1 min read
[Question] What are the reasons to *not* consider reducing AI-Xrisk the highest priority cause?
David Scott Krueger (formerly: capybaralet) | 20 Aug 2019 21:45 UTC | 29 points | 27 comments | 1 min read
[Question] Has Moore’s Law actually slowed down?
Matthew Barnett | 20 Aug 2019 19:18 UTC | 14 points | 7 comments | 1 min read
Cerebras Systems unveils a record 1.2 trillion transistor chip for AI (venturebeat.com)
avturchin | 20 Aug 2019 14:36 UTC | 7 points | 4 comments | 1 min read
Lisbon SSC Meetup #1
tamkin&popkin | 20 Aug 2019 12:20 UTC | 1 point | 0 comments | 1 min read
Self-supervised learning & manipulative predictions
Steven Byrnes | 20 Aug 2019 10:55 UTC | 18 points | 14 comments | 9 min read
Negative “eeny meeny miny moe”
jefftk | 20 Aug 2019 2:48 UTC | 25 points | 6 comments | 1 min read
Why I Am Not a Technocrat (radicalxchange.org)
Spugpow | 20 Aug 2019 2:06 UTC | −3 points | 4 comments
A misconception about immigration (limerott.com)
limerott | 19 Aug 2019 22:37 UTC | 1 point | 9 comments | 4 min read
[Question] Do We Change Our Minds Less Often Than We Think?
Raemon | 19 Aug 2019 21:37 UTC | 20 points | 5 comments | 1 min read
Classifying specification problems as variants of Goodhart’s Law
Vika | 19 Aug 2019 20:40 UTC | 72 points | 5 comments | 5 min read | 1 review
Unstriving
Jacob Falkovich | 19 Aug 2019 14:31 UTC | 38 points | 7 comments | 6 min read
Goodhart’s Curse and Limitations on AI Alignment
Gordon Seidoh Worley | 19 Aug 2019 7:57 UTC | 25 points | 18 comments | 10 min read
Raph Koster on Virtual Worlds vs Games (notes)
Raemon | 18 Aug 2019 19:01 UTC | 26 points | 8 comments | 2 min read
“Can We Survive Technology” by von Neumann (geosci.uchicago.edu)
Ben Pace | 18 Aug 2019 18:58 UTC | 33 points | 2 comments | 1 min read
Prokaryote Multiverse. An argument that potential simulators do not have significantly more complex physics than ours
mako yass | 18 Aug 2019 4:22 UTC | 0 points | 5 comments | 2 min read
Neural Nets in Python 1
lifelonglearner | 18 Aug 2019 2:48 UTC | 10 points | 3 comments | 8 min read
Inspection Paradox as a Driver of Group Separation
Shmi | 17 Aug 2019 21:47 UTC | 29 points | 0 comments | 1 min read
South Bay Meetup
David Friedman | 17 Aug 2019 19:56 UTC | 1 point | 0 comments