Imperial Russia was doing fine without the Soviets
Davis Kedrosky
5 Jul 2022 22:24 UTC
6
points
3
comments
14
min read
LW
link
(daviskedrosky.substack.com)
A Pattern Language For Rationality
Vaniver
5 Jul 2022 19:08 UTC
75
points
14
comments
15
min read
LW
link
How to destroy the universe with a hypercomputer
Trevor Cappallo
5 Jul 2022 19:05 UTC
2
points
3
comments
1
min read
LW
link
The curious case of Pretty Good human inner/outer alignment
PavleMiha
5 Jul 2022 19:04 UTC
41
points
45
comments
4
min read
LW
link
When is it appropriate to use statistical models and probabilities for decision making ?
Younes Kamel
5 Jul 2022 12:34 UTC
10
points
7
comments
4
min read
LW
link
(youneskamel.substack.com)
Goal Factoring
CFAR!Duncan
5 Jul 2022 7:10 UTC
80
points
2
comments
8
min read
LW
link
Assorted thoughts about abstraction
Adam Zerner
5 Jul 2022 6:40 UTC
16
points
9
comments
7
min read
LW
link
[AN #172] Sorry for the long hiatus!
Rohin Shah
5 Jul 2022 6:20 UTC
54
points
0
comments
3
min read
LW
link
(mailchi.mp)
Outline: The Rectifying of Maps
hamnox
5 Jul 2022 5:14 UTC
7
points
0
comments
2
min read
LW
link
[Question]
Seeking opinions on the current and forward state of cryptocurrencies.
jmh
5 Jul 2022 5:01 UTC
7
points
6
comments
1
min read
LW
link
ITT-passing and civility are good; “charity” is bad; steelmanning is niche
Rob Bensinger
5 Jul 2022 0:15 UTC
161
points
36
comments
6
min read
LW
link
1
review
Please help us communicate AI xrisk. It could save the world.
otto.barten
4 Jul 2022 21:47 UTC
4
points
7
comments
2
min read
LW
link
Benchmark for successful concept extrapolation/avoiding goal misgeneralization
Stuart_Armstrong
4 Jul 2022 20:48 UTC
82
points
12
comments
4
min read
LW
link
Procedural Executive Function, Part 1
DaystarEld
4 Jul 2022 18:51 UTC
33
points
2
comments
13
min read
LW
link
(daystareld.com)
Anthropic’s SoLU (Softmax Linear Unit)
Joel Burget
4 Jul 2022 18:38 UTC
21
points
1
comment
4
min read
LW
link
(transformer-circuits.pub)
Book Review: The Righteous Mind
ErnestScribbler
4 Jul 2022 17:45 UTC
33
points
8
comments
35
min read
LW
link
My Most Likely Reason to Die Young is AI X-Risk
AISafetyIsNotLongtermist
4 Jul 2022 17:08 UTC
61
points
24
comments
4
min read
LW
link
(forum.effectivealtruism.org)
Is General Intelligence “Compact”?
DragonGod
4 Jul 2022 13:27 UTC
27
points
6
comments
22
min read
LW
link
Remaking EfficientZero (as best I can)
Hoagy
4 Jul 2022 11:03 UTC
36
points
9
comments
22
min read
LW
link
We Need a Consolidated List of Bad AI Alignment Solutions
Double
4 Jul 2022 6:54 UTC
9
points
14
comments
1
min read
LW
link
AI Forecasting: One Year In
jsteinhardt
4 Jul 2022 5:10 UTC
132
points
12
comments
6
min read
LW
link
(bounded-regret.ghost.io)
A compressed take on recent disagreements
kman
4 Jul 2022 4:39 UTC
33
points
9
comments
1
min read
LW
link
New US Senate Bill on X-Risk Mitigation [Linkpost]
Evan R. Murphy
4 Jul 2022 1:25 UTC
35
points
12
comments
1
min read
LW
link
(www.hsgac.senate.gov)
Monthly Shorts 6/22
Celer
3 Jul 2022 23:40 UTC
5
points
2
comments
5
min read
LW
link
(keller.substack.com)
Decision theory and dynamic inconsistency
paulfchristiano
3 Jul 2022 22:20 UTC
79
points
33
comments
10
min read
LW
link
(sideways-view.com)
Five routes of access to scientific literature
DirectedEvolution
3 Jul 2022 20:53 UTC
13
points
4
comments
6
min read
LW
link
Toni Kurz and the Insanity of Climbing Mountains
GeneSmith
3 Jul 2022 20:51 UTC
268
points
67
comments
11
min read
LW
link
2
reviews
Wonder and The Golden AI Rule
JeffreyK
3 Jul 2022 18:21 UTC
0
points
4
comments
6
min read
LW
link
Evolution Doesn’t Have Feelings
UtilityMonster
3 Jul 2022 17:13 UTC
−1
points
0
comments
1
min read
LW
link
Nature abhors an immutable replicator… usually
MSRayne
3 Jul 2022 15:08 UTC
28
points
10
comments
3
min read
LW
link
Post hoc justifications as Compression Algorithm
Johannes C. Mayer
3 Jul 2022 5:02 UTC
8
points
0
comments
1
min read
LW
link
SOMA—A story about Consciousness
Johannes C. Mayer
3 Jul 2022 4:46 UTC
10
points
0
comments
1
min read
LW
link
(www.youtube.com)
Sexual self-acceptance
Johannes C. Mayer
3 Jul 2022 4:26 UTC
11
points
6
comments
1
min read
LW
link
Donohue, Levitt, Roe, and Wade: T-minus 20 years to a massive crime wave?
Paul Logan
3 Jul 2022 3:03 UTC
−24
points
6
comments
3
min read
LW
link
(laulpogan.substack.com)
Can we achieve AGI Alignment by balancing multiple human objectives?
Ben Smith
3 Jul 2022 2:51 UTC
11
points
1
comment
4
min read
LW
link
Trigger-Action Planning
CFAR!Duncan
3 Jul 2022 1:42 UTC
81
points
14
comments
13
min read
LW
link
2
reviews
[Question]
Which one of these two academic routes should I take to end up in AI Safety?
Martín Soto
3 Jul 2022 1:05 UTC
5
points
2
comments
1
min read
LW
link
Naive Hypotheses on AI Alignment
Shoshannah Tekofsky
2 Jul 2022 19:03 UTC
98
points
29
comments
5
min read
LW
link
The Tree of Life: Stanford AI Alignment Theory of Change
Gabriel Mukobi
2 Jul 2022 18:36 UTC
24
points
0
comments
14
min read
LW
link
Follow along with Columbia EA’s Advanced AI Safety Fellowship!
RohanS
2 Jul 2022 17:45 UTC
3
points
0
comments
2
min read
LW
link
(forum.effectivealtruism.org)
Welcome to Analogia! (Chapter 7)
Justin Bullock
2 Jul 2022 17:04 UTC
5
points
0
comments
11
min read
LW
link
[Question]
What about transhumans and beyond?
AlignmentMirror
2 Jul 2022 13:58 UTC
7
points
6
comments
1
min read
LW
link
Goal-directedness: tackling complexity
Morgan_Rogers
2 Jul 2022 13:51 UTC
8
points
0
comments
38
min read
LW
link
Literature recommendations July 2022
ChristianKl
2 Jul 2022 9:14 UTC
17
points
9
comments
1
min read
LW
link
Deontological Evil
lsusr
2 Jul 2022 6:57 UTC
38
points
4
comments
2
min read
LW
link
Could an AI Alignment Sandbox be useful?
Michael Soareverix
2 Jul 2022 5:06 UTC
2
points
1
comment
1
min read
LW
link
Five views of Bayes’ Theorem
Adam Scherlis
2 Jul 2022 2:25 UTC
38
points
4
comments
1
min read
LW
link
[Linkpost] Existential Risk Analysis in Empirical Research Papers
Dan H
2 Jul 2022 0:09 UTC
40
points
0
comments
1
min read
LW
link
(arxiv.org)
Agenty AGI – How Tempting?
PeterMcCluskey
1 Jul 2022 23:40 UTC
22
points
3
comments
5
min read
LW
link
(www.bayesianinvestor.com)
AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving
DanielFilan
1 Jul 2022 22:20 UTC
20
points
0
comments
37
min read
LW
link