Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Some Adventures of a Curious Richard Feynman
Dalton Mabery
6 Jul 2022 23:11 UTC
10
points
0
comments
3
min read
LW
link
Cognitive Dissonance on Cognitive Capability
niederman
6 Jul 2022 22:53 UTC
6
points
0
comments
1
min read
LW
link
(maxniederman.com)
Outer vs inner misalignment: three framings
Richard_Ngo
6 Jul 2022 19:46 UTC
49
points
5
comments
9
min read
LW
link
Tarnished Guy who Puts a Num on it
Jacob Falkovich
6 Jul 2022 18:05 UTC
44
points
11
comments
4
min read
LW
link
Deep neural networks are not opaque.
jem-mosig
6 Jul 2022 18:03 UTC
22
points
14
comments
3
min read
LW
link
How humanity would respond to slow takeoff, with takeaways from the entire COVID-19 pandemic
Noosphere89
6 Jul 2022 17:52 UTC
4
points
1
comment
2
min read
LW
link
[Question]
Should you write under a blog or your own name?
Dalton Mabery
6 Jul 2022 15:26 UTC
2
points
2
comments
1
min read
LW
link
Carrying the Torch: A Response to Anna Salamon by the Guild of the Rose
moridinamael
6 Jul 2022 14:20 UTC
133
points
16
comments
6
min read
LW
link
Predicting Parental Emotional Changes?
jefftk
6 Jul 2022 13:50 UTC
39
points
11
comments
2
min read
LW
link
(www.jefftk.com)
Berlin AI Safety Open Meetup July 2022
pranomostro
6 Jul 2022 12:41 UTC
6
points
0
comments
1
min read
LW
link
Forecasting Through Fiction
Yitz
6 Jul 2022 5:03 UTC
5
points
2
comments
8
min read
LW
link
Introducing the Fund for Alignment Research (We’re Hiring!)
AdamGleave
,
Scott Emmons
,
Ethan Perez
and
Claudia Shi
6 Jul 2022 2:07 UTC
62
points
0
comments
4
min read
LW
link
My vision of a good future, part I
Jeffrey Ladish
6 Jul 2022 1:23 UTC
66
points
18
comments
9
min read
LW
link
Imperial Russia was doing fine without the Soviets
Davis Kedrosky
5 Jul 2022 22:24 UTC
6
points
3
comments
14
min read
LW
link
(daviskedrosky.substack.com)
A Pattern Language For Rationality
Vaniver
5 Jul 2022 19:08 UTC
75
points
14
comments
15
min read
LW
link
How to destroy the universe with a hypercomputer
Trevor Cappallo
5 Jul 2022 19:05 UTC
2
points
3
comments
1
min read
LW
link
The curious case of Pretty Good human inner/outer alignment
PavleMiha
5 Jul 2022 19:04 UTC
41
points
45
comments
4
min read
LW
link
When is it appropriate to use statistical models and probabilities for decision making ?
Younes Kamel
5 Jul 2022 12:34 UTC
10
points
7
comments
4
min read
LW
link
(youneskamel.substack.com)
Goal Factoring
CFAR!Duncan
5 Jul 2022 7:10 UTC
80
points
2
comments
8
min read
LW
link
Assorted thoughts about abstraction
Adam Zerner
5 Jul 2022 6:40 UTC
16
points
9
comments
7
min read
LW
link
[AN #172] Sorry for the long hiatus!
Rohin Shah
5 Jul 2022 6:20 UTC
54
points
0
comments
3
min read
LW
link
(mailchi.mp)
Outline: The Rectifying of Maps
hamnox
5 Jul 2022 5:14 UTC
7
points
0
comments
2
min read
LW
link
[Question]
Seeking opinions on the current and forward state of cryptocurrencies.
jmh
5 Jul 2022 5:01 UTC
7
points
6
comments
1
min read
LW
link
ITT-passing and civility are good; “charity” is bad; steelmanning is niche
Rob Bensinger
5 Jul 2022 0:15 UTC
161
points
36
comments
6
min read
LW
link
1
review
Please help us communicate AI xrisk. It could save the world.
otto.barten
4 Jul 2022 21:47 UTC
4
points
7
comments
2
min read
LW
link
Benchmark for successful concept extrapolation/avoiding goal misgeneralization
Stuart_Armstrong
4 Jul 2022 20:48 UTC
82
points
12
comments
4
min read
LW
link
Procedural Executive Function, Part 1
DaystarEld
4 Jul 2022 18:51 UTC
33
points
2
comments
13
min read
LW
link
(daystareld.com)
Anthropic’s SoLU (Softmax Linear Unit)
Joel Burget
4 Jul 2022 18:38 UTC
21
points
1
comment
4
min read
LW
link
(transformer-circuits.pub)
Book Review: The Righteous Mind
ErnestScribbler
4 Jul 2022 17:45 UTC
33
points
8
comments
35
min read
LW
link
My Most Likely Reason to Die Young is AI X-Risk
AISafetyIsNotLongtermist
4 Jul 2022 17:08 UTC
61
points
24
comments
4
min read
LW
link
(forum.effectivealtruism.org)
Is General Intelligence “Compact”?
DragonGod
4 Jul 2022 13:27 UTC
27
points
6
comments
22
min read
LW
link
Remaking EfficientZero (as best I can)
Hoagy
4 Jul 2022 11:03 UTC
36
points
9
comments
22
min read
LW
link
We Need a Consolidated List of Bad AI Alignment Solutions
Double
4 Jul 2022 6:54 UTC
9
points
14
comments
1
min read
LW
link
AI Forecasting: One Year In
jsteinhardt
4 Jul 2022 5:10 UTC
132
points
12
comments
6
min read
LW
link
(bounded-regret.ghost.io)
A compressed take on recent disagreements
kman
4 Jul 2022 4:39 UTC
33
points
9
comments
1
min read
LW
link
New US Senate Bill on X-Risk Mitigation [Linkpost]
Evan R. Murphy
4 Jul 2022 1:25 UTC
35
points
12
comments
1
min read
LW
link
(www.hsgac.senate.gov)
Monthly Shorts 6/22
Celer
3 Jul 2022 23:40 UTC
5
points
2
comments
5
min read
LW
link
(keller.substack.com)
Decision theory and dynamic inconsistency
paulfchristiano
3 Jul 2022 22:20 UTC
79
points
33
comments
10
min read
LW
link
(sideways-view.com)
Five routes of access to scientific literature
DirectedEvolution
3 Jul 2022 20:53 UTC
13
points
4
comments
6
min read
LW
link
Toni Kurz and the Insanity of Climbing Mountains
GeneSmith
3 Jul 2022 20:51 UTC
268
points
67
comments
11
min read
LW
link
2
reviews
Wonder and The Golden AI Rule
JeffreyK
3 Jul 2022 18:21 UTC
0
points
4
comments
6
min read
LW
link
Evolution Doesn’t Have Feelings
UtilityMonster
3 Jul 2022 17:13 UTC
−1
points
0
comments
1
min read
LW
link
Nature abhors an immutable replicator… usually
MSRayne
3 Jul 2022 15:08 UTC
28
points
10
comments
3
min read
LW
link
Post hoc justifications as Compression Algorithm
Johannes C. Mayer
3 Jul 2022 5:02 UTC
8
points
0
comments
1
min read
LW
link
SOMA—A story about Consciousness
Johannes C. Mayer
3 Jul 2022 4:46 UTC
10
points
0
comments
1
min read
LW
link
(www.youtube.com)
Sexual self-acceptance
Johannes C. Mayer
3 Jul 2022 4:26 UTC
11
points
6
comments
1
min read
LW
link
Donohue, Levitt, Roe, and Wade: T-minus 20 years to a massive crime wave?
Paul Logan
3 Jul 2022 3:03 UTC
−24
points
6
comments
3
min read
LW
link
(laulpogan.substack.com)
Can we achieve AGI Alignment by balancing multiple human objectives?
Ben Smith
3 Jul 2022 2:51 UTC
11
points
1
comment
4
min read
LW
link
Trigger-Action Planning
CFAR!Duncan
3 Jul 2022 1:42 UTC
81
points
14
comments
13
min read
LW
link
2
reviews
[Question]
Which one of these two academic routes should I take to end up in AI Safety?
Martín Soto
3 Jul 2022 1:05 UTC
5
points
2
comments
1
min read
LW
link
Back to top
Next