Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
[Question]
What is wrong with this approach to corrigibility?
Rafael Cosman
12 Jul 2022 22:55 UTC
7
points
8
comments
1
min read
LW
link
Acceptability Verification: A Research Agenda
David Udell
and
evhub
12 Jul 2022 20:11 UTC
50
points
0
comments
1
min read
LW
link
(docs.google.com)
Progress links and tweets, 2022-07-12
jasoncrawford
12 Jul 2022 15:30 UTC
12
points
0
comments
1
min read
LW
link
(rootsofprogress.org)
Response to Blake Richards: AGI, generality, alignment, & loss functions
Steven Byrnes
12 Jul 2022 13:56 UTC
62
points
9
comments
15
min read
LW
link
Three Minimum Pivotal Acts Possible by Narrow AI
Michael Soareverix
12 Jul 2022 9:51 UTC
0
points
4
comments
2
min read
LW
link
Mosaic and Palimpsests: Two Shapes of Research
adamShimi
12 Jul 2022 9:05 UTC
39
points
3
comments
9
min read
LW
link
[Question]
How do you concisely communicate & navigate the politics / culture at your job working at a large corporation or institution?
Willa
12 Jul 2022 3:22 UTC
10
points
6
comments
1
min read
LW
link
On how various plans miss the hard bits of the alignment challenge
So8res
12 Jul 2022 2:49 UTC
302
points
88
comments
29
min read
LW
link
3
reviews
Rainmaking
WalterL
12 Jul 2022 0:42 UTC
25
points
5
comments
1
min read
LW
link
(www.youtube.com)
Book Review: Neal Stephenson’s “Termination Shock”
Tyler Simmons
12 Jul 2022 0:07 UTC
13
points
0
comments
30
min read
LW
link
(www.words-and-dirt.com)
Announcing Future Forum—Apply Now
wANIEL
and
freemany
11 Jul 2022 22:57 UTC
8
points
0
comments
4
min read
LW
link
(forum.effectivealtruism.org)
Defining Optimization in a Deeper Way Part 2
J Bostock
11 Jul 2022 20:29 UTC
7
points
0
comments
4
min read
LW
link
Marriage, the Giving What We Can Pledge, and the damage caused by vague public commitments
Jeffrey Ladish
11 Jul 2022 19:38 UTC
98
points
27
comments
6
min read
LW
link
1
review
Systemization
CFAR!Duncan
11 Jul 2022 18:39 UTC
40
points
5
comments
12
min read
LW
link
How Can I Maximize My Happiness?
UtilityMonster
11 Jul 2022 17:40 UTC
6
points
2
comments
6
min read
LW
link
[Question]
How do AI timelines affect how you live your life?
Quadratic Reciprocity
11 Jul 2022 13:54 UTC
80
points
50
comments
1
min read
LW
link
Cambridge LW Meetup: Free Speech
Darmani
11 Jul 2022 4:36 UTC
7
points
0
comments
1
min read
LW
link
Checksum Sensor Alignment
lsusr
11 Jul 2022 3:31 UTC
12
points
2
comments
1
min read
LW
link
The Alignment Problem
lsusr
11 Jul 2022 3:03 UTC
46
points
18
comments
3
min read
LW
link
Immanuel Kant and the Decision Theory App Store
Daniel Kokotajlo
10 Jul 2022 16:04 UTC
88
points
12
comments
5
min read
LW
link
Metaculus is seeking experienced leaders, researchers & operators for high-impact roles
ChristianWilliams
10 Jul 2022 14:27 UTC
9
points
0
comments
1
min read
LW
link
(apply.workable.com)
Avoid the abbreviation “FLOPs” – use “FLOP” or “FLOP/s” instead
Daniel_Eth
10 Jul 2022 10:44 UTC
69
points
13
comments
1
min read
LW
link
My Opportunity Costs
abstractapplic
10 Jul 2022 10:14 UTC
21
points
3
comments
3
min read
LW
link
Why Portland
Adam Zerner
10 Jul 2022 7:20 UTC
25
points
18
comments
9
min read
LW
link
Hessian and Basin volume
Vivek Hebbar
10 Jul 2022 6:59 UTC
35
points
10
comments
4
min read
LW
link
Taste & Shaping
CFAR!Duncan
10 Jul 2022 5:50 UTC
64
points
1
comment
16
min read
LW
link
Comment on “Propositions Concerning Digital Minds and Society”
Zack_M_Davis
10 Jul 2022 5:48 UTC
99
points
12
comments
8
min read
LW
link
Heaven: The last part of dystopia
Existism
9 Jul 2022 22:36 UTC
−1
points
1
comment
6
min read
LW
link
Hope Can = Heaven
Existism
9 Jul 2022 22:35 UTC
−2
points
0
comments
3
min read
LW
link
Report from a civilizational observer on Earth
owencb
9 Jul 2022 17:26 UTC
49
points
12
comments
6
min read
LW
link
Grouped Loss may disfavor discontinuous capabilities
Adam Jermyn
9 Jul 2022 17:22 UTC
14
points
2
comments
4
min read
LW
link
Train first VS prune first in neural networks.
Donald Hobson
9 Jul 2022 15:53 UTC
20
points
5
comments
2
min read
LW
link
Visualizing Neural networks, how to blame the bias
Donald Hobson
9 Jul 2022 15:52 UTC
7
points
1
comment
6
min read
LW
link
Using Ngram to estimate depression prevalence over time
David Gross
9 Jul 2022 14:57 UTC
10
points
3
comments
2
min read
LW
link
(www.pnas.org)
Making it harder for an AGI to “trick” us, with STVs
Tor Økland Barstad
9 Jul 2022 14:42 UTC
15
points
5
comments
22
min read
LW
link
Ars D&D.sci: Mysteries of Mana
aphyer
9 Jul 2022 12:19 UTC
35
points
13
comments
3
min read
LW
link
[Question]
I’ve become a medical mystery and I don’t know how to effectively get help
CraigMichael
9 Jul 2022 6:58 UTC
30
points
53
comments
2
min read
LW
link
Some thoughts on Animals
nitinkhanna
9 Jul 2022 2:11 UTC
2
points
6
comments
2
min read
LW
link
Changes in Community Dynamics: A Follow-Up to ‘The Berkeley Community & the Rest of Us’
Evan_Gaensbauer
9 Jul 2022 1:44 UTC
21
points
6
comments
4
min read
LW
link
MATS Models
johnswentworth
9 Jul 2022 0:14 UTC
86
points
5
comments
16
min read
LW
link
Research Notes: What are we aligning for?
Shoshannah Tekofsky
8 Jul 2022 22:13 UTC
19
points
8
comments
2
min read
LW
link
[Question]
What New Desktop Should I Buy?
Zvi
8 Jul 2022 15:04 UTC
15
points
19
comments
1
min read
LW
link
Being a donor for Fecal Microbiota Transplants (FMT): Do good & earn easy money (up to 180k/y)
Anton Rodenhauser
8 Jul 2022 6:17 UTC
36
points
26
comments
8
min read
LW
link
(forum.effectivealtruism.org)
User research as a barometer of software design
Adam Zerner
8 Jul 2022 6:02 UTC
31
points
13
comments
3
min read
LW
link
Reinforcement Learner Wireheading
Nate Showell
8 Jul 2022 5:32 UTC
8
points
2
comments
3
min read
LW
link
Exposition as science: some ideas for how to make progress
riceissa
8 Jul 2022 1:29 UTC
21
points
1
comment
8
min read
LW
link
In Search of Strategic Clarity
james.lucassen
8 Jul 2022 0:52 UTC
9
points
1
comment
5
min read
LW
link
(jlucassen.com)
Unbounded Intelligence Lottery
kman
7 Jul 2022 23:28 UTC
4
points
11
comments
1
min read
LW
link
How to Become a World Historical Figure (Péladan’s Dream)
rogersbacon
7 Jul 2022 22:39 UTC
21
points
3
comments
30
min read
LW
link
(www.secretorum.life)
Safety considerations for online generative modeling
Sam Marks
7 Jul 2022 18:31 UTC
42
points
9
comments
14
min read
LW
link
Back to top
Next