[Question]
What is the probability that a superintelligent, sentient AGI is actually infeasible?
Nathan1123
14 Aug 2022 22:41 UTC
−3
points
6
comments
1
min read
LW
link
Dealing With Delusions
adrusi
14 Aug 2022 21:11 UTC
9
points
2
comments
1
min read
LW
link
All the posts I will never write
Alexander Gietelink Oldenziel
14 Aug 2022 18:29 UTC
53
points
8
comments
8
min read
LW
link
Brain-like AGI project “aintelope”
Gunnar_Zarncke
14 Aug 2022 16:33 UTC
54
points
2
comments
1
min read
LW
link
AI Transparency: Why it’s critical and how to obtain it.
Zohar Jackson
14 Aug 2022 10:31 UTC
6
points
1
comment
5
min read
LW
link
A brief note on Simplicity Bias
Spencer Becker-Kahn
14 Aug 2022 2:05 UTC
19
points
0
comments
4
min read
LW
link
Evolution is a bad analogy for AGI: inner alignment
Quintin Pope
13 Aug 2022 22:15 UTC
78
points
15
comments
8
min read
LW
link
An Uncanny Prison
Nathan1123
13 Aug 2022 21:40 UTC
3
points
3
comments
2
min read
LW
link
Florida Elections
Double
13 Aug 2022 20:10 UTC
−3
points
8
comments
1
min read
LW
link
Cultivating Valiance
Shoshannah Tekofsky
13 Aug 2022 18:47 UTC
35
points
4
comments
4
min read
LW
link
An extended rocket alignment analogy
remember
13 Aug 2022 18:22 UTC
28
points
3
comments
4
min read
LW
link
[Question]
The OpenAI playground for GPT-3 is a terrible interface. Is there any great local (or web) app for exploring/learning with language models?
aviv
13 Aug 2022 16:34 UTC
3
points
1
comment
1
min read
LW
link
[Question]
What is an agent in reductionist materialism?
Valentine
13 Aug 2022 15:39 UTC
7
points
15
comments
1
min read
LW
link
Refine’s First Blog Post Day
adamShimi
13 Aug 2022 10:23 UTC
55
points
3
comments
1
min read
LW
link
The Dumbest Possible Gets There First
Artaxerxes
13 Aug 2022 10:20 UTC
44
points
7
comments
2
min read
LW
link
I missed the crux of the alignment problem the whole time
zeshen
13 Aug 2022 10:11 UTC
53
points
7
comments
3
min read
LW
link
goal-program bricks
Tamsin Leake
13 Aug 2022 10:08 UTC
31
points
2
comments
2
min read
LW
link
(carado.moe)
Shapes of Mind and Pluralism in Alignment
adamShimi
13 Aug 2022 10:01 UTC
33
points
2
comments
2
min read
LW
link
How I think about alignment
Linda Linsefors
13 Aug 2022 10:01 UTC
31
points
11
comments
5
min read
LW
link
Steelmining via Analogy
Paul Bricman
13 Aug 2022 9:59 UTC
24
points
0
comments
2
min read
LW
link
(paulbricman.com)
the Insulated Goal-Program idea
Tamsin Leake
13 Aug 2022 9:57 UTC
43
points
4
comments
2
min read
LW
link
(carado.moe)
Appendix: Jargon Dictionary
CFAR!Duncan
13 Aug 2022 8:09 UTC
32
points
5
comments
21
min read
LW
link
Appendix: Hamming Questions
CFAR!Duncan
13 Aug 2022 8:07 UTC
36
points
0
comments
2
min read
LW
link
Building a Bugs List prompts
CFAR!Duncan
13 Aug 2022 8:00 UTC
62
points
9
comments
2
min read
LW
link
Cambridge LW Meetup: Constructive Complaining
Tony Wang
13 Aug 2022 4:52 UTC
2
points
0
comments
1
min read
LW
link
Gradient descent doesn’t select for inner search
Ivan Vendrov
13 Aug 2022 4:15 UTC
47
points
23
comments
4
min read
LW
link
[Question]
How to bet against civilizational adequacy?
Wei Dai
12 Aug 2022 23:33 UTC
54
points
17
comments
1
min read
LW
link
Infant AI Scenario
Nathan1123
12 Aug 2022 21:20 UTC
1
point
0
comments
3
min read
LW
link
DeepMind alignment team opinions on AGI ruin arguments
Vika
12 Aug 2022 21:06 UTC
376
points
37
comments
14
min read
LW
link
1
review
Dissolve: The Petty Crimes of Blaise Pascal
JohnBuridan
12 Aug 2022 20:04 UTC
17
points
4
comments
6
min read
LW
link
The Host Minds of HBO’s Westworld.
Nerret
12 Aug 2022 18:53 UTC
1
point
0
comments
3
min read
LW
link
What is estimational programming? Squiggle in context
Quinn
12 Aug 2022 18:39 UTC
14
points
7
comments
7
min read
LW
link
Oversight Misses 100% of Thoughts The AI Does Not Think
johnswentworth
12 Aug 2022 16:30 UTC
97
points
50
comments
1
min read
LW
link
Timelines explanation post part 1 of ?
Nathan Helm-Burger
12 Aug 2022 16:13 UTC
10
points
1
comment
2
min read
LW
link
A little playing around with Blenderbot3
Nathan Helm-Burger
12 Aug 2022 16:06 UTC
9
points
0
comments
1
min read
LW
link
Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Vika, Vikrant Varma, Ramana Kumar and Mary Phuong
12 Aug 2022 15:17 UTC
85
points
4
comments
3
min read
LW
link
1
review
(vkrakovna.wordpress.com)
Argument by Intellectual Ordeal
lc
12 Aug 2022 13:03 UTC
26
points
5
comments
5
min read
LW
link
Anti-squatted AI x-risk domains index
plex
12 Aug 2022 12:01 UTC
56
points
6
comments
1
min read
LW
link
[Question]
Perfect Predictors
aditya malik
12 Aug 2022 11:51 UTC
2
points
5
comments
1
min read
LW
link
[Question]
What are some good arguments against building new nuclear power plants?
RomanS
12 Aug 2022 7:32 UTC
16
points
15
comments
2
min read
LW
link
Seeking PCK (Pedagogical Content Knowledge)
CFAR!Duncan
12 Aug 2022 4:15 UTC
52
points
11
comments
5
min read
LW
link
Artificial intelligence wireheading
Big Tony
12 Aug 2022 3:06 UTC
5
points
2
comments
1
min read
LW
link
Dissected boxed AI
Nathan1123
12 Aug 2022 2:37 UTC
−8
points
2
comments
1
min read
LW
link
Troll Timers
Screwtape
12 Aug 2022 0:55 UTC
29
points
13
comments
4
min read
LW
link
[Question]
Seriously, what goes wrong with “reward the agent when it makes you smile”?
TurnTrout
11 Aug 2022 22:22 UTC
86
points
42
comments
2
min read
LW
link
Encultured AI Pre-planning, Part 2: Providing a Service
Andrew_Critch and Nick Hay
11 Aug 2022 20:11 UTC
33
points
4
comments
3
min read
LW
link
My summary of the alignment problem
Peter Hroššo
11 Aug 2022 19:42 UTC
16
points
3
comments
2
min read
LW
link
(threadreaderapp.com)
Language models seem to be much better than humans at next-token prediction
Buck, Fabien Roger and LawrenceC
11 Aug 2022 17:45 UTC
182
points
59
comments
13
min read
LW
link
1
review
Introducing Pastcasting: A tool for forecasting practice
Sage Future
11 Aug 2022 17:38 UTC
95
points
10
comments
2
min read
LW
link
2
reviews
Pendulums, Policy-Level Decisionmaking, Saving State
CFAR!Duncan
11 Aug 2022 16:47 UTC
26
points
3
comments
8
min read
LW
link