Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Will vague “AI sentience” concerns do more for AI safety than anything else we might do?
Aryeh Englander
14 Jun 2022 23:53 UTC
15
points
2
comments
1
min read
LW
link
Yes, AI research will be substantially curtailed if a lab causes a major disaster
lc
14 Jun 2022 22:17 UTC
104
points
31
comments
2
min read
LW
link
Slow motion videos as AI risk intuition pumps
Andrew_Critch
14 Jun 2022 19:31 UTC
242
points
41
comments
2
min read
LW
link
1
review
Cryptographic Life: How to transcend in a sub-lightspeed world via Homomorphic encryption
Golol
14 Jun 2022 19:22 UTC
1
point
0
comments
3
min read
LW
link
Blake Richards on Why he is Skeptical of Existential Risk from AI
Michaël Trazzi
14 Jun 2022 19:09 UTC
41
points
12
comments
4
min read
LW
link
(theinsideview.ai)
[Question]
How Do You Quantify [Physics Interfacing] Real World Capabilities?
DragonGod
14 Jun 2022 14:49 UTC
17
points
1
comment
4
min read
LW
link
Was the Industrial Revolution The Industrial Revolution?
Davis Kedrosky
14 Jun 2022 14:48 UTC
29
points
0
comments
12
min read
LW
link
(daviskedrosky.substack.com)
Investigating causal understanding in LLMs
Marius Hobbhahn
and
Tom Lieberum
14 Jun 2022 13:57 UTC
28
points
6
comments
13
min read
LW
link
Why multi-agent safety is important
Akbir Khan
14 Jun 2022 9:23 UTC
10
points
2
comments
10
min read
LW
link
[Question]
Was Eliezer Yudkowsky right to give himself 10% to succeed with HPMoR in 2010?
momom2
14 Jun 2022 7:00 UTC
2
points
2
comments
1
min read
LW
link
Resources I send to AI researchers about AI safety
Vael Gates
14 Jun 2022 2:24 UTC
69
points
12
comments
1
min read
LW
link
Vael Gates: Risks from Advanced AI (June 2022)
Vael Gates
14 Jun 2022 0:54 UTC
38
points
2
comments
30
min read
LW
link
Cambridge LW Meetup: Personal Finance
Tony Wang
14 Jun 2022 0:12 UTC
3
points
0
comments
1
min read
LW
link
OpenAI: GPT-based LLMs show ability to discriminate between its own wrong answers, but inability to explain how/why it makes that discrimination, even as model scales
Aditya Jain
13 Jun 2022 23:33 UTC
14
points
5
comments
1
min read
LW
link
(openai.com)
[Question]
Who said something like “The fact that putting 2 apples next to 2 other apples leads to there being 4 apples there has nothing to do with the fact that 2 + 2 = 4”?
hunterglenn
13 Jun 2022 22:23 UTC
1
point
2
comments
1
min read
LW
link
Continuity Assumptions
Jan_Kulveit
13 Jun 2022 21:31 UTC
52
points
13
comments
4
min read
LW
link
Crypto-fed Computation
aaguirre
13 Jun 2022 21:20 UTC
24
points
7
comments
7
min read
LW
link
A Modest Pivotal Act
anonymousaisafety
13 Jun 2022 19:24 UTC
−16
points
1
comment
5
min read
LW
link
Contra EY: Can AGI destroy us without trial & error?
nsokolsky
13 Jun 2022 18:26 UTC
137
points
72
comments
15
min read
LW
link
What are some smaller-but-concrete challenges related to AI safety that are impacting people today?
nonzerosum
13 Jun 2022 17:36 UTC
4
points
3
comments
1
min read
LW
link
[Link] New SEP article on Bayesian Epistemology
Aryeh Englander
13 Jun 2022 15:03 UTC
6
points
0
comments
1
min read
LW
link
Training Trace Priors
Adam Jermyn
13 Jun 2022 14:22 UTC
12
points
17
comments
4
min read
LW
link
[Question]
Can you MRI a deep learning model?
Yair Halberstadt
13 Jun 2022 13:43 UTC
3
points
3
comments
1
min read
LW
link
On A List of Lethalities
Zvi
13 Jun 2022 12:30 UTC
165
points
50
comments
54
min read
LW
link
1
review
(thezvi.wordpress.com)
D&D.Sci June 2022 Evaluation and Ruleset
abstractapplic
13 Jun 2022 10:31 UTC
34
points
11
comments
4
min read
LW
link
[Question]
What’s the “This AI is of moral concern.” fire alarm?
Quintin Pope
13 Jun 2022 8:05 UTC
37
points
56
comments
2
min read
LW
link
The beautiful magical enchanted golden Dall-e Mini is underrated
p.b.
13 Jun 2022 7:58 UTC
14
points
0
comments
1
min read
LW
link
Why so little AI risk on rationalist-adjacent blogs?
Grant Demaree
13 Jun 2022 6:31 UTC
46
points
23
comments
8
min read
LW
link
Code Quality and Rule Consequentialism
Adam Zerner
13 Jun 2022 3:12 UTC
17
points
13
comments
6
min read
LW
link
Grokking “Semi-informative priors over AI timelines”
anson.ho
12 Jun 2022 22:17 UTC
15
points
7
comments
14
min read
LW
link
[Question]
How much does cybersecurity reduce AI risk?
Darmani
12 Jun 2022 22:13 UTC
34
points
23
comments
1
min read
LW
link
[Question]
How are compute assets distributed in the world?
Chris van Merwijk
12 Jun 2022 22:13 UTC
30
points
7
comments
1
min read
LW
link
Intuitive Explanation of AIXI
Thomas Larsen
12 Jun 2022 21:41 UTC
22
points
2
comments
5
min read
LW
link
Why all the fuss about recursive self-improvement?
So8res
12 Jun 2022 20:53 UTC
159
points
62
comments
7
min read
LW
link
1
review
Why the Kaldor-Hicks criterion can be non-transitive
Rupert
12 Jun 2022 17:26 UTC
4
points
10
comments
2
min read
LW
link
[Question]
How do you post links here?
skybrian
12 Jun 2022 16:23 UTC
1
point
1
comment
1
min read
LW
link
[Question]
Filter out tags from the front page?
jaspax
12 Jun 2022 10:59 UTC
9
points
2
comments
1
min read
LW
link
How To: A Workshop (or anything)
Duncan Sabien (Inactive)
12 Jun 2022 8:00 UTC
53
points
13
comments
38
min read
LW
link
1
review
A claim that Google’s LaMDA is sentient
Ben Livengood
12 Jun 2022 4:18 UTC
31
points
133
comments
1
min read
LW
link
[Question]
How much stupider than humans can AI be and still kill us all through sheer numbers and resource access?
Shmi
12 Jun 2022 1:01 UTC
11
points
11
comments
1
min read
LW
link
ELK Proposal—Make the Reporter care about the Predictor’s beliefs
Adam Jermyn
and
Nicholas Schiefer
11 Jun 2022 22:53 UTC
8
points
0
comments
6
min read
LW
link
[Question]
Why has no person / group ever taken over the world?
Aryeh Englander
11 Jun 2022 20:51 UTC
25
points
19
comments
1
min read
LW
link
[Question]
Are there English-speaking meetups in Frankfurt/Munich/Zurich?
Grant Demaree
11 Jun 2022 20:02 UTC
6
points
2
comments
1
min read
LW
link
Beauty and the Beast
Tomás B.
11 Jun 2022 18:59 UTC
48
points
8
comments
6
min read
LW
link
Poorly-Aimed Death Rays
Thane Ruthenis
11 Jun 2022 18:29 UTC
48
points
5
comments
4
min read
LW
link
AGI Safety Communications Initiative
ines
11 Jun 2022 17:34 UTC
7
points
0
comments
1
min read
LW
link
A gaming group for rationality-aware people
dhatas
11 Jun 2022 16:04 UTC
7
points
0
comments
1
min read
LW
link
[Question]
Why don’t you introduce really impressive people you personally know to AI alignment (more often)?
Verden
11 Jun 2022 15:59 UTC
33
points
14
comments
1
min read
LW
link
Godzilla Strategies
johnswentworth
11 Jun 2022 15:44 UTC
166
points
72
comments
3
min read
LW
link
Steganography and the CycleGAN—alignment failure case study
Jan Czechowski
11 Jun 2022 9:41 UTC
34
points
0
comments
4
min read
LW
link
Back to top
Next