Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Consciousness as recurrence, potential for enforcing alignment?
Foyle
Apr 18, 2023, 11:05 PM
−2
points
6
comments
1
min read
LW
link
Encouraging New Users To Bet On Their Beliefs
YafahEdelman
Apr 18, 2023, 10:10 PM
49
points
8
comments
2
min read
LW
link
AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media
ozhang
,
Dan H
and
Orpheus16
Apr 18, 2023, 6:44 PM
30
points
0
comments
4
min read
LW
link
(newsletter.safe.ai)
Scientism vs. people
Roman Leventov
Apr 18, 2023, 5:28 PM
4
points
4
comments
11
min read
LW
link
Capabilities and alignment of LLM cognitive architectures
Seth Herd
Apr 18, 2023, 4:29 PM
88
points
18
comments
20
min read
LW
link
World and Mind in Artificial Intelligence: arguments against the AI pause
Arturo Macias
Apr 18, 2023, 2:40 PM
1
point
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Slowing AI: Interventions
Zach Stein-Perlman
Apr 18, 2023, 2:30 PM
19
points
0
comments
5
min read
LW
link
Cryptographic and auxiliary approaches relevant for AI safety
Allison Duettmann
Apr 18, 2023, 2:18 PM
7
points
0
comments
6
min read
LW
link
The Overemployed Via ChatGPT
Zvi
Apr 18, 2023, 1:40 PM
58
points
7
comments
6
min read
LW
link
(thezvi.wordpress.com)
[Linkpost] AI Alignment, Explained in 5 Points (updated)
Daniel_Eth
Apr 18, 2023, 8:09 AM
10
points
0
comments
LW
link
Argentines LW/SSC/EA/MIRIx—Call to All
daviddelauba
Apr 18, 2023, 6:37 AM
1
point
0
comments
1
min read
LW
link
No, really, it predicts next tokens.
simon
Apr 18, 2023, 3:47 AM
58
points
55
comments
3
min read
LW
link
The basic reasons I expect AGI ruin
Rob Bensinger
Apr 18, 2023, 3:37 AM
189
points
73
comments
14
min read
LW
link
High schoolers can apply to the Atlas Fellowship: $10k scholarship + 11-day program
Ronny Fernandez
and
Jonas V
Apr 18, 2023, 2:53 AM
26
points
0
comments
3
min read
LW
link
Green goo is plausible
anithite
Apr 18, 2023, 12:04 AM
67
points
31
comments
4
min read
LW
link
1
review
AI Impacts Quarterly Newsletter, Jan-Mar 2023
Harlan
Apr 17, 2023, 10:10 PM
5
points
0
comments
3
min read
LW
link
(blog.aiimpacts.org)
[Question]
How do you align your emotions through updates and existential uncertainty?
VojtaKovarik
Apr 17, 2023, 8:46 PM
4
points
10
comments
1
min read
LW
link
AI Alignment Research Engineer Accelerator (ARENA): call for applicants
CallumMcDougall
Apr 17, 2023, 8:30 PM
100
points
9
comments
7
min read
LW
link
AI policy ideas: Reading list
Zach Stein-Perlman
Apr 17, 2023, 7:00 PM
24
points
7
comments
4
min read
LW
link
NYT: The Surprising Thing A.I. Engineers Will Tell You if You Let Them
Sodium
Apr 17, 2023, 6:59 PM
11
points
2
comments
1
min read
LW
link
(www.nytimes.com)
But why would the AI kill us?
So8res
Apr 17, 2023, 6:42 PM
140
points
96
comments
2
min read
LW
link
Sama Says the Age of Giant AI Models is Already Over
Algon
Apr 17, 2023, 6:36 PM
49
points
12
comments
1
min read
LW
link
(www.wired.com)
Meetup Tip: Conversation Starters
Screwtape
Apr 17, 2023, 6:25 PM
20
points
1
comment
3
min read
LW
link
Critiques of prominent AI safety labs: Redwood Research
Omega.
Apr 17, 2023, 6:20 PM
4
points
0
comments
22
min read
LW
link
(forum.effectivealtruism.org)
How Large Language Models Nuke our Naive Notions of Truth and Reality
Sean Lee
Apr 17, 2023, 6:08 PM
0
points
23
comments
11
min read
LW
link
An alternative of PPO towards alignment
ml hkust
Apr 17, 2023, 5:58 PM
2
points
2
comments
4
min read
LW
link
What I learned at the AI Safety Europe Retreat
skaisg
Apr 17, 2023, 5:40 PM
28
points
0
comments
10
min read
LW
link
(skaisg.eu)
What is your timelines for ADI (artificial disempowering intelligence)?
Christopher King
Apr 17, 2023, 5:01 PM
3
points
3
comments
2
min read
LW
link
[Question]
Can we get around Godel’s Incompleteness theorems and Turing undecidable problems via infinite computers?
Noosphere89
Apr 17, 2023, 3:14 PM
−11
points
12
comments
1
min read
LW
link
La Crosse, WI Rationality Meetup
Daniel Uebele
Apr 17, 2023, 3:13 PM
1
point
0
comments
1
min read
LW
link
Slowing AI: Foundations
Zach Stein-Perlman
Apr 17, 2023, 2:30 PM
45
points
11
comments
17
min read
LW
link
Slowing AI: Reading list
Zach Stein-Perlman
Apr 17, 2023, 2:30 PM
47
points
3
comments
4
min read
LW
link
Goodhart’s Law inside the human mind
Kaj_Sotala
Apr 17, 2023, 1:48 PM
125
points
13
comments
16
min read
LW
link
Prediction: any uncontrollable AI will turn earth into a giant computer
Karl von Wendt
Apr 17, 2023, 12:30 PM
11
points
8
comments
3
min read
LW
link
AutoBound on neural network can achieve OOMs lower training loss
Maybe_a
Apr 17, 2023, 5:20 AM
10
points
9
comments
1
min read
LW
link
(ai.googleblog.com)
Making Booking.Com less out to get you
Elizabeth
Apr 17, 2023, 4:04 AM
21
points
0
comments
1
min read
LW
link
(www.alexcharlton.co)
grey goo is unlikely
bhauth
Apr 17, 2023, 1:59 AM
156
points
123
comments
9
min read
LW
link
2
reviews
(bhauth.com)
AGI Clinics: A Safe Haven for Humanity’s First Encounters with Superintelligence
portr.
Apr 17, 2023, 1:52 AM
−5
points
1
comment
1
min read
LW
link
Summaries of top forum posts (27th March to 16th April)
Zoe Williams
Apr 17, 2023, 12:28 AM
14
points
1
comment
LW
link
AI Takeover Scenario with Scaled LLMs
simeon_c
Apr 16, 2023, 11:28 PM
42
points
15
comments
8
min read
LW
link
My experience getting funding for my biological research
Metacelsus
Apr 16, 2023, 10:53 PM
78
points
10
comments
5
min read
LW
link
(denovo.substack.com)
Top lesson from GPT: we will probably destroy humanity “for the lulz” as soon as we are able.
Shmi
Apr 16, 2023, 8:27 PM
63
points
28
comments
1
min read
LW
link
On urgency, priority and collective reaction to AI-Risks: Part I
Denreik
Apr 16, 2023, 7:14 PM
−10
points
15
comments
5
min read
LW
link
Efficient Learning: Memorization
Alvin Ånestrand
Apr 16, 2023, 5:58 PM
4
points
2
comments
5
min read
LW
link
(forum.effectivealtruism.org)
Mechanistically interpreting time in GPT-2 small
rgould
,
Elizabeth Ho
and
Arthur Conmy
Apr 16, 2023, 5:57 PM
68
points
6
comments
21
min read
LW
link
La Crosse, WI Rationality Meetup
Daniel Uebele
Apr 16, 2023, 5:33 PM
1
point
0
comments
1
min read
LW
link
The Soul of the Writer (on LLMs, the psychology of writers, and the nature of intelligence)
rogersbacon
Apr 16, 2023, 4:02 PM
11
points
1
comment
3
min read
LW
link
(www.secretorum.life)
Possibilizing vs. actualizing
TsviBT
Apr 16, 2023, 3:55 PM
31
points
2
comments
5
min read
LW
link
Human Extinction by AI through economic power
ChristianKl
Apr 16, 2023, 12:15 PM
8
points
1
comment
8
min read
LW
link
Bit Flip
Charlie Sanders
Apr 16, 2023, 7:30 AM
−2
points
11
comments
11
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel