Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Auto-GPT: Open-sourced disaster?
awg
Apr 5, 2023, 10:46 PM
23
points
18
comments
1
min read
LW
link
(github.com)
The Orthogonality Thesis is Not Obviously True
omnizoid
Apr 5, 2023, 9:06 PM
3
points
80
comments
9
min read
LW
link
Williams-Beuren Syndrome: Frendly Mutations
Takk
Apr 5, 2023, 8:59 PM
−1
points
1
comment
1
min read
LW
link
OpenAI: Our approach to AI safety
Jacob G-W
Apr 5, 2023, 8:26 PM
1
point
1
comment
1
min read
LW
link
(openai.com)
Why Are Maximum Entropy Distributions So Ubiquitous?
johnswentworth
Apr 5, 2023, 8:12 PM
68
points
6
comments
9
min read
LW
link
“On Living in an Atomic Age”, by C.S. Lewis (1948)
tjaffee
Apr 5, 2023, 6:34 PM
17
points
3
comments
8
min read
LW
link
(hebrew-streams.org)
Eliezer Yudkowsky’s Letter in Time Magazine
Zvi
Apr 5, 2023, 6:00 PM
214
points
86
comments
14
min read
LW
link
(thezvi.wordpress.com)
Dark Artificial Intelligence
FrankAI
Apr 5, 2023, 5:37 PM
0
points
0
comments
4
min read
LW
link
[Question]
Best arguments against instrumental convergence?
lfrymire
Apr 5, 2023, 5:06 PM
5
points
7
comments
1
min read
LW
link
Progress links and tweets, 2023-04-05
jasoncrawford
Apr 5, 2023, 4:18 PM
20
points
0
comments
2
min read
LW
link
(rootsofprogress.org)
Universality and Hidden Information in Concept Bottleneck Models
Hoagy
Apr 5, 2023, 2:00 PM
23
points
0
comments
11
min read
LW
link
AI safety and the security mindset: user interface design, red-teams, formal verification
Allison Duettmann
Apr 5, 2023, 11:33 AM
35
points
0
comments
8
min read
LW
link
ICA Simulacra
Ozyrus
Apr 5, 2023, 6:41 AM
26
points
2
comments
7
min read
LW
link
AGI deployment as an act of aggression
dr_s
Apr 5, 2023, 6:39 AM
28
points
30
comments
13
min read
LW
link
A Brief Introduction to Algorithmic Common Intelligence, ACI . 1
Akira Pyinya
Apr 5, 2023, 5:43 AM
−2
points
1
comment
2
min read
LW
link
46% of US adults at least “somewhat concerned” about AI extinction risk.
Foyle
Apr 5, 2023, 5:25 AM
1
point
0
comments
1
min read
LW
link
[Question]
Has anyone thought about how to proceed now that AI notkilleveryoneism is becoming more relevant/is approaching the Overton window?
metachirality
Apr 5, 2023, 3:06 AM
11
points
8
comments
1
min read
LW
link
Empathy bandaid for immediate AI catastrophe
installgentoo
Apr 5, 2023, 2:12 AM
1
point
2
comments
1
min read
LW
link
“Corrigibility at some small length” by dath ilan
Christopher King
Apr 5, 2023, 1:47 AM
32
points
3
comments
9
min read
LW
link
(www.glowfic.com)
New survey: 46% of Americans are concerned about extinction from AI; 69% support a six-month pause in AI development
Orpheus16
Apr 5, 2023, 1:26 AM
46
points
9
comments
1
min read
LW
link
(today.yougov.com)
Is AGI suicidality the golden ray of hope?
Alex Kirko
Apr 4, 2023, 11:29 PM
−18
points
4
comments
1
min read
LW
link
Recontextualizing the Risks of AI in More Predictable Outcomes
ignorepeter
Apr 4, 2023, 11:28 PM
−19
points
2
comments
5
min read
LW
link
LW Team is adjusting moderation policy
Raemon
Apr 4, 2023, 8:41 PM
304
points
185
comments
3
min read
LW
link
Excessive AI growth-rate yields little socio-economic benefit.
Cleo Nardo
Apr 4, 2023, 7:13 PM
27
points
22
comments
4
min read
LW
link
Penalize Model Complexity Via Self-Distillation
research_prime_space
Apr 4, 2023, 6:52 PM
15
points
7
comments
1
min read
LW
link
The One Heresy to Rule Them All
rogersbacon
Apr 4, 2023, 6:23 PM
−22
points
0
comments
3
min read
LW
link
(www.secretorum.life)
Giant (In)scrutable Matrices: (Maybe) the Best of All Possible Worlds
1a3orn
Apr 4, 2023, 5:39 PM
211
points
38
comments
5
min read
LW
link
1
review
Play My Futarchy/Prediction Market Mafia Game
Arjun Panickssery
Apr 4, 2023, 4:12 PM
21
points
2
comments
1
min read
LW
link
(arjunpanickssery.substack.com)
[Question]
Steelman / Ideological Turing Test of Yann LeCun’s AI X-Risk argument?
Aryeh Englander
Apr 4, 2023, 3:53 PM
26
points
14
comments
1
min read
LW
link
Given the Restrict Act, Don’t Ban TikTok
Zvi
Apr 4, 2023, 2:40 PM
97
points
9
comments
4
min read
LW
link
(thezvi.wordpress.com)
Running many AI variants to find correct goal generalization
avturchin
Apr 4, 2023, 2:16 PM
20
points
3
comments
1
min read
LW
link
Invocations: The Other Capabilities Overhang?
Robert_AIZI
Apr 4, 2023, 1:38 PM
29
points
4
comments
4
min read
LW
link
(aizi.substack.com)
Wanted: Mental Health Program Manager at Rethink Wellbeing
Inga G.
Apr 4, 2023, 11:49 AM
7
points
0
comments
LW
link
Where Free Will and Determinism Meet
David Bravo
Apr 4, 2023, 10:59 AM
0
points
0
comments
3
min read
LW
link
Strategies to Prevent AI Annihilation
lastchanceformankind
Apr 4, 2023, 8:59 AM
−2
points
0
comments
4
min read
LW
link
ACX Meetup Madrid
Pablo Villalobos
Apr 4, 2023, 8:53 AM
5
points
2
comments
1
min read
LW
link
[Question]
Best Ways to Try to Get Funding for Alignment Research?
RGRGRG
Apr 4, 2023, 6:35 AM
9
points
6
comments
1
min read
LW
link
Consider applying to a 2-week alignment project with former GitHub CEO
Bird Concept
Apr 4, 2023, 6:20 AM
42
points
0
comments
1
min read
LW
link
(twitter.com)
On how it feels generating art with DALL-E
cortrinkau
Apr 4, 2023, 4:13 AM
5
points
0
comments
3
min read
LW
link
(cortrinkau.bearblog.dev)
AI Summer Harvest
Cleo Nardo
Apr 4, 2023, 3:35 AM
130
points
10
comments
1
min read
LW
link
How to respond to the recent condemnations of the rationalist community
Christopher King
Apr 4, 2023, 1:42 AM
−2
points
7
comments
4
min read
LW
link
Steering systems
Max H
Apr 4, 2023, 12:56 AM
50
points
1
comment
15
min read
LW
link
ChatGPT Suggests Listening To Russell & Yudkowsky
JenniferRM
Apr 4, 2023, 12:30 AM
9
points
1
comment
17
min read
LW
link
Complex Systems are Hard to Control
jsteinhardt
Apr 4, 2023, 12:00 AM
42
points
5
comments
10
min read
LW
link
(bounded-regret.ghost.io)
Apply to the Cavendish Labs Fellowship (by 4/15)
agg
and
derikk
Apr 3, 2023, 11:09 PM
11
points
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Twin Cities ACX Meetup—April 2023
Timothy M.
Apr 3, 2023, 11:07 PM
5
points
3
comments
1
min read
LW
link
Communicating effectively under Knightian norms
Richard_Ngo
Apr 3, 2023, 10:39 PM
96
points
54
comments
6
min read
LW
link
If interpretability research goes well, it may get dangerous
So8res
Apr 3, 2023, 9:48 PM
201
points
11
comments
2
min read
LW
link
Towards empathy in RL agents and beyond: Insights from cognitive science for AI Alignment
Marc Carauleanu
Apr 3, 2023, 7:59 PM
15
points
6
comments
1
min read
LW
link
(clipchamp.com)
Monthly Roundup #5: April 2023
Zvi
Apr 3, 2023, 6:50 PM
26
points
12
comments
14
min read
LW
link
(thezvi.wordpress.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel