Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
New survey: 46% of Americans are concerned about extinction from AI; 69% support a six-month pause in AI development
Orpheus16
Apr 5, 2023, 1:26 AM
46
points
9
comments
1
min read
LW
link
(today.yougov.com)
Is AGI suicidality the golden ray of hope?
Alex Kirko
Apr 4, 2023, 11:29 PM
−18
points
4
comments
1
min read
LW
link
Recontextualizing the Risks of AI in More Predictable Outcomes
ignorepeter
Apr 4, 2023, 11:28 PM
−19
points
2
comments
5
min read
LW
link
LW Team is adjusting moderation policy
Raemon
Apr 4, 2023, 8:41 PM
304
points
185
comments
3
min read
LW
link
Excessive AI growth-rate yields little socio-economic benefit.
Cleo Nardo
Apr 4, 2023, 7:13 PM
27
points
22
comments
4
min read
LW
link
Penalize Model Complexity Via Self-Distillation
research_prime_space
Apr 4, 2023, 6:52 PM
15
points
7
comments
1
min read
LW
link
The One Heresy to Rule Them All
rogersbacon
Apr 4, 2023, 6:23 PM
−22
points
0
comments
3
min read
LW
link
(www.secretorum.life)
Giant (In)scrutable Matrices: (Maybe) the Best of All Possible Worlds
1a3orn
Apr 4, 2023, 5:39 PM
211
points
38
comments
5
min read
LW
link
1
review
Play My Futarchy/Prediction Market Mafia Game
Arjun Panickssery
Apr 4, 2023, 4:12 PM
21
points
2
comments
1
min read
LW
link
(arjunpanickssery.substack.com)
[Question]
Steelman / Ideological Turing Test of Yann LeCun’s AI X-Risk argument?
Aryeh Englander
Apr 4, 2023, 3:53 PM
26
points
14
comments
1
min read
LW
link
Given the Restrict Act, Don’t Ban TikTok
Zvi
Apr 4, 2023, 2:40 PM
97
points
9
comments
4
min read
LW
link
(thezvi.wordpress.com)
Running many AI variants to find correct goal generalization
avturchin
Apr 4, 2023, 2:16 PM
20
points
3
comments
1
min read
LW
link
Invocations: The Other Capabilities Overhang?
Robert_AIZI
Apr 4, 2023, 1:38 PM
29
points
4
comments
4
min read
LW
link
(aizi.substack.com)
Wanted: Mental Health Program Manager at Rethink Wellbeing
Inga G.
Apr 4, 2023, 11:49 AM
7
points
0
comments
2
min read
LW
link
Where Free Will and Determinism Meet
David Bravo
Apr 4, 2023, 10:59 AM
0
points
0
comments
3
min read
LW
link
Strategies to Prevent AI Annihilation
lastchanceformankind
Apr 4, 2023, 8:59 AM
−2
points
0
comments
4
min read
LW
link
ACX Meetup Madrid
Pablo Villalobos
Apr 4, 2023, 8:53 AM
5
points
2
comments
1
min read
LW
link
[Question]
Best Ways to Try to Get Funding for Alignment Research?
RGRGRG
Apr 4, 2023, 6:35 AM
9
points
6
comments
1
min read
LW
link
Consider applying to a 2-week alignment project with former GitHub CEO
Bird Concept
Apr 4, 2023, 6:20 AM
42
points
0
comments
1
min read
LW
link
(twitter.com)
On how it feels generating art with DALL-E
cortrinkau
Apr 4, 2023, 4:13 AM
5
points
0
comments
3
min read
LW
link
(cortrinkau.bearblog.dev)
AI Summer Harvest
Cleo Nardo
Apr 4, 2023, 3:35 AM
130
points
10
comments
1
min read
LW
link
How to respond to the recent condemnations of the rationalist community
Christopher King
Apr 4, 2023, 1:42 AM
−2
points
7
comments
4
min read
LW
link
Steering systems
Max H
Apr 4, 2023, 12:56 AM
50
points
1
comment
15
min read
LW
link
ChatGPT Suggests Listening To Russell & Yudkowsky
JenniferRM
Apr 4, 2023, 12:30 AM
9
points
1
comment
17
min read
LW
link
Complex Systems are Hard to Control
jsteinhardt
Apr 4, 2023, 12:00 AM
42
points
5
comments
10
min read
LW
link
(bounded-regret.ghost.io)
Apply to the Cavendish Labs Fellowship (by 4/15)
agg
and
derikk
Apr 3, 2023, 11:09 PM
11
points
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Twin Cities ACX Meetup—April 2023
Timothy M.
Apr 3, 2023, 11:07 PM
5
points
3
comments
1
min read
LW
link
Communicating effectively under Knightian norms
Richard_Ngo
Apr 3, 2023, 10:39 PM
96
points
54
comments
6
min read
LW
link
If interpretability research goes well, it may get dangerous
So8res
Apr 3, 2023, 9:48 PM
201
points
11
comments
2
min read
LW
link
Towards empathy in RL agents and beyond: Insights from cognitive science for AI Alignment
Marc Carauleanu
Apr 3, 2023, 7:59 PM
15
points
6
comments
1
min read
LW
link
(clipchamp.com)
Monthly Roundup #5: April 2023
Zvi
Apr 3, 2023, 6:50 PM
26
points
12
comments
14
min read
LW
link
(thezvi.wordpress.com)
Exploring non-anthropocentric aspects of AI existential safety
mishka
Apr 3, 2023, 6:07 PM
8
points
0
comments
3
min read
LW
link
[Question]
GJP on AGI
Suh_Prance_Alot
Apr 3, 2023, 5:21 PM
2
points
0
comments
1
min read
LW
link
Do we have a plan for the “first critical try” problem?
Christopher King
Apr 3, 2023, 4:27 PM
−3
points
14
comments
1
min read
LW
link
Exploratory Analysis of RLHF Transformers with TransformerLens
Curt Tigges
Apr 3, 2023, 4:09 PM
21
points
2
comments
11
min read
LW
link
(blog.eleuther.ai)
AWS Has Raised Prices Before
jefftk
Apr 3, 2023, 4:00 PM
7
points
3
comments
1
min read
LW
link
(www.jefftk.com)
Mati’s introduction to pausing giant AI experiments
Mati_Roy
Apr 3, 2023, 3:56 PM
7
points
0
comments
2
min read
LW
link
Superintelligence will outsmart us or it isn’t superintelligence
Neil
Apr 3, 2023, 3:01 PM
−4
points
4
comments
1
min read
LW
link
AI-kills-everyone scenarios require robotic infrastructure, but not necessarily nanotech
avturchin
Apr 3, 2023, 12:45 PM
53
points
47
comments
4
min read
LW
link
Orthogonality is expensive
beren
Apr 3, 2023, 10:20 AM
43
points
9
comments
3
min read
LW
link
Repeated Play of Imperfect Newcomb’s Paradox in Infra-Bayesian Physicalism
Sven Nilsen
Apr 3, 2023, 10:06 AM
2
points
0
comments
2
min read
LW
link
Effective Altruism Virtual Programs Apr-May 2023
Yve Nichols-Evans
Apr 3, 2023, 6:40 AM
1
point
0
comments
1
min read
LW
link
Board Game Theory
Optimization Process
Apr 3, 2023, 6:23 AM
8
points
0
comments
3
min read
LW
link
Planecrash Podcast
planecrashpodcast
Apr 3, 2023, 4:34 AM
10
points
5
comments
1
min read
LW
link
[Question]
I’m just starting to grasp Shard Theory. Is that a normal feeling?
twkaiser
Apr 3, 2023, 3:08 AM
−20
points
1
comment
1
min read
LW
link
Rules for living in a 99.9+% lizardman world
at_the_zoo
Apr 3, 2023, 2:39 AM
−1
points
12
comments
1
min read
LW
link
The Friendly Drunk Fool Alignment Strategy
JenniferRM
Apr 3, 2023, 1:26 AM
29
points
19
comments
11
min read
LW
link
Slack Group: Rationalist Startup Founders
Adam Zerner
Apr 3, 2023, 12:44 AM
31
points
2
comments
3
min read
LW
link
Orthogonality is Expensive
DragonGod
Apr 3, 2023, 12:43 AM
21
points
3
comments
1
min read
LW
link
(www.beren.io)
GTP4 capable of limited recursive improving?
Boris Kashirin
Apr 2, 2023, 9:38 PM
2
points
3
comments
1
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel