Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Mid-Atlantic AI Alignment Alliance Unconference
Quinn
Jan 13, 2023, 8:33 PM
7
points
2
comments
1
min read
LW
link
Smallpox vaccines are widely available, for now
David Hornbein
Jan 13, 2023, 8:02 PM
26
points
5
comments
1
min read
LW
link
How does GPT-3 spend its 175B parameters?
Robert_AIZI
Jan 13, 2023, 7:21 PM
41
points
14
comments
6
min read
LW
link
(aizi.substack.com)
[ASoT] Simulators show us behavioural properties by default
Jozdien
Jan 13, 2023, 6:42 PM
36
points
3
comments
3
min read
LW
link
Wheel of Consent Theory for Rationalists and Effective Altruists
adamwilder
Jan 13, 2023, 5:59 PM
1
point
0
comments
2
min read
LW
link
Money is a way of thanking strangers
DirectedEvolution
Jan 13, 2023, 5:06 PM
13
points
5
comments
4
min read
LW
link
Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind
DragonGod
Jan 13, 2023, 4:53 PM
62
points
12
comments
1
min read
LW
link
(arxiv.org)
How we could stumble into AI catastrophe
HoldenKarnofsky
Jan 13, 2023, 4:20 PM
71
points
18
comments
18
min read
LW
link
(www.cold-takes.com)
Robustness & Evolution [MLAISU W02]
Esben Kran
Jan 13, 2023, 3:47 PM
10
points
0
comments
3
min read
LW
link
(newsletter.apartresearch.com)
On Cooking With Gas
Zvi
Jan 13, 2023, 2:20 PM
38
points
60
comments
6
min read
LW
link
(thezvi.wordpress.com)
Beware safety-washing
Lizka
Jan 13, 2023, 1:59 PM
51
points
2
comments
4
min read
LW
link
Some Arguments Against Strong Scaling
Joar Skalse
Jan 13, 2023, 12:04 PM
25
points
21
comments
16
min read
LW
link
[Question]
Where do you find people who actually do things?
Ulisse Mini
Jan 13, 2023, 6:57 AM
7
points
12
comments
1
min read
LW
link
[Question]
Could Simulating an AGI Taking Over the World Actually Lead to a LLM Taking Over the World?
simeon_c
Jan 13, 2023, 6:33 AM
15
points
1
comment
1
min read
LW
link
Burning Uptime: When your Sandbox of Empathy is Leaky and also an Hourglass
Cedar
Jan 13, 2023, 5:18 AM
13
points
2
comments
3
min read
LW
link
Disentangling Shard Theory into Atomic Claims
Leon Lang
Jan 13, 2023, 4:23 AM
86
points
6
comments
18
min read
LW
link
AGISF adaptation for in-person groups
Sam Marks
,
Xander Davies
and
Richard_Ngo
Jan 13, 2023, 3:24 AM
44
points
2
comments
3
min read
LW
link
Actions and Flows
Alok Singh
Jan 13, 2023, 3:20 AM
5
points
0
comments
1
min read
LW
link
(alok.github.io)
A Thorough Introduction to Abstraction
RohanS
Jan 13, 2023, 12:30 AM
9
points
1
comment
18
min read
LW
link
The AI Control Problem in a wider intellectual context
philosophybear
Jan 13, 2023, 12:28 AM
11
points
3
comments
12
min read
LW
link
The Alignment Problems
Martín Soto
Jan 12, 2023, 10:29 PM
20
points
0
comments
4
min read
LW
link
Proposal for Inducing Steganography in LMs
Logan Riggs
Jan 12, 2023, 10:15 PM
22
points
3
comments
2
min read
LW
link
Announcing the 2023 PIBBSS Summer Research Fellowship
Nora_Ammann
and
DusanDNesic
Jan 12, 2023, 9:31 PM
32
points
0
comments
1
min read
LW
link
Victoria Krakovna on AGI Ruin, The Sharp Left Turn and Paradigms of AI Alignment
Michaël Trazzi
Jan 12, 2023, 5:09 PM
40
points
3
comments
4
min read
LW
link
(www.theinsideview.ai)
[Question]
What is a disagreement you have around AI safety?
tailcalled
Jan 12, 2023, 4:58 PM
16
points
7
comments
1
min read
LW
link
Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning
Roman Leventov
Jan 12, 2023, 4:43 PM
17
points
2
comments
2
min read
LW
link
(arxiv.org)
ChatGPT struggles to respond to the real world
Alex Flint
Jan 12, 2023, 4:02 PM
31
points
9
comments
24
min read
LW
link
Covid 1/12/23: Unexpected Spike in Deaths
Zvi
Jan 12, 2023, 2:30 PM
31
points
2
comments
8
min read
LW
link
(thezvi.wordpress.com)
[Linkpost] Scaling Laws for Generative Mixed-Modal Language Models
Amal
Jan 12, 2023, 2:24 PM
15
points
2
comments
1
min read
LW
link
(arxiv.org)
ea.domains—Domains Free to a Good Home
plex
Jan 12, 2023, 1:32 PM
24
points
0
comments
LW
link
VIRTUA: a novel about AI alignment
Karl von Wendt
Jan 12, 2023, 9:37 AM
46
points
12
comments
1
min read
LW
link
Iron deficiencies are very bad and you should treat them
Elizabeth
Jan 12, 2023, 9:10 AM
108
points
34
comments
11
min read
LW
link
1
review
(acesounderglass.com)
Nonstandard analysis in ethics
Alok Singh
Jan 12, 2023, 5:58 AM
−1
points
0
comments
78
min read
LW
link
(nickbostrom.com)
Example of the nameless rationalist virtue
Alok Singh
Jan 12, 2023, 5:45 AM
−9
points
2
comments
1
min read
LW
link
FFMI Gains: A List of Vitalities
porby
Jan 12, 2023, 4:48 AM
26
points
3
comments
7
min read
LW
link
[Linkpost] DreamerV3: A General RL Architecture
simeon_c
Jan 12, 2023, 3:55 AM
23
points
3
comments
1
min read
LW
link
(arxiv.org)
Microsoft Plans to Invest $10B in OpenAI; $3B Invested to Date | Fortune
DragonGod
Jan 12, 2023, 3:55 AM
23
points
10
comments
2
min read
LW
link
(fortune.com)
Progress and research disruptiveness
Eleni Angelou
Jan 12, 2023, 3:51 AM
3
points
2
comments
1
min read
LW
link
(www.nature.com)
The Fable of the AI Coomer: Why the Social Prowess of Machines is AI’s Most Proximal Threat
Ace Delgado
Jan 12, 2023, 1:15 AM
−10
points
4
comments
4
min read
LW
link
Write to Think
Michael Samoilov
Jan 12, 2023, 12:33 AM
10
points
2
comments
2
min read
LW
link
Alignment is not enough
Alan Chan
Jan 12, 2023, 12:33 AM
12
points
6
comments
11
min read
LW
link
(coordination.substack.com)
How it feels to have your mind hacked by an AI
blaked
Jan 12, 2023, 12:33 AM
367
points
222
comments
17
min read
LW
link
Categorical-measure-theoretic approach to optimal policies tending to seek power
jacek
12 Jan 2023 0:32 UTC
31
points
3
comments
6
min read
LW
link
Any person/mind should have the right to suicide
askofa
12 Jan 2023 0:32 UTC
14
points
13
comments
2
min read
LW
link
Have we really forsaken natural selection?
KatjaGrace
12 Jan 2023 0:10 UTC
34
points
7
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
[Question]
Using Finite Factored Sets for Causal Representation Learning?
David Reber
11 Jan 2023 22:06 UTC
2
points
3
comments
1
min read
LW
link
GWWC’s Handling of Conflicting Funding Bars
jefftk
11 Jan 2023 20:30 UTC
19
points
0
comments
3
min read
LW
link
(www.jefftk.com)
How to write a big cartesian product symbol in MathJax
Matthias G. Mayer
11 Jan 2023 20:21 UTC
8
points
1
comment
1
min read
LW
link
What’s the deal with AI consciousness?
TW123
11 Jan 2023 16:37 UTC
6
points
13
comments
9
min read
LW
link
(aiwatchtower.substack.com)
[Question]
Any significant updates on long covid risk analysis?
Randomized, Controlled
11 Jan 2023 14:31 UTC
23
points
11
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel