Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
The Alignment Problems
Martín Soto
Jan 12, 2023, 10:29 PM
20
points
0
comments
4
min read
LW
link
Proposal for Inducing Steganography in LMs
Logan Riggs
Jan 12, 2023, 10:15 PM
22
points
3
comments
2
min read
LW
link
Announcing the 2023 PIBBSS Summer Research Fellowship
Nora_Ammann
and
DusanDNesic
Jan 12, 2023, 9:31 PM
32
points
0
comments
1
min read
LW
link
Victoria Krakovna on AGI Ruin, The Sharp Left Turn and Paradigms of AI Alignment
Michaël Trazzi
Jan 12, 2023, 5:09 PM
40
points
3
comments
4
min read
LW
link
(www.theinsideview.ai)
[Question]
What is a disagreement you have around AI safety?
tailcalled
Jan 12, 2023, 4:58 PM
16
points
7
comments
1
min read
LW
link
Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning
Roman Leventov
Jan 12, 2023, 4:43 PM
17
points
2
comments
2
min read
LW
link
(arxiv.org)
ChatGPT struggles to respond to the real world
Alex Flint
Jan 12, 2023, 4:02 PM
31
points
9
comments
24
min read
LW
link
Covid 1/12/23: Unexpected Spike in Deaths
Zvi
Jan 12, 2023, 2:30 PM
31
points
2
comments
8
min read
LW
link
(thezvi.wordpress.com)
[Linkpost] Scaling Laws for Generative Mixed-Modal Language Models
Amal
Jan 12, 2023, 2:24 PM
15
points
2
comments
1
min read
LW
link
(arxiv.org)
ea.domains—Domains Free to a Good Home
plex
Jan 12, 2023, 1:32 PM
24
points
0
comments
LW
link
VIRTUA: a novel about AI alignment
Karl von Wendt
Jan 12, 2023, 9:37 AM
46
points
12
comments
1
min read
LW
link
Iron deficiencies are very bad and you should treat them
Elizabeth
Jan 12, 2023, 9:10 AM
108
points
34
comments
11
min read
LW
link
1
review
(acesounderglass.com)
Nonstandard analysis in ethics
Alok Singh
Jan 12, 2023, 5:58 AM
−1
points
0
comments
78
min read
LW
link
(nickbostrom.com)
Example of the nameless rationalist virtue
Alok Singh
Jan 12, 2023, 5:45 AM
−9
points
2
comments
1
min read
LW
link
FFMI Gains: A List of Vitalities
porby
Jan 12, 2023, 4:48 AM
26
points
3
comments
7
min read
LW
link
[Linkpost] DreamerV3: A General RL Architecture
simeon_c
Jan 12, 2023, 3:55 AM
23
points
3
comments
1
min read
LW
link
(arxiv.org)
Microsoft Plans to Invest $10B in OpenAI; $3B Invested to Date | Fortune
DragonGod
Jan 12, 2023, 3:55 AM
23
points
10
comments
2
min read
LW
link
(fortune.com)
Progress and research disruptiveness
Eleni Angelou
Jan 12, 2023, 3:51 AM
3
points
2
comments
1
min read
LW
link
(www.nature.com)
The Fable of the AI Coomer: Why the Social Prowess of Machines is AI’s Most Proximal Threat
Ace Delgado
Jan 12, 2023, 1:15 AM
−10
points
4
comments
4
min read
LW
link
Write to Think
Michael Samoilov
Jan 12, 2023, 12:33 AM
10
points
2
comments
2
min read
LW
link
Alignment is not enough
Alan Chan
Jan 12, 2023, 12:33 AM
12
points
6
comments
11
min read
LW
link
(coordination.substack.com)
How it feels to have your mind hacked by an AI
blaked
Jan 12, 2023, 12:33 AM
367
points
222
comments
17
min read
LW
link
Categorical-measure-theoretic approach to optimal policies tending to seek power
jacek
Jan 12, 2023, 12:32 AM
31
points
3
comments
6
min read
LW
link
Any person/mind should have the right to suicide
askofa
Jan 12, 2023, 12:32 AM
14
points
13
comments
2
min read
LW
link
Have we really forsaken natural selection?
KatjaGrace
Jan 12, 2023, 12:10 AM
34
points
7
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
[Question]
Using Finite Factored Sets for Causal Representation Learning?
David Reber
Jan 11, 2023, 10:06 PM
2
points
3
comments
1
min read
LW
link
GWWC’s Handling of Conflicting Funding Bars
jefftk
Jan 11, 2023, 8:30 PM
19
points
0
comments
3
min read
LW
link
(www.jefftk.com)
How to write a big cartesian product symbol in MathJax
Matthias G. Mayer
Jan 11, 2023, 8:21 PM
8
points
1
comment
1
min read
LW
link
What’s the deal with AI consciousness?
TW123
Jan 11, 2023, 4:37 PM
6
points
13
comments
9
min read
LW
link
(aiwatchtower.substack.com)
[Question]
Any significant updates on long covid risk analysis?
Randomized, Controlled
Jan 11, 2023, 2:31 PM
23
points
11
comments
1
min read
LW
link
internal in nonstandard analysis
Alok Singh
Jan 11, 2023, 9:58 AM
9
points
1
comment
1
min read
LW
link
Compounding Resource X
Raemon
Jan 11, 2023, 3:14 AM
77
points
6
comments
9
min read
LW
link
Running With a Backpack
jefftk
Jan 11, 2023, 3:00 AM
19
points
11
comments
1
min read
LW
link
(www.jefftk.com)
A simple thought experiment showing why recessions are an unnecessary bug in our economic system
skogsnisse
Jan 11, 2023, 12:43 AM
1
point
1
comment
1
min read
LW
link
We don’t trade with ants
KatjaGrace
Jan 10, 2023, 11:50 PM
272
points
109
comments
7
min read
LW
link
1
review
(worldspiritsockpuppet.com)
[Question]
Who are the people who are currently profiting from inflation?
skogsnisse
Jan 10, 2023, 9:39 PM
1
point
2
comments
1
min read
LW
link
Is Progress Real?
rogersbacon
Jan 10, 2023, 5:42 PM
5
points
14
comments
14
min read
LW
link
(www.secretorum.life)
200 COP in MI: Interpreting Reinforcement Learning
Neel Nanda
Jan 10, 2023, 5:37 PM
25
points
1
comment
10
min read
LW
link
AGI and the EMH: markets are not expecting aligned or unaligned AI in the next 30 years
basil.halperin
,
J. Zachary Mazlish
and
tmychow
Jan 10, 2023, 4:06 PM
119
points
44
comments
26
min read
LW
link
The Alignment Problem from a Deep Learning Perspective (major rewrite)
SoerenMind
,
Richard_Ngo
and
LawrenceC
Jan 10, 2023, 4:06 PM
84
points
8
comments
39
min read
LW
link
(arxiv.org)
Against using stock prices to forecast AI timelines
basil.halperin
,
tmychow
and
J. Zachary Mazlish
Jan 10, 2023, 4:03 PM
23
points
2
comments
2
min read
LW
link
Sorting Pebbles Into Correct Heaps: The Animation
Writer
10 Jan 2023 15:58 UTC
26
points
2
comments
1
min read
LW
link
(youtu.be)
Escape Velocity from Bullshit Jobs
Zvi
10 Jan 2023 14:30 UTC
61
points
18
comments
5
min read
LW
link
(thezvi.wordpress.com)
Scaling laws vs individual differences
beren
10 Jan 2023 13:22 UTC
45
points
21
comments
7
min read
LW
link
Notes on writing
RP
10 Jan 2023 4:01 UTC
35
points
11
comments
3
min read
LW
link
Idea: Learning How To Move Towards The Metagame
Algon
10 Jan 2023 0:58 UTC
10
points
3
comments
1
min read
LW
link
Review AI Alignment posts to help figure out how to make a proper AI Alignment review
habryka
and
Raemon
10 Jan 2023 0:19 UTC
85
points
31
comments
2
min read
LW
link
Against the paradox of tolerance
pchvykov
10 Jan 2023 0:12 UTC
1
point
11
comments
3
min read
LW
link
Increased Scam Quality/Quantity (Hypothesis in need of data)?
Beeblebrox
9 Jan 2023 22:57 UTC
9
points
6
comments
1
min read
LW
link
Wentworth and Larsen on buying time
Orpheus16
,
Thomas Larsen
and
johnswentworth
9 Jan 2023 21:31 UTC
74
points
6
comments
12
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel