Matt Clancy AMA on the Progress Forum
jasoncrawford
Feb 12, 2023, 8:23 PM
17
points
0
comments
1
min read
LW
link
(progressforum.org)
Latent variables for prediction markets: motivation, technical guide, and design considerations
tailcalled
Feb 12, 2023, 5:54 PM
100
points
25
comments
23
min read
LW
link
2
reviews
The conceptual Doppelgänger problem
TsviBT
Feb 12, 2023, 5:23 PM
12
points
5
comments
4
min read
LW
link
How Cardioid Are Cardioids?
jefftk
Feb 12, 2023, 4:20 PM
9
points
0
comments
2
min read
LW
link
(www.jefftk.com)
How many of these jobs will have a 15% or more drop in employment plausibly attributable to AI by 2031?
tailcalled
Feb 12, 2023, 3:40 PM
12
points
5
comments
1
min read
LW
link
(manifold.markets)
Human-AI collaborative writing
DirectedEvolution
Feb 12, 2023, 2:57 PM
20
points
2
comments
5
min read
LW
link
RaD-AI workshop
Ram Rachum
Feb 12, 2023, 12:46 PM
3
points
0
comments
1
min read
LW
link
Elements of Rationalist Discourse
Rob Bensinger
Feb 12, 2023, 7:58 AM
224
points
49
comments
3
min read
LW
link
1
review
Conflict Theory of Bounded Distrust
Zack_M_Davis
Feb 12, 2023, 5:30 AM
112
points
33
comments
3
min read
LW
link
1
review
Why almost every RL agent does learned optimization
Lee Sharkey
Feb 12, 2023, 4:58 AM
32
points
3
comments
5
min read
LW
link
How I Learn From Textbooks
DirectedEvolution
Feb 12, 2023, 4:45 AM
26
points
3
comments
8
min read
LW
link
Top YouTube channel Veritasium releases video on Sleeping Beauty Problem
Alex_Altair
Feb 11, 2023, 8:36 PM
25
points
22
comments
1
min read
LW
link
(www.youtube.com)
Shortening Timelines: There’s No Buffer Anymore
Jeff Rose
Feb 11, 2023, 7:53 PM
10
points
5
comments
1
min read
LW
link
We Found An Neuron in GPT-2
Joseph Miller
and
Clement Neo
Feb 11, 2023, 6:27 PM
143
points
23
comments
7
min read
LW
link
(clementneo.com)
The Practitioner’s Path 2.0: the Pragmatist Archetype
Evenflair
Feb 11, 2023, 3:48 PM
21
points
0
comments
2
min read
LW
link
(guildoftherose.org)
The Illusion of Simplicity: Monetary Policy as a Problem of Complexity and Alignment
Edward P. Könings
Feb 11, 2023, 3:04 PM
8
points
0
comments
8
min read
LW
link
(edwardknings.substack.com)
In Defense of Chatbot Romance
Kaj_Sotala
Feb 11, 2023, 2:30 PM
124
points
53
comments
11
min read
LW
link
(kajsotala.fi)
Threatening to do the impossible: A solution to spurious counterfactuals for functional decision theory via proof theory
Christopher King
Feb 11, 2023, 7:57 AM
5
points
4
comments
5
min read
LW
link
Rationality-related things I don’t know as of 2023
Adam Zerner
Feb 11, 2023, 6:04 AM
64
points
59
comments
3
min read
LW
link
A note on ‘semiotic physics’
metasemi
Feb 11, 2023, 5:12 AM
11
points
13
comments
6
min read
LW
link
Inequality Penalty: Morality in Many Worlds
Shmi
Feb 11, 2023, 4:08 AM
11
points
17
comments
6
min read
LW
link
The Importance of AI Alignment, explained in 5 points
Daniel_Eth
Feb 11, 2023, 2:56 AM
33
points
2
comments
LW
link
Acting Normal is Good, Actually
Gordon Seidoh Worley
Feb 10, 2023, 11:35 PM
14
points
5
comments
3
min read
LW
link
[S] D&D.Sci: All the D8a. Allllllll of it.
aphyer
Feb 10, 2023, 9:14 PM
43
points
17
comments
6
min read
LW
link
A Different Kind of Ark: My failed attempt to build a bridge between universes
ChrisM
Feb 10, 2023, 8:49 PM
2
points
2
comments
6
min read
LW
link
(www.vesselproject.io)
Prizes for the 2021 Review
Raemon
Feb 10, 2023, 7:47 PM
69
points
2
comments
4
min read
LW
link
A proposed method for forecasting transformative AI
Matthew Barnett
Feb 10, 2023, 7:34 PM
121
points
21
comments
10
min read
LW
link
The best way so far to explain AI risk: The Precipice (p. 137-149)
trevor
Feb 10, 2023, 7:33 PM
53
points
2
comments
17
min read
LW
link
Is this a weak pivotal act: creating nanobots that eat evil AGIs (but nothing else)?
Christopher King
Feb 10, 2023, 7:26 PM
0
points
3
comments
1
min read
LW
link
Why I’m not working on {debate, RRM, ELK, natural abstractions}
Steven Byrnes
Feb 10, 2023, 7:22 PM
71
points
19
comments
10
min read
LW
link
Conditioning Predictive Models: Open problems, Conclusion, and Appendix
evhub
,
Adam Jermyn
,
Johannes Treutlein
,
Rubi J. Hudson
and
kcwoolverton
Feb 10, 2023, 7:21 PM
36
points
3
comments
11
min read
LW
link
Jobs that can help with the most important century
HoldenKarnofsky
Feb 10, 2023, 6:20 PM
24
points
0
comments
19
min read
LW
link
(www.cold-takes.com)
[Question]
Is it a coincidence that GPT-3 requires roughly the same amount of compute as is necessary to emulate the human brain?
RomanS
Feb 10, 2023, 4:26 PM
11
points
10
comments
1
min read
LW
link
Contra: Changing Role Terms
jefftk
Feb 10, 2023, 3:00 PM
8
points
0
comments
3
min read
LW
link
(www.jefftk.com)
Cyborgism
NicholasKees
and
janus
Feb 10, 2023, 2:47 PM
332
points
46
comments
35
min read
LW
link
2
reviews
FLI Podcast: Connor Leahy on AI Progress, Chimps, Memes, and Markets (Part 1/3)
remember
and
Andrea_Miotti
Feb 10, 2023, 1:55 PM
39
points
0
comments
43
min read
LW
link
[Question]
What’s actually going on in the “mind” of the model when we fine-tune GPT-3 to InstructGPT?
rpglover64
Feb 10, 2023, 7:57 AM
18
points
3
comments
1
min read
LW
link
Mechanism Design for AI Safety—Agenda Creation Retreat
Rubi J. Hudson
Feb 10, 2023, 3:05 AM
24
points
2
comments
LW
link
[Question]
On utility functions
jodaru
Feb 10, 2023, 1:22 AM
11
points
10
comments
1
min read
LW
link
Security Mindset—Fire Alarms and Trigger Signatures
elspood
Feb 9, 2023, 9:15 PM
23
points
0
comments
4
min read
LW
link
Impostor syndrome: how to cure it with spreadsheets and meditation
KatWoods
Feb 9, 2023, 9:04 PM
31
points
2
comments
19
min read
LW
link
Conditioning Predictive Models: Deployment strategy
evhub
,
Adam Jermyn
,
Johannes Treutlein
,
Rubi J. Hudson
and
kcwoolverton
9 Feb 2023 20:59 UTC
28
points
0
comments
10
min read
LW
link
Make Conflict of Interest Policies Public
jefftk
9 Feb 2023 19:30 UTC
33
points
7
comments
2
min read
LW
link
(www.jefftk.com)
Curated blind auction prediction markets and a reputation system as an alternative to editorial review in news publication.
ciaran
9 Feb 2023 18:48 UTC
2
points
0
comments
2
min read
LW
link
Tools for finding information on the internet
RomanHauksson
9 Feb 2023 17:05 UTC
79
points
11
comments
2
min read
LW
link
(roman.computer)
Covid 2/9/23: Interferon λ
Zvi
9 Feb 2023 16:50 UTC
48
points
8
comments
12
min read
LW
link
(thezvi.wordpress.com)
EIS II: What is “Interpretability”?
scasper
9 Feb 2023 16:48 UTC
28
points
6
comments
4
min read
LW
link
The Engineer’s Interpretability Sequence (EIS) I: Intro
scasper
9 Feb 2023 16:28 UTC
46
points
24
comments
3
min read
LW
link
[Question]
Do the Safety Properties of Powerful AI Systems Need to be Adversarially Robust? Why?
DragonGod
9 Feb 2023 13:36 UTC
22
points
42
comments
2
min read
LW
link
Which ML skills are useful for finding a new AIS research agenda?
Yonatan Cale
9 Feb 2023 13:09 UTC
16
points
1
comment
1
min read
LW
link