[Question]
Lost in the sauce
JungleTact1cs
Mar 2, 2023, 4:58 PM
−5
points
12
comments
1
min read
LW
link
AI #2
Zvi
Mar 2, 2023, 2:50 PM
66
points
18
comments
55
min read
LW
link
(thezvi.wordpress.com)
Payor’s Lemma in Natural Language
Andrew_Critch
Mar 2, 2023, 12:22 PM
62
points
0
comments
2
min read
LW
link
Joscha Bach on Synthetic Intelligence [annotated]
Roman Leventov
Mar 2, 2023, 11:02 AM
10
points
1
comment
9
min read
LW
link
(www.jimruttshow.com)
[Question]
If I want to test how good I would be as an AI safety researcher alongside my full-time job (with the hope of it becoming my full-time career at some point), is this a good plan?
Malleable_shape
Mar 2, 2023, 9:44 AM
16
points
0
comments
4
min read
LW
link
Job listing (closed): Sentience Institute is accepting applications for a researcher
michael_dello
Mar 2, 2023, 4:40 AM
6
points
0
comments
5
min read
LW
link
(www.sentienceinstitute.org)
Reflection Mechanisms as an Alignment Target—Attitudes on “near-term” AI
elandgre
,
Beth Barnes
and
Marius Hobbhahn
Mar 2, 2023, 4:29 AM
21
points
0
comments
8
min read
LW
link
Live Kingfisher Album?
jefftk
Mar 2, 2023, 3:40 AM
11
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Don’t Jump or I’ll...
Double
Mar 2, 2023, 2:58 AM
13
points
7
comments
4
min read
LW
link
Clippy, the friendly paperclipper
Seth Herd
Mar 2, 2023, 12:02 AM
3
points
11
comments
2
min read
LW
link
Human level AI can plausibly take over the world
anithite
Mar 1, 2023, 11:27 PM
27
points
12
comments
2
min read
LW
link
Extreme GDP growth is a bad operating definition of “slow takeoff”
lc
Mar 1, 2023, 10:25 PM
24
points
1
comment
1
min read
LW
link
Learn the mathematical structure, not the conceptual structure
Adam Shai
Mar 1, 2023, 10:24 PM
98
points
35
comments
2
min read
LW
link
The Parable of the King and the Random Process
moridinamael
Mar 1, 2023, 10:18 PM
312
points
26
comments
6
min read
LW
link
3
reviews
To MIRI-style folk, you can’t simulate the universe from the beginning
the gears to ascension
Mar 1, 2023, 9:38 PM
2
points
19
comments
2
min read
LW
link
OpenAI introduce ChatGPT API at 1/10th the previous $/token
Arthur Conmy
Mar 1, 2023, 8:48 PM
28
points
4
comments
1
min read
LW
link
(openai.com)
Progress links and tweets, 2023-03-01
jasoncrawford
Mar 1, 2023, 8:33 PM
12
points
2
comments
1
min read
LW
link
(rootsofprogress.org)
Taboo “compute overhang”
Zach Stein-Perlman
Mar 1, 2023, 7:15 PM
21
points
8
comments
1
min read
LW
link
Call for Cruxes by Rhyme, a Longtermist History Consultancy
Lara
Mar 1, 2023, 6:39 PM
1
point
0
comments
3
min read
LW
link
(forum.effectivealtruism.org)
Fighting without hope
Orpheus16
Mar 1, 2023, 6:15 PM
46
points
14
comments
4
min read
LW
link
1
review
Sunlight is yellow parallel rays plus blue isotropic light
Thomas Kehrenberg
Mar 1, 2023, 5:58 PM
77
points
5
comments
2
min read
LW
link
Timeline: The proximal origin of SARS-CoV-2
ChristianKl
Mar 1, 2023, 5:02 PM
9
points
4
comments
1
min read
LW
link
(usrtk.org)
Twin Cities ACX Meetup—Mar 2023
Timothy M.
Mar 1, 2023, 4:54 PM
3
points
3
comments
1
min read
LW
link
Some Variants of Sleeping Beauty
SMK
and
EOC
Mar 1, 2023, 4:51 PM
34
points
10
comments
8
min read
LW
link
Dealing with infinite entropy
Alex_Altair
Mar 1, 2023, 3:01 PM
70
points
9
comments
11
min read
LW
link
Scoring forecasts from the 2016 “Expert Survey on Progress in AI”
PatrickL
Mar 1, 2023, 2:41 PM
29
points
6
comments
9
min read
LW
link
AI: Practical Advice for the Worried
Zvi
Mar 1, 2023, 12:30 PM
155
points
49
comments
16
min read
LW
link
2
reviews
(thezvi.wordpress.com)
My current thinking about ChatGPT @3QD [Gärdenfors, Wolfram, and the value of speculation]
Bill Benzon
Mar 1, 2023, 10:50 AM
2
points
0
comments
5
min read
LW
link
Open & Welcome Thread — March 2023
niplav
1 Mar 2023 9:30 UTC
7
points
48
comments
1
min read
LW
link
Problems of people new to AI safety and my project ideas to mitigate them
Igor Ivanov
1 Mar 2023 9:09 UTC
38
points
4
comments
7
min read
LW
link
reflections on lockdown, two years out
mingyuan
1 Mar 2023 6:58 UTC
86
points
9
comments
3
min read
LW
link
[Question]
(Cryonics) can I be frozen before being near-death?
hollowing
1 Mar 2023 6:44 UTC
6
points
16
comments
1
min read
LW
link
An evening at a bar
yakimoff
1 Mar 2023 6:40 UTC
1
point
2
comments
1
min read
LW
link
Predictions for shard theory mechanistic interpretability results
TurnTrout
,
Ulisse Mini
and
peligrietzer
1 Mar 2023 5:16 UTC
105
points
10
comments
5
min read
LW
link
Implied “utilities” of simulators are broad, dense, and shallow
porby
1 Mar 2023 3:23 UTC
45
points
7
comments
3
min read
LW
link
Contract Fraud
jefftk
1 Mar 2023 3:10 UTC
86
points
10
comments
1
min read
LW
link
(www.jefftk.com)
Inside the mind of a superhuman Go model: How does Leela Zero read ladders?
Haoxing Du
1 Mar 2023 1:47 UTC
157
points
8
comments
30
min read
LW
link
Bing Chat is a Precursor to Something Legitimately Dangerous
Simon Berens
1 Mar 2023 1:36 UTC
20
points
6
comments
2
min read
LW
link
(www.simonberens.com)
Enemies vs Malefactors
So8res
28 Feb 2023 23:38 UTC
226
points
69
comments
LW
link
4
reviews
Scarce Channels and Abstraction Coupling
johnswentworth
28 Feb 2023 23:26 UTC
41
points
11
comments
6
min read
LW
link
On “prepping” personal households for AI doom scenarios
coryfklein
28 Feb 2023 22:17 UTC
0
points
0
comments
2
min read
LW
link
Power-seeking can be probable and predictive for trained agents
Vika
and
janos
28 Feb 2023 21:10 UTC
56
points
22
comments
9
min read
LW
link
(arxiv.org)
Can AI experience nihilism?
chenbo
28 Feb 2023 18:58 UTC
−14
points
1
comment
2
min read
LW
link
Help kill “teach a man to fish”
Tyler G Hall
28 Feb 2023 18:53 UTC
24
points
3
comments
1
min read
LW
link
The burden of knowing
arisAlexis
28 Feb 2023 18:40 UTC
5
points
0
comments
2
min read
LW
link
Interpreting Embedding Spaces by Conceptualization
Adi Simhi
28 Feb 2023 18:38 UTC
3
points
0
comments
1
min read
LW
link
(arxiv.org)
A mostly critical review of infra-Bayesianism
David Matolcsi
28 Feb 2023 18:37 UTC
108
points
9
comments
29
min read
LW
link
Performance guarantees in classical learning theory and infra-Bayesianism
David Matolcsi
28 Feb 2023 18:37 UTC
9
points
4
comments
31
min read
LW
link
[Question]
Ethical and incentive-compatible way to share finances with partner when you both work?
Chad Nauseam
28 Feb 2023 18:28 UTC
14
points
3
comments
2
min read
LW
link
(www.reddit.com)
My Experience With Loving Kindness Meditation
maia
28 Feb 2023 18:18 UTC
47
points
8
comments
3
min read
LW
link
(particularvirtue.blogspot.com)