Summaries: Alignment Fundamentals Curriculum
Leon Lang | Sep 18, 2022, 1:08 PM | 44 points | 3 comments | 1 min read | (docs.google.com)

Bay Solstice 2022 Call For Volunteers
Scott Alexander | Sep 4, 2022, 6:44 AM | 43 points | 2 comments | 1 min read

A Starter-kit for Rationality Space
Jesse Hoogland | Sep 1, 2022, 1:04 PM | 43 points | 0 comments | 1 min read | (github.com)

Appendix: How to run a successful Hamming circle
CFAR!Duncan | Sep 2, 2022, 12:22 AM | 42 points | 6 comments | 7 min read

Georgism in Space
harsimony | Sep 28, 2022, 4:05 PM | 42 points | 12 comments | 4 min read | (harsimony.wordpress.com)

FDT is not directly comparable to CDT and EDT
SMK | Sep 29, 2022, 2:42 PM | 42 points | 8 comments | 11 min read

Transhumanism, genetic engineering, and the biological basis of intelligence.
fowlertm | Sep 14, 2022, 3:55 PM | 41 points | 23 comments | 1 min read

Covid 9/1/22: Meet the New Booster
Zvi | Sep 1, 2022, 2:00 PM | 41 points | 6 comments | 14 min read | (thezvi.wordpress.com)

AI Governance Needs Technical Work
Mau | Sep 5, 2022, 10:28 PM | 41 points | 1 comment | 8 min read

The Defender’s Advantage of Interpretability
Marius Hobbhahn | Sep 14, 2022, 2:05 PM | 41 points | 4 comments | 6 min read

When is intent alignment sufficient or necessary to reduce AGI conflict?
JesseClifton, Sammy Martin and Anthony DiGiovanni | Sep 14, 2022, 7:39 PM | 40 points | 0 comments | 9 min read

Katja Grace on Slowing Down AI, AI Expert Surveys And Estimating AI Risk
Michaël Trazzi | Sep 16, 2022, 5:45 PM | 40 points | 2 comments | 3 min read | (theinsideview.ai)

Overton Gymnastics: An Exercise in Discomfort
Shoshannah Tekofsky and omark | Sep 5, 2022, 7:20 PM | 40 points | 15 comments | 4 min read

What are you for?
lsusr | Sep 6, 2022, 3:32 AM | 39 points | 5 comments | 1 min read

Sticky goals: a concrete experiment for understanding deceptive alignment
evhub | Sep 2, 2022, 9:57 PM | 39 points | 13 comments | 3 min read

FDT defects in a realistic Twin Prisoners’ Dilemma
SMK | Sep 15, 2022, 8:55 AM | 38 points | 1 comment | 26 min read

There are no rules
unoptimal | Sep 23, 2022, 8:47 PM | 38 points | 2 comments | 5 min read

Thoughts on AGI consciousness / sentience
Steven Byrnes | Sep 8, 2022, 4:40 PM | 38 points | 37 comments | 6 min read

Framing AI Childhoods
David Udell | Sep 6, 2022, 11:40 PM | 37 points | 8 comments | 4 min read

Put Dirty Dishes in the Dishwasher
jefftk | Sep 10, 2022, 1:10 PM | 37 points | 16 comments | 1 min read | (www.jefftk.com)

Safety timelines: How long will it take to solve alignment?
Esben Kran, JonathanRystroem and Steinthal | Sep 19, 2022, 12:53 PM | 37 points | 7 comments | 6 min read | (forum.effectivealtruism.org)

Behaviour Manifolds and the Hessian of the Total Loss—Notes and Criticism
carboniferous_umbraculum | Sep 3, 2022, 12:15 AM | 35 points | 5 comments | 6 min read

How should DeepMind’s Chinchilla revise our AI forecasts?
Cleo Nardo | Sep 15, 2022, 5:54 PM | 35 points | 12 comments | 13 min read

Ought will host a factored cognition “Lab Meeting”
jungofthewon and stuhlmueller | Sep 9, 2022, 11:46 PM | 35 points | 1 comment | 1 min read

Covid 9/8/22: Booster Boosting
Zvi | Sep 8, 2022, 1:50 PM | 34 points | 9 comments | 24 min read | (thezvi.wordpress.com)

[Question] Why doesn’t China (or didn’t anyone) encourage/mandate elastomeric respirators to control COVID?
Wei Dai | Sep 17, 2022, 3:07 AM | 34 points | 15 comments | 1 min read

Mathematical Circuits in Neural Networks
Sean Osier | Sep 22, 2022, 3:48 AM | 34 points | 4 comments | 1 min read | (www.youtube.com)

Emergency Residential Solar Jury-Rigging
jefftk | Sep 17, 2022, 2:30 AM | 34 points | 0 comments | 3 min read | (www.jefftk.com)

[Question] Forecasting thread: How does AI risk level vary based on timelines?
elifland | Sep 14, 2022, 11:56 PM | 34 points | 7 comments | 1 min read

Twitter Polls: Evidence is Evidence
Zvi | Sep 20, 2022, 12:30 PM | 34 points | 8 comments | 7 min read | (thezvi.wordpress.com)

D&D.Sci September 2022: The Allocation Helm
abstractapplic | Sep 16, 2022, 11:10 PM | 34 points | 34 comments | 1 min read

Biden should be applauded for appointing Renee Wegrzyn for ARPA-H
ChristianKl | Sep 18, 2022, 7:57 PM | 34 points | 0 comments | 2 min read

[Question] What’s the longest a sentient observer could survive in the Dark Era?
Raemon | Sep 15, 2022, 8:43 AM | 33 points | 15 comments | 1 min read

A Pin and a Balloon: Anthropic Fragility Increases Chances of Runaway Global Warming
avturchin | Sep 11, 2022, 10:25 AM | 33 points | 23 comments | 52 min read

90% of anything should be bad (& the precision-recall tradeoff)
cartografie | Sep 8, 2022, 1:20 AM | 33 points | 22 comments | 6 min read

On oxytocin-sensitive neurons in auditory cortex
Steven Byrnes | Sep 6, 2022, 12:54 PM | 33 points | 6 comments | 12 min read

[Question] Can someone explain to me why most researchers think alignment is probably something that is humanly tractable?
iamthouthouarti | Sep 3, 2022, 1:12 AM | 32 points | 11 comments | 1 min read

Covid 9/15/22: Permanent Normal
Zvi | Sep 15, 2022, 4:00 PM | 32 points | 9 comments | 20 min read | (thezvi.wordpress.com)

[Question] Why are we sure that AI will “want” something?
Shmi | Sep 16, 2022, 8:35 PM | 31 points | 57 comments | 1 min read

Shahar Avin On How To Regulate Advanced AI Systems
Michaël Trazzi | Sep 23, 2022, 3:46 PM | 31 points | 0 comments | 4 min read | (theinsideview.ai)

Strategy For Conditioning Generative Models
james.lucassen and evhub | Sep 1, 2022, 4:34 AM | 31 points | 4 comments | 18 min read

Guidelines for Mad Entrepreneurs
David Udell | Sep 16, 2022, 6:33 AM | 31 points | 0 comments | 11 min read

New tool for exploring EA Forum, LessWrong and Alignment Forum—Tree of Tags
Filip Sondej | Sep 13, 2022, 5:33 PM | 31 points | 2 comments | 1 min read

AI Safety Endgame Stories
Ivan Vendrov | Sep 28, 2022, 4:58 PM | 31 points | 11 comments | 11 min read

I Tripped and Became GPT! (And How This Updated My Timelines)
Frankophone | Sep 1, 2022, 5:56 PM | 31 points | 0 comments | 4 min read

D&D.Sci September 2022 Evaluation and Ruleset
abstractapplic | Sep 26, 2022, 10:19 PM | 30 points | 5 comments | 3 min read

Unit Test Everything
DirectedEvolution | Sep 29, 2022, 6:12 PM | 30 points | 0 comments | 8 min read

Time is not the bottleneck (on making progress thinking about difficult things)
kman | Sep 12, 2022, 8:45 PM | 30 points | 10 comments | 1 min read

Representational Tethers: Tying AI Latents To Human Ones
Paul Bricman | Sep 16, 2022, 2:45 PM | 30 points | 0 comments | 16 min read

Renormalization: Why Bigger is Simpler
tailcalled | Sep 14, 2022, 5:52 PM | 30 points | 5 comments | 1 min read | (www.youtube.com)