Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Rock, Paper and Scissors: A Game Theory View
Edward P. Könings
Jan 21, 2023, 9:00 PM
18
points
3
comments
4
min read
LW
link
(edwardknings.substack.com)
A new Heuristic to Update on the Credences of Others
aaron_mai
Jan 21, 2023, 9:00 PM
6
points
0
comments
20
min read
LW
link
AI Safety “Textbook”. Test chapter. Orthogonality Thesis, Goodhart Law and Instrumental Convergency
Tapatakt
and
LacrimalBird
Jan 21, 2023, 6:13 PM
4
points
0
comments
12
min read
LW
link
[Linkpost] TIME article: DeepMind’s CEO Helped Take AI Mainstream. Now He’s Urging Caution
Orpheus16
Jan 21, 2023, 4:51 PM
58
points
2
comments
3
min read
LW
link
(time.com)
Small Go Boards
jefftk
Jan 21, 2023, 2:50 PM
18
points
6
comments
2
min read
LW
link
(www.jefftk.com)
[Question]
Why are we so illogical?
Program Den
Jan 21, 2023, 8:28 AM
−25
points
0
comments
1
min read
LW
link
Announcing aisafety.training
JJ Hepburn
Jan 21, 2023, 1:01 AM
61
points
4
comments
1
min read
LW
link
Why real estate is the only investment that matters in AI dominated future
G
Jan 20, 2023, 7:40 PM
7
points
10
comments
1
min read
LW
link
Transcript of Sam Altman’s interview touching on AI safety
Andy_McKenzie
Jan 20, 2023, 4:14 PM
121
points
42
comments
10
min read
LW
link
[Question]
COVID contagiousness after negative tests?
wunan
Jan 20, 2023, 3:02 PM
10
points
2
comments
1
min read
LW
link
Critique of some recent philosophy of LLMs’ minds
Roman Leventov
Jan 20, 2023, 12:53 PM
52
points
8
comments
20
min read
LW
link
Preface
iy3d
Jan 20, 2023, 12:38 PM
4
points
0
comments
2
min read
LW
link
Lost in Innovation: The Case of Phlogiston
adamShimi
Jan 20, 2023, 12:19 PM
19
points
8
comments
4
min read
LW
link
(epistemologicalvigilance.substack.com)
finite, actual infinity, potential infinity
Alok Singh
Jan 20, 2023, 11:00 AM
3
points
15
comments
1
min read
LW
link
(alok.github.io)
Generalizability & Hope for AI [MLAISU W03]
Esben Kran
Jan 20, 2023, 10:06 AM
5
points
2
comments
2
min read
LW
link
(newsletter.apartresearch.com)
What’s going on with ‘crunch time’?
rosehadshar
Jan 20, 2023, 9:42 AM
54
points
6
comments
4
min read
LW
link
Shard theory alignment has important, often-overlooked free parameters.
Charlie Steiner
Jan 20, 2023, 9:30 AM
36
points
10
comments
3
min read
LW
link
Solving For Meta-Ethics By Inducing From The Self
VisionaryHera
Jan 20, 2023, 7:21 AM
4
points
1
comment
9
min read
LW
link
Vegan Nutrition Testing Project: Interim Report
Elizabeth
Jan 20, 2023, 5:50 AM
102
points
37
comments
8
min read
LW
link
(acesounderglass.com)
Maybe you can learn exotic experiences via analytical thought
Q Home
Jan 20, 2023, 1:50 AM
2
points
6
comments
15
min read
LW
link
The Gallery for Painting Transformations—A GPT-3 Analogy
Robert_AIZI
Jan 19, 2023, 11:32 PM
1
point
0
comments
6
min read
LW
link
(aizi.substack.com)
AGI safety field building projects I’d like to see
Severin T. Seehrich
Jan 19, 2023, 10:40 PM
68
points
28
comments
9
min read
LW
link
Extensionality and the univalence axiom of type theory
Thomas Kehrenberg
Jan 19, 2023, 10:36 PM
6
points
2
comments
16
min read
LW
link
The spiritual benefits of material progress
jasoncrawford
Jan 19, 2023, 9:35 PM
24
points
15
comments
7
min read
LW
link
(rootsofprogress.org)
Announcing Cavendish Labs
derikk
and
agg
Jan 19, 2023, 8:15 PM
59
points
5
comments
2
min read
LW
link
(forum.effectivealtruism.org)
Thoughts on refusing harmful requests to large language models
William_S
Jan 19, 2023, 7:49 PM
32
points
4
comments
2
min read
LW
link
MA RMV Overloaded
jefftk
Jan 19, 2023, 4:40 PM
16
points
0
comments
2
min read
LW
link
(www.jefftk.com)
“Heretical Thoughts on AI” by Eli Dourado
DragonGod
Jan 19, 2023, 4:11 PM
146
points
38
comments
3
min read
LW
link
(www.elidourado.com)
Covid 1/19/23: Flipped Numbers
Zvi
Jan 19, 2023, 1:30 PM
19
points
4
comments
4
min read
LW
link
(thezvi.wordpress.com)
List of technical AI safety exercises and projects
JakubK
Jan 19, 2023, 9:35 AM
41
points
5
comments
1
min read
LW
link
(docs.google.com)
Group-level Consequences of Psychological Problems
adamShimi
and
Gabriel Alfour
Jan 19, 2023, 9:27 AM
28
points
3
comments
2
min read
LW
link
6-paragraph AI risk intro for MAISI
JakubK
Jan 19, 2023, 9:22 AM
11
points
0
comments
2
min read
LW
link
(www.maisi.club)
200 COP in MI: Studying Learned Features in Language Models
Neel Nanda
Jan 19, 2023, 3:48 AM
24
points
2
comments
30
min read
LW
link
Amazon closing AmazonSmile to focus its philanthropic giving to programs with greater impact
Gordon Seidoh Worley
Jan 19, 2023, 1:15 AM
10
points
8
comments
LW
link
Gradient Filtering
Jozdien
and
janus
Jan 18, 2023, 8:09 PM
56
points
16
comments
13
min read
LW
link
[Cross-post] Is the Fermi Paradox due to the Flaw of Averages?
Aryeh Englander
,
Lonnie Chrisman
and
Yaakov T
Jan 18, 2023, 7:22 PM
41
points
27
comments
15
min read
LW
link
(lumina.com)
First Three Episodes of The Filan Cabinet
DanielFilan
Jan 18, 2023, 7:20 PM
17
points
1
comment
1
min read
LW
link
[Question]
Best Questions To Vet Potential Ai-Safety Applicants
jacksonjezion
Jan 18, 2023, 7:01 PM
6
points
1
comment
1
min read
LW
link
[Question]
Looking for a specific group of people
FriggenRedChickenMan
Jan 18, 2023, 7:00 PM
15
points
21
comments
1
min read
LW
link
A problem with group epistemics
Mckay Jensen
Jan 18, 2023, 5:06 PM
4
points
4
comments
3
min read
LW
link
(quevivasbien.github.io)
Why you should learn sign language
Noah Topper
Jan 18, 2023, 5:03 PM
53
points
23
comments
7
min read
LW
link
(naivebayes.substack.com)
Flying With Covid
jefftk
Jan 18, 2023, 5:00 PM
44
points
29
comments
3
min read
LW
link
(www.jefftk.com)
Prototype of Using GPT-3 to Generate Textbook-length Content
Rafael Cosman
Jan 18, 2023, 2:25 PM
2
points
8
comments
40
min read
LW
link
(github.com)
How many people are working (directly) on reducing existential risk from AI?
Benjamin Hilton
Jan 18, 2023, 8:46 AM
20
points
1
comment
LW
link
EA & LW Forum Summaries (9th Jan to 15th Jan 23′)
Zoe Williams
Jan 18, 2023, 7:29 AM
17
points
0
comments
LW
link
OpenAI’s Alignment Plan is not S.M.A.R.T.
Søren Elverlin
Jan 18, 2023, 6:39 AM
9
points
19
comments
4
min read
LW
link
[Question]
Formal definition of Ontology Mismatch?
NathanBarnard
Jan 18, 2023, 5:52 AM
6
points
0
comments
1
min read
LW
link
[Question]
Transformer Mech Interp: Any visualizations?
Joyee Chen
Jan 18, 2023, 4:32 AM
3
points
0
comments
1
min read
LW
link
Neural networks generalize because of this one weird trick
Jesse Hoogland
Jan 18, 2023, 12:10 AM
183
points
34
comments
15
min read
LW
link
1
review
(www.jessehoogland.com)
Progress links and tweets, 2023-01-17
jasoncrawford
Jan 17, 2023, 9:31 PM
13
points
3
comments
2
min read
LW
link
(rootsofprogress.org)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel