Book Review: Invisible China
Yudhister Kumar
Oct 14, 2023, 9:51 PM
4
points
0
comments
4
min read
LW
link
(ykumar.org)
Book Review: Radical Markets
Yudhister Kumar
Oct 14, 2023, 9:41 PM
11
points
0
comments
15
min read
LW
link
(ykumar.org)
[Question] One-on-one tutoring for any subject
yakimoff
Oct 14, 2023, 8:58 PM
8
points
5
comments
1
min read
LW
link
The Puritans would one-box: evidential decision theory in the 17th century
Jacob G-W
Oct 14, 2023, 8:23 PM
86
points
5
comments
3
min read
LW
link
(jacobgw.com)
Natural Abstraction: Convergent Preferences Over Information Structures
paulom
Oct 14, 2023, 6:34 PM
28
points
1
comment
36
min read
LW
link
ChatGPT tells 20 versions of its prototypical story, with a short note on method
Bill Benzon
Oct 14, 2023, 3:27 PM
6
points
0
comments
5
min read
LW
link
Will no one rid me of this turbulent pest?
Metacelsus
Oct 14, 2023, 3:27 PM
154
points
23
comments
10
min read
LW
link
(denovo.substack.com)
Which Anaesthetic To Choose?
dadadarren
Oct 14, 2023, 2:55 PM
10
points
15
comments
1
min read
LW
link
Is the Wave non-disparagement thingy okay?
Ruby, Linch and Auckland
Oct 14, 2023, 5:31 AM
29
points
13
comments
11
min read
LW
link
The Gods of Straight Lines
Richard_Ngo
Oct 14, 2023, 4:10 AM
67
points
13
comments
5
min read
LW
link
(www.narrativeark.xyz)
Eight Magic Lamps
Richard_Ngo
Oct 14, 2023, 4:10 AM
40
points
0
comments
6
min read
LW
link
(www.narrativeark.xyz)
RSPs are pauses done right
evhub
Oct 14, 2023, 4:06 AM
164
points
73
comments
7
min read
LW
link
1
review
Dishonorable Gossip and Going Crazy
Ben Pace and Unreal
Oct 14, 2023, 4:00 AM
29
points
31
comments
23
min read
LW
link
Disentangling Our Terminal and Instrumental Values
PeterMcCluskey
Oct 14, 2023, 3:35 AM
11
points
1
comment
4
min read
LW
link
(bayesianinvestor.com)
Global Pause AI Protest 10/21
Holly_Elmore, Joseph Miller and joepio
Oct 14, 2023, 3:20 AM
5
points
0
comments
1
min read
LW
link
[Question] Literature On Existential Risk From Atmospheric Contamination?
Yitz
Oct 13, 2023, 10:27 PM
6
points
3
comments
1
min read
LW
link
How to partition teams to move fast? Debating “low-dimensional cuts”
Bird Concept and kave
Oct 13, 2023, 9:43 PM
41
points
2
comments
11
min read
LW
link
Gothenburg LW / ACX meetup
Stefan
Oct 13, 2023, 9:39 PM
2
points
0
comments
1
min read
LW
link
Meta-Regulations
Sable
Oct 13, 2023, 9:23 PM
18
points
5
comments
10
min read
LW
link
(affablyevil.substack.com)
Hiring: Lighthaven Events & Venue Lead
Raemon
Oct 13, 2023, 9:02 PM
69
points
3
comments
4
min read
LW
link
Prediction markets covered in the NYT podcast “Hard Fork”
Austin Chen
Oct 13, 2023, 6:43 PM
56
points
6
comments
LW
link
(www.nytimes.com)
[Paper] All’s Fair In Love And Love: Copy Suppression in GPT-2 Small
CallumMcDougall, Arthur Conmy, Cody Rushing, Tom McGrath and Neel Nanda
Oct 13, 2023, 6:32 PM
82
points
4
comments
8
min read
LW
link
[Question] Intelligence Enhancement (Monthly Thread) 13 Oct 2023
Nicholas / Heather Kross
Oct 13, 2023, 5:28 PM
52
points
40
comments
1
min read
LW
link
FLI podcast series, “Imagine A World”, about aspirational futures with AGI
Jackson Wagner
Oct 13, 2023, 4:07 PM
9
points
0
comments
4
min read
LW
link
To open-source or to not open-source, that is (an oversimplification of) the question.
Justin Bullock
Oct 13, 2023, 3:10 PM
12
points
5
comments
5
min read
LW
link
Combination Lock Boxes
jefftk
Oct 13, 2023, 12:50 PM
17
points
9
comments
1
min read
LW
link
(www.jefftk.com)
Circle of Support (Oct 14th @ 10am PST)
Alexei
Oct 13, 2023, 9:24 AM
19
points
1
comment
1
min read
LW
link
[Question] How can the world handle the HAMAS situation?
Annapurna
Oct 13, 2023, 9:15 AM
5
points
43
comments
1
min read
LW
link
UVic AI Ethics Conference
TristanTrim and Leo Mckee-Reid
Oct 13, 2023, 7:31 AM
3
points
1
comment
1
min read
LW
link
LW UI features you might not have tried
Elizabeth
Oct 13, 2023, 3:04 AM
49
points
6
comments
1
min read
LW
link
Revisiting Guide Dogs and Blindness Prevention
jefftk
Oct 13, 2023, 2:30 AM
22
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Paper: Understanding and Controlling a Maze-Solving Policy Network
TurnTrout, Ulisse Mini, peligrietzer, mrinank_sharma, Austin Meek, Monte M and lisathiergart
Oct 13, 2023, 1:38 AM
70
points
0
comments
1
min read
LW
link
(arxiv.org)
OPTIC: Announcing Intercollegiate Forecasting Tournaments in SF, DC, Boston
Saul Munn, Jingyi Wang and Tom Shlomi
Oct 13, 2023, 1:36 AM
6
points
0
comments
1
min read
LW
link
Progress links digest, 2023-10-12: Dyson sphere thermodynamics and a cure for cavities
jasoncrawford
Oct 13, 2023, 12:41 AM
15
points
1
comment
10
min read
LW
link
(rootsofprogress.org)
What do Marginal Grants at EAIF Look Like? Funding Priorities and Grantmaking Thresholds at the EA Infrastructure Fund
Linch
Oct 12, 2023, 9:40 PM
20
points
0
comments
LW
link
unRLHF—Efficiently undoing LLM safeguards
Pranav Gade, Jeffrey Ladish and Simon Lermen
Oct 12, 2023, 7:58 PM
117
points
15
comments
20
min read
LW
link
LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B
Simon Lermen and Jeffrey Ladish
Oct 12, 2023, 7:58 PM
151
points
29
comments
14
min read
LW
link
[Question] Looking for reading recommendations: Theories of right/justice that safeguard against having one’s job automated?
bulKlub
Oct 12, 2023, 7:40 PM
−1
points
1
comment
1
min read
LW
link
The International PauseAI Protest: Activism under uncertainty
Joseph Miller
Oct 12, 2023, 5:36 PM
32
points
1
comment
LW
link
AI #33: Cool New Interpretability Paper
Zvi
Oct 12, 2023, 4:20 PM
46
points
18
comments
46
min read
LW
link
(thezvi.wordpress.com)
Noticing confusion in physics
Jacob G-W
Oct 12, 2023, 3:21 PM
20
points
27
comments
2
min read
LW
link
(jacobgw.com)
[Question] How to make to-do lists (and to get things done)?
TeaTieAndHat
Oct 12, 2023, 2:26 PM
9
points
13
comments
2
min read
LW
link
Relevance of ‘Harmful Intelligence’ Data in Training Datasets (WebText vs. Pile)
MiguelDev
Oct 12, 2023, 12:08 PM
12
points
0
comments
9
min read
LW
link
Soulmate Fermi Estimate + My A(ltr)u[t]istic Mating Strategy
Jordan Arel
Oct 12, 2023, 8:32 AM
0
points
9
comments
3
min read
LW
link
Evolution Solved Alignment (what sharp left turn?)
jacob_cannell
Oct 12, 2023, 4:15 AM
23
points
89
comments
4
min read
LW
link
The CHOICE
Gabi QUENE
Oct 12, 2023, 3:02 AM
−29
points
2
comments
3
min read
LW
link
Solstice 2023 Roundup
dspeyer
Oct 11, 2023, 11:09 PM
28
points
6
comments
1
min read
LW
link
Understanding LLMs: Some basic observations about words, syntax, and discourse [w/ a conjecture about grokking]
Bill Benzon
Oct 11, 2023, 7:13 PM
6
points
0
comments
5
min read
LW
link
[Linkpost] Generalization in diffusion models arises from geometry-adaptive harmonic representation
Bogdan Ionut Cirstea
Oct 11, 2023, 5:48 PM
4
points
3
comments
1
min read
LW
link
What I’ve been reading, October 2023: The stirrup in Europe, 19th-century art deco, and more
jasoncrawford
Oct 11, 2023, 4:11 PM
18
points
2
comments
11
min read
LW
link
(rootsofprogress.org)