Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
A couple productivity tips for overthinkers
Steven Byrnes
Apr 20, 2024, 4:05 PM
79
points
13
comments
4
min read
LW
link
“You’re the most beautiful girl in the world” and Wittgensteinian Language Games
Chris_Leong
Apr 20, 2024, 2:54 PM
5
points
18
comments
1
min read
LW
link
Past Tense Features
Can
Apr 20, 2024, 2:34 PM
12
points
0
comments
4
min read
LW
link
Thoughts on seed oil
dynomight
Apr 20, 2024, 12:29 PM
357
points
129
comments
17
min read
LW
link
(dynomight.net)
How to know whether you are an idealist or a physicalist/materialist
JackOfAllTrades
Apr 20, 2024, 11:53 AM
−3
points
2
comments
1
min read
LW
link
How I Think, Part Four: Money is Weird
Richard Henage
Apr 20, 2024, 6:21 AM
0
points
3
comments
5
min read
LW
link
The power of finite and the weakness of infinite binary point numbers
AxiomWriter
Apr 20, 2024, 6:03 AM
−3
points
6
comments
2
min read
LW
link
WISDOMISM A Moral Theory for the Age of Information
Peter lawless
Apr 19, 2024, 11:06 PM
2
points
0
comments
9
min read
LW
link
Inducing Unprompted Misalignment in LLMs
Sam Svenningsen
,
evhub
and
Henry Sleight
Apr 19, 2024, 8:00 PM
38
points
7
comments
16
min read
LW
link
Introspection
A*
Apr 19, 2024, 7:10 PM
7
points
0
comments
1
min read
LW
link
[Full Post] Progress Update #1 from the GDM Mech Interp Team
Neel Nanda
,
Arthur Conmy
,
lewis smith
,
Senthooran Rajamanoharan
,
Tom Lieberum
,
János Kramár
and
Vikrant Varma
Apr 19, 2024, 7:06 PM
79
points
10
comments
8
min read
LW
link
[Summary] Progress Update #1 from the GDM Mech Interp Team
Neel Nanda
,
Arthur Conmy
,
lewis smith
,
Senthooran Rajamanoharan
,
Tom Lieberum
,
János Kramár
and
Vikrant Varma
Apr 19, 2024, 7:06 PM
72
points
0
comments
3
min read
LW
link
Daniel Dennett has died (1942-2024)
kave
Apr 19, 2024, 4:17 PM
150
points
5
comments
1
min read
LW
link
(dailynous.com)
Events Booking New Callers?
jefftk
Apr 19, 2024, 3:50 PM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
What is the best way to talk about probabilities you expect to change with evidence/experiments?
Will_Pearson
Apr 19, 2024, 3:35 PM
14
points
11
comments
1
min read
LW
link
CTMU insight: maybe consciousness *can* affect quantum outcomes?
zhukeepa
Apr 19, 2024, 3:23 PM
13
points
11
comments
5
min read
LW
link
Demonstrate and evaluate risks from AI to society at the AI x Democracy research hackathon
Esben Kran
Apr 19, 2024, 2:46 PM
5
points
0
comments
LW
link
(www.apartresearch.com)
[Question]
How to Model the Future of Open-Source LLMs?
Joel Burget
Apr 19, 2024, 2:28 PM
25
points
9
comments
1
min read
LW
link
What’s up with all the non-Mormons? Weirdly specific universalities across LLMs
mwatkins
Apr 19, 2024, 1:43 PM
40
points
13
comments
27
min read
LW
link
[Question]
If digital goods in virtual worlds increase GDP, do we actually become richer?
No77e
Apr 19, 2024, 10:06 AM
6
points
10
comments
1
min read
LW
link
Experiment on repeating choices
KatjaGrace
Apr 19, 2024, 4:20 AM
56
points
1
comment
3
min read
LW
link
(worldspiritsockpuppet.com)
Effective Altruists and Rationalists Views & The case for using marketing to highlight AI risks.
gilch
Apr 19, 2024, 4:16 AM
6
points
1
comment
1
min read
LW
link
(youtu.be)
Cohesion and business problems
Adam Zerner
Apr 19, 2024, 12:45 AM
12
points
8
comments
4
min read
LW
link
The Thermodynamics of Death
Peter lawless
Apr 19, 2024, 12:36 AM
6
points
0
comments
10
min read
LW
link
Backyard Office
jefftk
Apr 19, 2024, 12:31 AM
13
points
0
comments
1
min read
LW
link
(www.jefftk.com)
hydrogen tube transport
bhauth
Apr 18, 2024, 10:47 PM
34
points
12
comments
5
min read
LW
link
(www.bhauth.com)
LessOnline Festival Updates Thread
Ben Pace
Apr 18, 2024, 9:55 PM
59
points
26
comments
1
min read
LW
link
A Review of In-Context Learning Hypotheses for Automated AI Alignment Research
alamerton
Apr 18, 2024, 6:29 PM
25
points
4
comments
16
min read
LW
link
I’m open for projects (sort of)
cousin_it
Apr 18, 2024, 6:05 PM
46
points
13
comments
1
min read
LW
link
Blessed information, garbage information, cursed information
tailcalled
Apr 18, 2024, 4:56 PM
23
points
8
comments
3
min read
LW
link
[Fiction] A Confession
Arjun Panickssery
Apr 18, 2024, 4:28 PM
38
points
2
comments
5
min read
LW
link
(arjunpanickssery.substack.com)
Discriminating Behaviorally Identical Classifiers: a model problem for applying interpretability to scalable oversight
Sam Marks
Apr 18, 2024, 4:17 PM
113
points
10
comments
12
min read
LW
link
Cooperation is optimal, with weaker agents too - tldr
Ryo
Apr 18, 2024, 3:03 PM
12
points
22
comments
4
min read
LW
link
(medium.com)
How to coordinate despite our biases? - tldr
Ryo
Apr 18, 2024, 3:03 PM
3
points
2
comments
3
min read
LW
link
(medium.com)
Knowledge Base 7: Long-tail knowledge and collective intelligence
iwis
Apr 18, 2024, 2:21 PM
−6
points
0
comments
1
min read
LW
link
AI #60: Oh the Humanity
Zvi
Apr 18, 2024, 2:10 PM
44
points
7
comments
62
min read
LW
link
(thezvi.wordpress.com)
UDT1.01: Logical Inductors and Implicit Beliefs (5/10)
Diffractor
Apr 18, 2024, 8:39 AM
34
points
2
comments
19
min read
LW
link
An examination of GPT-2′s boring yet effective glitch
MiguelDev
Apr 18, 2024, 5:26 AM
5
points
3
comments
3
min read
LW
link
[Question]
What if Ethics is Provably Self-Contradictory?
Yitz
Apr 18, 2024, 5:12 AM
3
points
7
comments
2
min read
LW
link
The Mom Test: Summary and Thoughts
Adam Zerner
Apr 18, 2024, 3:34 AM
48
points
3
comments
10
min read
LW
link
Express interest in an “FHI of the West”
habryka
Apr 18, 2024, 3:32 AM
268
points
41
comments
3
min read
LW
link
Why Would Belief-States Have A Fractal Structure, And Why Would That Matter For Interpretability? An Explainer
johnswentworth
and
David Lorell
Apr 18, 2024, 12:27 AM
185
points
21
comments
7
min read
LW
link
AXRP Episode 28 - Suing Labs for AI Risk with Gabriel Weil
DanielFilan
17 Apr 2024 21:42 UTC
12
points
0
comments
65
min read
LW
link
LLM Evaluators Recognize and Favor Their Own Generations
Arjun Panickssery
,
Sam Bowman
and
Shi Feng
17 Apr 2024 21:09 UTC
45
points
1
comment
3
min read
LW
link
(tiny.cc)
SFS: Foundations of Forecasting
MAD2
17 Apr 2024 17:46 UTC
3
points
0
comments
1
min read
LW
link
An ethical framework to supersede Utilitarianism
metalcrow
17 Apr 2024 17:18 UTC
1
point
4
comments
4
min read
LW
link
Moving on from community living
Vika
17 Apr 2024 17:02 UTC
63
points
7
comments
3
min read
LW
link
(vkrakovna.wordpress.com)
Staged release
Zach Stein-Perlman
17 Apr 2024 16:00 UTC
11
points
4
comments
2
min read
LW
link
[Question]
Discomfort Stacking
Lewis O’Brien
17 Apr 2024 14:49 UTC
5
points
12
comments
1
min read
LW
link
FHI (Future of Humanity Institute) has shut down (2005–2024)
gwern
17 Apr 2024 13:54 UTC
176
points
22
comments
1
min read
LW
link
(www.futureofhumanityinstitute.org)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel