[Question]
Wouldn’t weak AI agents provide warning?
Mandatory Topic
Apr 26, 2024, 7:34 PM
5
points
0
comments
1
min read
LW
link
World models
A*
Apr 26, 2024, 7:11 PM
1
point
0
comments
1
min read
LW
link
Duct Tape security
Isaac King
Apr 26, 2024, 6:57 PM
69
points
11
comments
5
min read
LW
link
Fundamental Uncertainty: Chapter 8 - When does fundamental uncertainty matter?
Gordon Seidoh Worley
Apr 26, 2024, 6:10 PM
11
points
2
comments
32
min read
LW
link
Scaling of AI training runs will slow down after GPT-5
Maxime Riché
Apr 26, 2024, 4:05 PM
40
points
5
comments
3
min read
LW
link
Spatial attention as a “tell” for empathetic simulation?
Steven Byrnes
Apr 26, 2024, 3:10 PM
55
points
12
comments
8
min read
LW
link
Arch-anarchy
Peter lawless
Apr 26, 2024, 3:05 PM
−1
points
1
comment
25
min read
LW
link
Breadboarding a Whistle Synth
jefftk
Apr 26, 2024, 3:00 PM
9
points
2
comments
2
min read
LW
link
(www.jefftk.com)
An Introduction to AI Sandbagging
Teun van der Weij, Felix Hofstätter and Francis Rhys Ward
Apr 26, 2024, 1:40 PM
46
points
13
comments
8
min read
LW
link
LLMs seem (relatively) safe
JustisMills
Apr 25, 2024, 10:13 PM
53
points
24
comments
7
min read
LW
link
(justismills.substack.com)
Losing Faith In Contrarianism
Bentham's Bulldog
Apr 25, 2024, 8:53 PM
39
points
44
comments
5
min read
LW
link
Why I stopped being into basin broadness
tailcalled
Apr 25, 2024, 8:47 PM
16
points
3
comments
2
min read
LW
link
AXRP Episode 29 - Science of Deep Learning with Vikrant Varma
DanielFilan
Apr 25, 2024, 7:10 PM
20
points
1
comment
63
min read
LW
link
Improving Dictionary Learning with Gated Sparse Autoencoders
Senthooran Rajamanoharan, Arthur Conmy, lewis smith, Tom Lieberum, Vikrant Varma, János Kramár, Rohin Shah and Neel Nanda
Apr 25, 2024, 6:43 PM
63
points
38
comments
1
min read
LW
link
(arxiv.org)
“Why I Write” by George Orwell (1946)
Arjun Panickssery
Apr 25, 2024, 4:02 PM
59
points
2
comments
9
min read
LW
link
(www.orwellfoundation.com)
Knowledge Base 8: The truth as an attractor in the information space
iwis
Apr 25, 2024, 3:28 PM
−8
points
0
comments
2
min read
LW
link
Cybersecurity of Frontier AI Models: A Regulatory Review
Deric Cheng and Elliot Mckernon
Apr 25, 2024, 2:51 PM
8
points
0
comments
8
min read
LW
link
The first future and the best future
KatjaGrace
Apr 25, 2024, 6:40 AM
106
points
12
comments
1
min read
LW
link
(worldspiritsockpuppet.com)
NIH Cancer Myths Myths
belkarx
Apr 25, 2024, 5:43 AM
15
points
1
comment
2
min read
LW
link
social lemon markets
bhauth
Apr 25, 2024, 2:18 AM
22
points
6
comments
3
min read
LW
link
(www.bhauth.com)
Bayesian inference without priors
DanielFilan
Apr 24, 2024, 11:50 PM
26
points
8
comments
8
min read
LW
link
(danielfilan.com)
The Inner Ring by C. S. Lewis
Saul Munn
Apr 24, 2024, 10:48 PM
69
points
6
comments
13
min read
LW
link
(www.lewissociety.org)
This is Water by David Foster Wallace
Nathan Young
Apr 24, 2024, 9:21 PM
60
points
16
comments
13
min read
LW
link
(fs.blog)
Betadine oral rinses for covid and other viral infections
Elizabeth
Apr 24, 2024, 5:50 PM
22
points
3
comments
5
min read
LW
link
(acesounderglass.com)
At last! ChatGPT does, shall we say, interesting imitations of “Kubla Khan”
Bill Benzon
Apr 24, 2024, 2:56 PM
−3
points
0
comments
4
min read
LW
link
Magic by forgetting
avturchin
Apr 24, 2024, 2:32 PM
18
points
39
comments
4
min read
LW
link
Changes in College Admissions
Zvi
Apr 24, 2024, 1:50 PM
50
points
11
comments
39
min read
LW
link
(thezvi.wordpress.com)
1-page outline of Carlsmith’s otherness and control series
Nathan Young
Apr 24, 2024, 11:25 AM
22
points
3
comments
3
min read
LW
link
How to use and interpret activation patching
StefanHex and Neel Nanda
Apr 24, 2024, 8:35 AM
13
points
6
comments
18
min read
LW
link
AI Generated Music as a Method of Installing Essential Rationalist Skills
keltan
Apr 24, 2024, 7:48 AM
18
points
4
comments
1
min read
LW
link
Electronic Harp Mandolin Prototype
jefftk
Apr 24, 2024, 2:20 AM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
Examples of Highly Counterfactual Discoveries?
johnswentworth
Apr 23, 2024, 10:19 PM
197
points
108
comments
1
min read
LW
link
[Question]
Is there software to practice reading expressions?
lsusr
Apr 23, 2024, 9:53 PM
37
points
11
comments
1
min read
LW
link
Let’s Design A School, Part 1
Sable
Apr 23, 2024, 9:50 PM
56
points
5
comments
11
min read
LW
link
(affablyevil.substack.com)
WSJ: Inside Amazon’s Secret Operation to Gather Intel on Rivals
trevor
Apr 23, 2024, 9:33 PM
37
points
5
comments
5
min read
LW
link
(www.wsj.com)
On Minicircle
Metacelsus
Apr 23, 2024, 9:28 PM
10
points
0
comments
1
min read
LW
link
(docs.google.com)
Simple probes can catch sleeper agents
Monte M, Carson Denison, Zac Hatfield-Dodds, David Duvenaud, Sam Bowman, Ethan Perez and evhub
Apr 23, 2024, 9:10 PM
133
points
21
comments
1
min read
LW
link
(www.anthropic.com)
Manifold “exploring real cash prizes”
Rana Dexsin
Apr 23, 2024, 9:07 PM
7
points
0
comments
1
min read
LW
link
(manifoldmarkets.notion.site)
[Question]
(When) Should you work through the night when inspiration strikes you?
Chi Nguyen
Apr 23, 2024, 9:07 PM
21
points
4
comments
1
min read
LW
link
Book review: Deep Utopia
PeterMcCluskey
Apr 23, 2024, 7:55 PM
45
points
14
comments
4
min read
LW
link
(bayesianinvestor.com)
On what research policymakers actually need
MondSemmel
Apr 23, 2024, 7:50 PM
38
points
0
comments
3
min read
LW
link
(www.slowboring.com)
Dequantifying first-order theories
jessicata
Apr 23, 2024, 7:04 PM
40
points
9
comments
8
min read
LW
link
(unstableontology.com)
Vector Planning in a Lattice Graph
Johannes C. Mayer and Thomas Kehrenberg
Apr 23, 2024, 4:58 PM
20
points
7
comments
2
min read
LW
link
ProLU: A Nonlinearity for Sparse Autoencoders
Glen Taggart
Apr 23, 2024, 2:09 PM
44
points
4
comments
9
min read
LW
link
Subjective Questions Require Subjective information
Ben
Apr 23, 2024, 1:16 PM
7
points
4
comments
4
min read
LW
link
Rejecting Television
Declan Molony
Apr 23, 2024, 4:59 AM
90
points
10
comments
6
min read
LW
link
LW Frontpage Experiments! (aka “Take the wheel, Shoggoth!”)
Ruby and RobertM
Apr 23, 2024, 3:58 AM
71
points
27
comments
5
min read
LW
link
Thoughts on Zero Points
depressurize
Apr 23, 2024, 2:22 AM
31
points
1
comment
4
min read
LW
link
(sexandchicago.substack.com)
Funny Anecdote of Eliezer From His Sister
Noah Birnbaum
Apr 22, 2024, 10:05 PM
207
points
6
comments
2
min read
LW
link
How LLMs Work, in the Style of The Economist
utilistrutil
Apr 22, 2024, 7:06 PM
0
points
0
comments
2
min read
LW
link