Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
A few more ants and grasshoppers
c.trout
Jun 17, 2023, 11:38 PM
16
points
3
comments
4
min read
LW
link
The “Loss Function of Reality” Is Not So Spiky and Unpredictable
Thoth Hermes
Jun 17, 2023, 9:43 PM
12
points
0
comments
6
min read
LW
link
(thothhermes.substack.com)
[Question]
What is the foundation of me experiencing the present moment being right now and not at some other point in time?
MvB
Jun 17, 2023, 8:47 PM
20
points
19
comments
1
min read
LW
link
Adventist Health Study-2 supports pescetarianism more than veganism
Elizabeth
Jun 17, 2023, 8:10 PM
67
points
11
comments
6
min read
LW
link
(acesounderglass.com)
The environment as infrastructure
jasoncrawford
Jun 17, 2023, 6:42 PM
28
points
9
comments
1
min read
LW
link
(rootsofprogress.org)
A summary of current work in AI governance
constructive
Jun 17, 2023, 6:41 PM
44
points
1
comment
11
min read
LW
link
(forum.effectivealtruism.org)
[Linkpost] Rosetta Neurons: Mining the Common Units in a Model Zoo
Bogdan Ionut Cirstea
Jun 17, 2023, 4:38 PM
12
points
0
comments
1
min read
LW
link
Partial Simulation Extrapolation: A Proposal for Building Safer Simulators
lukemarks
Jun 17, 2023, 1:55 PM
16
points
0
comments
10
min read
LW
link
Alewife Train is Now Arriving
jefftk
Jun 17, 2023, 1:20 PM
21
points
4
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
What fraction of words written/read are AI-written?
Mati_Roy
Jun 17, 2023, 1:15 PM
8
points
6
comments
1
min read
LW
link
Are Bayesian methods guaranteed to overfit?
Ege Erdil
Jun 17, 2023, 12:52 PM
52
points
5
comments
3
min read
LW
link
(www.yulingyao.com)
The AI governance gaps in developing countries
ntran
Jun 17, 2023, 2:50 AM
20
points
1
comment
14
min read
LW
link
June and Mulberries
jefftk
Jun 17, 2023, 1:30 AM
13
points
2
comments
1
min read
LW
link
(www.jefftk.com)
Updating Drexler’s CAIS model
Matthew Barnett
Jun 16, 2023, 10:53 PM
47
points
32
comments
4
min read
LW
link
Avoiding metaphysics means giving bad philosophy a free pass
Aditya
Jun 16, 2023, 8:54 PM
5
points
9
comments
4
min read
LW
link
Criticism of Eliezer’s irrational moral beliefs
Jorterder
Jun 16, 2023, 8:47 PM
−17
points
21
comments
1
min read
LW
link
Cartography, blowing one’s mind, the illusion of separation and other general musings
Neil
Jun 16, 2023, 7:19 PM
0
points
4
comments
2
min read
LW
link
[Replication] Conjecture’s Sparse Coding in Small Transformers
Hoagy
and
Logan Riggs
Jun 16, 2023, 6:02 PM
52
points
0
comments
5
min read
LW
link
Longevity: Double Human Lifespan in the Next Decade?
Jannik Schg
Jun 16, 2023, 5:51 PM
1
point
0
comments
1
min read
LW
link
LLMs Sometimes Generate Purely Negatively-Reinforced Text
Fabien Roger
Jun 16, 2023, 4:31 PM
177
points
11
comments
7
min read
LW
link
Palantir’s AI models
ChristianKl
Jun 16, 2023, 4:20 PM
26
points
16
comments
1
min read
LW
link
(www.palantir.com)
[Linkpost] Faith and Fate: Limits of Transformers on Compositionality
Joe Kwon
Jun 16, 2023, 3:04 PM
19
points
4
comments
1
min read
LW
link
(arxiv.org)
The ones who endure
Richard_Ngo
Jun 16, 2023, 2:40 PM
65
points
16
comments
5
min read
LW
link
(www.thinkingcomplete.com)
Conjecture: A standing offer for public debates on AI
Andrea_Miotti
Jun 16, 2023, 2:33 PM
29
points
1
comment
2
min read
LW
link
(www.conjecture.dev)
Explaining “Taking features out of superposition with sparse autoencoders”
Robert_AIZI
Jun 16, 2023, 1:59 PM
10
points
0
comments
8
min read
LW
link
(aizi.substack.com)
[Question]
How not to write the Cookbook of Doom?
brunoparga
Jun 16, 2023, 1:37 PM
17
points
5
comments
1
min read
LW
link
Scaffolded LLMs: Less Obvious Concerns
Stephen Fowler
Jun 16, 2023, 10:39 AM
34
points
15
comments
14
min read
LW
link
Motivation in AI
nickasaf
Jun 16, 2023, 9:50 AM
−1
points
1
comment
2
min read
LW
link
DSLT 0. Distilling Singular Learning Theory
Liam Carroll
Jun 16, 2023, 9:50 AM
80
points
7
comments
5
min read
LW
link
DSLT 1. The RLCT Measures the Effective Dimension of Neural Networks
Liam Carroll
Jun 16, 2023, 9:50 AM
54
points
10
comments
13
min read
LW
link
[Linkpost] Mapping Brains with Language Models: A Survey
Bogdan Ionut Cirstea
Jun 16, 2023, 9:49 AM
5
points
0
comments
1
min read
LW
link
Rational Animations is looking for an AI Safety scriptwriter, a lead community manager, and other roles.
Writer
Jun 16, 2023, 9:41 AM
74
points
1
comment
3
min read
LW
link
[Question]
Does anyone’s full-time job include reading and understanding all the most-promising formal AI alignment work?
Nicholas / Heather Kross
Jun 16, 2023, 2:24 AM
15
points
2
comments
1
min read
LW
link
Leveling Up Or Leveling Off? Understanding The Science Behind Skill Plateaus
lynettebye
Jun 16, 2023, 12:18 AM
45
points
9
comments
18
min read
LW
link
human intelligence may be alignment-limited
bhauth
Jun 15, 2023, 10:32 PM
16
points
3
comments
2
min read
LW
link
Developing a technology with safety in mind: Lessons from the Wright Brothers
jasoncrawford
Jun 15, 2023, 9:08 PM
30
points
4
comments
3
min read
LW
link
(rootsofprogress.org)
AXRP Episode 22 - Shard Theory with Quintin Pope
DanielFilan
Jun 15, 2023, 7:00 PM
52
points
11
comments
93
min read
LW
link
Can we accelerate human progress? Moderated Conversation in NYC
Jannik Schg
Jun 15, 2023, 5:33 PM
1
point
0
comments
1
min read
LW
link
Group Prioritarianism: Why AI Should Not Replace Humanity [draft]
fsh
Jun 15, 2023, 5:33 PM
8
points
0
comments
25
min read
LW
link
Press the happiness button!
Spiarrow
Jun 15, 2023, 5:30 PM
5
points
3
comments
2
min read
LW
link
AI #16: AI in the UK
Zvi
Jun 15, 2023, 1:20 PM
46
points
20
comments
54
min read
LW
link
(thezvi.wordpress.com)
I still think it’s very unlikely we’re observing alien aircraft
dynomight
Jun 15, 2023, 1:01 PM
180
points
70
comments
5
min read
LW
link
(dynomight.net)
Aligned Objectives Prize Competition
Prometheus
15 Jun 2023 12:42 UTC
8
points
0
comments
2
min read
LW
link
(app.impactmarkets.io)
A more effective Elevator Pitch for AI risk
Iknownothing
15 Jun 2023 12:39 UTC
2
points
0
comments
1
min read
LW
link
Why “AI alignment” would better be renamed into “Artificial Intention research”
chaosmage
15 Jun 2023 10:32 UTC
29
points
12
comments
2
min read
LW
link
Matt Taibbi’s COVID reporting
ChristianKl
15 Jun 2023 9:49 UTC
21
points
34
comments
1
min read
LW
link
(www.racket.news)
Looking Back On Ads
jefftk
15 Jun 2023 2:10 UTC
30
points
11
comments
3
min read
LW
link
(www.jefftk.com)
Why libertarians are advocating for regulation on AI
RobertM
14 Jun 2023 20:59 UTC
36
points
13
comments
4
min read
LW
link
Instrumental Convergence? [Draft]
J. Dmitri Gallow
14 Jun 2023 20:21 UTC
48
points
20
comments
33
min read
LW
link
On the Apple Vision Pro
Zvi
14 Jun 2023 17:50 UTC
44
points
17
comments
11
min read
LW
link
(thezvi.wordpress.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel