Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Concrete Steps to Get Started in Transformer Mechanistic Interpretability
Neel Nanda
Dec 25, 2022, 10:21 PM
57
points
7
comments
12
min read
LW
link
(www.neelnanda.io)
It’s time to worry about online privacy again
Malmesbury
Dec 25, 2022, 9:05 PM
69
points
23
comments
6
min read
LW
link
[Hebbian Natural Abstractions] Mathematical Foundations
Samuel Nellessen
and
Jan
Dec 25, 2022, 8:58 PM
15
points
2
comments
6
min read
LW
link
(www.snellessen.com)
[Question]
Oracle AGI—How can it escape, other than security issues? (Steganography?)
RationalSieve
Dec 25, 2022, 8:14 PM
3
points
6
comments
1
min read
LW
link
YCombinator fraud rates
Xodarap
Dec 25, 2022, 7:21 PM
56
points
3
comments
LW
link
How evolutionary lineages of LLMs can plan their own future and act on these plans
Roman Leventov
Dec 25, 2022, 6:11 PM
39
points
16
comments
8
min read
LW
link
Accurate Models of AI Risk Are Hyperexistential Exfohazards
Thane Ruthenis
Dec 25, 2022, 4:50 PM
33
points
38
comments
9
min read
LW
link
ChatGPT is our Wright Brothers moment
Ron J
Dec 25, 2022, 4:26 PM
10
points
9
comments
1
min read
LW
link
The Meditation on Winter
Raemon
Dec 25, 2022, 4:12 PM
59
points
3
comments
3
min read
LW
link
I’ve updated towards AI boxing being surprisingly easy
Noosphere89
Dec 25, 2022, 3:40 PM
8
points
20
comments
2
min read
LW
link
Take 14: Corrigibility isn’t that great.
Charlie Steiner
Dec 25, 2022, 1:04 PM
15
points
3
comments
3
min read
LW
link
Simplified Level Up
jefftk
Dec 25, 2022, 1:00 PM
12
points
16
comments
2
min read
LW
link
(www.jefftk.com)
Hyperfinite graphs ~ manifolds
Alok Singh
Dec 25, 2022, 12:24 PM
11
points
5
comments
2
min read
LW
link
Inconsistent math is great
Alok Singh
Dec 25, 2022, 3:20 AM
1
point
2
comments
1
min read
LW
link
A hundredth of a bit of extra entropy
Adam Scherlis
Dec 24, 2022, 9:12 PM
83
points
4
comments
3
min read
LW
link
Shared reality: a key driver of human behavior
kdbscott
Dec 24, 2022, 7:35 PM
126
points
25
comments
4
min read
LW
link
Contra Steiner on Too Many Natural Abstractions
DragonGod
Dec 24, 2022, 5:42 PM
10
points
6
comments
1
min read
LW
link
Three reasons to cooperate
paulfchristiano
Dec 24, 2022, 5:40 PM
86
points
14
comments
10
min read
LW
link
(sideways-view.com)
Practical AI risk I: Watching large compute
Gustavo Ramires
Dec 24, 2022, 1:25 PM
3
points
0
comments
1
min read
LW
link
Non-Elevated Air Purifiers
jefftk
Dec 24, 2022, 12:40 PM
10
points
2
comments
1
min read
LW
link
(www.jefftk.com)
The Case for Chip-Backed Dollars
AnthonyRepetto
Dec 24, 2022, 10:28 AM
0
points
1
comment
4
min read
LW
link
List #3: Why not to assume on prior that AGI-alignment workarounds are available
Remmelt
Dec 24, 2022, 9:54 AM
4
points
1
comment
3
min read
LW
link
List #2: Why coordinating to align as humans to not develop AGI is a lot easier than, well… coordinating as humans with AGI coordinating to be aligned with humans
Remmelt
Dec 24, 2022, 9:53 AM
1
point
0
comments
3
min read
LW
link
List #1: Why stopping the development of AGI is hard but doable
Remmelt
Dec 24, 2022, 9:52 AM
6
points
11
comments
5
min read
LW
link
The case against AI alignment
andrew sauer
Dec 24, 2022, 6:57 AM
128
points
110
comments
5
min read
LW
link
Content and Takeaways from SERI MATS Training Program with John Wentworth
RohanS
Dec 24, 2022, 4:17 AM
28
points
3
comments
12
min read
LW
link
Löb’s Lemma: an easier approach to Löb’s Theorem
Andrew_Critch
Dec 24, 2022, 2:02 AM
30
points
16
comments
3
min read
LW
link
Durkon, an open-source tool for Inherently Interpretable Modelling
abstractapplic
Dec 24, 2022, 1:49 AM
37
points
0
comments
4
min read
LW
link
Issues with uneven AI resource distribution
User_Luke
Dec 24, 2022, 1:18 AM
3
points
9
comments
5
min read
LW
link
(temporal.substack.com)
Loose Threads on Intelligence
Shoshannah Tekofsky
Dec 24, 2022, 12:38 AM
11
points
3
comments
8
min read
LW
link
[Question]
If you factor out next token prediction, what are the remaining salient features of human cognition?
Shmi
Dec 24, 2022, 12:38 AM
9
points
7
comments
1
min read
LW
link
[Question]
Why is “Argument Mapping” Not More Common in EA/Rationality (And What Objections Should I Address in a Post on the Topic?)
HarrisonDurland
Dec 23, 2022, 9:58 PM
10
points
5
comments
1
min read
LW
link
The Fear [Fiction]
Yitz
Dec 23, 2022, 9:21 PM
7
points
0
comments
1
min read
LW
link
To err is neural: select logs with ChatGPT
VipulNaik
Dec 23, 2022, 8:26 PM
22
points
2
comments
38
min read
LW
link
AISER—AIS Europe Retreat
Carolin
Dec 23, 2022, 7:03 PM
5
points
0
comments
1
min read
LW
link
Two Truths and a Prediction Market
Screwtape
Dec 23, 2022, 6:52 PM
22
points
2
comments
6
min read
LW
link
ChatGPT understands, but largely does not generate Spanglish (and other code-mixed) text
Milan W
Dec 23, 2022, 5:40 PM
15
points
5
comments
4
min read
LW
link
On sincerity
Joe Carlsmith
Dec 23, 2022, 5:13 PM
76
points
6
comments
42
min read
LW
link
Epigenetics of the mammalian germline
Metacelsus
Dec 23, 2022, 3:21 PM
37
points
0
comments
7
min read
LW
link
(denovo.substack.com)
Boston Solstice Songs
jefftk
Dec 23, 2022, 1:00 PM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Are there any reliable CAPTCHAs? Competition for CAPTCHA ideas that AIs can’t solve.
MrThink
Dec 23, 2022, 12:52 PM
7
points
37
comments
1
min read
LW
link
“Search” is dead. What is the new paradigm?
Shmi
Dec 23, 2022, 10:33 AM
15
points
9
comments
1
min read
LW
link
Article Review: Discovering Latent Knowledge (Burns, Ye, et al)
Robert_AIZI
Dec 22, 2022, 6:16 PM
13
points
4
comments
6
min read
LW
link
(aizi.substack.com)
Let’s think about slowing down AI
KatjaGrace
Dec 22, 2022, 5:40 PM
551
points
182
comments
38
min read
LW
link
3
reviews
(aiimpacts.org)
Some Notes on the mathematics of Toy Autoencoding Problems
carboniferous_umbraculum
22 Dec 2022 17:21 UTC
18
points
1
comment
12
min read
LW
link
December 2022 updates and fundraising
AI Impacts
22 Dec 2022 17:20 UTC
39
points
1
comment
3
min read
LW
link
(aiimpacts.org)
Covid 12/22/22: Reevaluating Past Options
Zvi
22 Dec 2022 16:50 UTC
30
points
2
comments
9
min read
LW
link
(thezvi.wordpress.com)
China Covid #4
Zvi
22 Dec 2022 16:30 UTC
50
points
2
comments
11
min read
LW
link
(thezvi.wordpress.com)
Racing through a minefield: the AI deployment problem
HoldenKarnofsky
22 Dec 2022 16:10 UTC
38
points
2
comments
13
min read
LW
link
(www.cold-takes.com)
Lead in Chocolate?
jefftk
22 Dec 2022 16:10 UTC
41
points
6
comments
2
min read
LW
link
(www.jefftk.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel