Scanning your Brain with 100,000,000,000 wires?
Johannes C. Mayer
Jun 1, 2024, 6:37 PM
6
points
6
comments
2
min read
LW
link
[Question]
Turning latexed notes into blog posts
Terence Coelho
Jun 1, 2024, 6:03 PM
5
points
2
comments
1
min read
LW
link
How do you know you are right when debating? Calculate your AmIRight score.
MrThink
Jun 1, 2024, 3:55 PM
2
points
5
comments
2
min read
LW
link
Links for May
Kaj_Sotala
Jun 1, 2024, 10:20 AM
20
points
16
comments
18
min read
LW
link
(kajsotala.fi)
[Question]
What do coherence arguments actually prove about agentic behavior?
sunwillrise
Jun 1, 2024, 9:37 AM
123
points
39
comments
6
min read
LW
link
AI Safety: A Climb To Armageddon?
kmenou
Jun 1, 2024, 6:02 AM
8
points
3
comments
1
min read
LW
link
(arxiv.org)
When does external behaviour imply internal structure?
Tyler Tracy
May 31, 2024, 4:41 PM
6
points
5
comments
7
min read
LW
link
[Question]
We might be dropping the ball on Autonomous Replication and Adaptation.
Charbel-Raphaël
and
Épiphanie Gédéon
May 31, 2024, 1:49 PM
63
points
30
comments
4
min read
LW
link
Tax Cuts and Innovation
Maxwell Tabarrok
May 31, 2024, 12:58 PM
3
points
0
comments
6
min read
LW
link
(www.maximum-progress.com)
The Gemini 1.5 Report
Zvi
May 31, 2024, 12:20 PM
18
points
0
comments
17
min read
LW
link
(thezvi.wordpress.com)
Less Anti-Dakka
Mateusz Bagiński
May 31, 2024, 9:07 AM
24
points
5
comments
3
min read
LW
link
Web-surfing tips for strange times
eukaryote
May 31, 2024, 7:10 AM
48
points
19
comments
9
min read
LW
link
(eukaryotewritesblog.substack.com)
There Should Be More Alignment-Driven Startups
Vaniver
,
Judd Rosenblatt
,
Cameron Berg
and
phgubbins
May 31, 2024, 2:05 AM
62
points
14
comments
11
min read
LW
link
[Question]
How likely is it that AI will torture us until the end of time?
Damilo
May 31, 2024, 1:26 AM
4
points
24
comments
2
min read
LW
link
Twin Peaks: under the air
KatjaGrace
May 31, 2024, 1:20 AM
25
points
2
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
Is suffering like shit?
KatjaGrace
May 31, 2024, 1:20 AM
32
points
5
comments
1
min read
LW
link
(worldspiritsockpuppet.com)
Foresight Vision Weekend Europe 2024
Allison Duettmann
May 31, 2024, 12:07 AM
3
points
0
comments
1
min read
LW
link
[Question]
How have analogous Industries solved Interested > Trained > Employed bottlenecks?
yanni kyriacos
May 30, 2024, 11:59 PM
4
points
1
comment
1
min read
LW
link
Duckbill Masks Better?
jefftk
May 30, 2024, 11:40 PM
20
points
3
comments
1
min read
LW
link
(www.jefftk.com)
OpenAI: Helen Toner Speaks
Zvi
May 30, 2024, 9:10 PM
86
points
8
comments
13
min read
LW
link
(thezvi.wordpress.com)
Non-Disparagement Canaries for OpenAI
aysja
and
Adam Scholl
May 30, 2024, 7:20 PM
288
points
51
comments
2
min read
LW
link
Clarifying METR’s Auditing Role
Beth Barnes
May 30, 2024, 6:41 PM
108
points
1
comment
2
min read
LW
link
A civilization ran by amateurs
Olli Järviniemi
May 30, 2024, 5:57 PM
61
points
8
comments
6
min read
LW
link
One week left to apply for the Roots of Progress Blog-Building Intensive
jasoncrawford
May 30, 2024, 4:55 PM
8
points
0
comments
3
min read
LW
link
(rootsofprogress.org)
Getting started with AI Alignment research: how to reproduce an experiment from research paper
Alexander230
May 30, 2024, 2:51 PM
3
points
0
comments
3
min read
LW
link
AI #66: Oh to Be Less Online
Zvi
May 30, 2024, 2:20 PM
37
points
6
comments
56
min read
LW
link
(thezvi.wordpress.com)
The 27 papers
WitheringWeights
May 30, 2024, 8:46 AM
18
points
2
comments
1
min read
LW
link
The Market Singularity: A New Perspective
azsantosk
May 30, 2024, 7:05 AM
1
point
0
comments
15
min read
LW
link
Awakening
lsusr
May 30, 2024, 7:03 AM
124
points
79
comments
9
min read
LW
link
Value Claims (In Particular) Are Usually Bullshit
johnswentworth
May 30, 2024, 6:26 AM
144
points
18
comments
2
min read
LW
link
The Pearly Gates
lsusr
May 30, 2024, 4:01 AM
127
points
6
comments
3
min read
LW
link
AXRP Episode 32 - Understanding Agency with Jan Kulveit
DanielFilan
May 30, 2024, 3:50 AM
20
points
0
comments
53
min read
LW
link
US Presidential Election: Tractability, Importance, and Urgency
kuhanj
May 29, 2024, 11:52 PM
42
points
2
comments
3
min read
LW
link
Thoughts on SB-1047
ryan_greenblatt
May 29, 2024, 11:26 PM
60
points
1
comment
11
min read
LW
link
How I designed my own writing system, VJScript
vkethana
May 29, 2024, 11:18 PM
2
points
1
comment
1
min read
LW
link
(www.vkethana.com)
AI and integrity
Nathan Young
May 29, 2024, 8:45 PM
10
points
0
comments
2
min read
LW
link
(nathanpmyoung.substack.com)
MIRI 2024 Communications Strategy
Gretta Duleba
May 29, 2024, 7:33 PM
325
points
216
comments
7
min read
LW
link
2024 Summer AI Safety Intro Fellowship and Socials in Boston
KevinWei
May 29, 2024, 6:27 PM
8
points
0
comments
1
min read
LW
link
Apollo Research 1-year update
Marius Hobbhahn
,
Lee Sharkey
,
Lucius Bushnaq
,
Dan Braun
,
Mikita Balesni
,
Jérémy Scheurer
,
Nicholas Goldowsky-Dill
,
StefanHex
,
jake_mendel
,
AlexMeinke
and
rusheb
May 29, 2024, 5:44 PM
93
points
0
comments
7
min read
LW
link
Response to nostalgebraist: proudly waving my moral-antirealist battle flag
Steven Byrnes
May 29, 2024, 4:48 PM
103
points
29
comments
11
min read
LW
link
Looking beyond Everett in multiversal views of LLMs
kromem
May 29, 2024, 12:35 PM
10
points
0
comments
8
min read
LW
link
[Question]
Inviting discussion of “Beat AI: A contest using philosophical concepts”
David James
May 29, 2024, 11:55 AM
2
points
1
comment
1
min read
LW
link
AI companies’ commitments
Zach Stein-Perlman
May 29, 2024, 11:00 AM
36
points
0
comments
1
min read
LW
link
One way violinists fail
Solenoid_Entity
May 29, 2024, 4:08 AM
33
points
5
comments
3
min read
LW
link
Hardshipification
Jonathan Moregård
May 28, 2024, 8:02 PM
88
points
17
comments
2
min read
LW
link
(honestliving.substack.com)
When Are Circular Definitions A Problem?
johnswentworth
May 28, 2024, 8:00 PM
68
points
15
comments
3
min read
LW
link
Notes on Gracefulness
David Gross
May 28, 2024, 6:40 PM
20
points
2
comments
25
min read
LW
link
[Question]
What’s a better term now that “AGI” is too vague?
Seth Herd
May 28, 2024, 6:02 PM
15
points
9
comments
2
min read
LW
link
Reward hacking behavior can generalize across tasks
Kei
,
Isaac Dunn
,
Henry Sleight
,
Miles Turpin
,
evhub
,
Carson Denison
and
Ethan Perez
May 28, 2024, 4:33 PM
79
points
5
comments
21
min read
LW
link
Quick Advice on Writing Essays
Niko_McCarty
28 May 2024 15:02 UTC
11
points
0
comments
3
min read
LW
link
(www.nikomccarty.com)