Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
How to (hopefully ethically) make money off of AGI
habryka
,
Zvi
,
Cosmos
and
NoahK
Nov 6, 2023, 11:35 PM
171
points
95
comments
32
min read
LW
link
1
review
cost estimation for 2 grid energy storage systems
bhauth
Nov 6, 2023, 11:32 PM
16
points
12
comments
7
min read
LW
link
(www.bhauth.com)
A bet on critical periods in neural networks
kave
and
Garrett Baker
Nov 6, 2023, 11:21 PM
24
points
1
comment
6
min read
LW
link
Job listing: Communications Generalist / Project Manager
Gretta Duleba
Nov 6, 2023, 8:21 PM
49
points
7
comments
1
min read
LW
link
Askesis: a model of the cerebellum
MadHatter
Nov 6, 2023, 8:19 PM
7
points
2
comments
1
min read
LW
link
(github.com)
LQPR: An Algorithm for Reinforcement Learning with Provable Safety Guarantees
MadHatter
Nov 6, 2023, 8:17 PM
6
points
0
comments
1
min read
LW
link
(github.com)
ACX Meetup Leipzig
Roman Leipe
Nov 6, 2023, 6:33 PM
1
point
0
comments
1
min read
LW
link
[Question]
Does bulemia work?
lc
Nov 6, 2023, 5:58 PM
5
points
18
comments
1
min read
LW
link
Why building ventures in AI Safety is particularly challenging
Heramb
Nov 6, 2023, 4:27 PM
1
point
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
What is true is already so. Owning up to it doesn’t make it worse.
RamblinDash
Nov 6, 2023, 3:49 PM
20
points
2
comments
1
min read
LW
link
An illustrative model of backfire risks from pausing AI research
Maxime Riché
Nov 6, 2023, 2:30 PM
33
points
3
comments
11
min read
LW
link
Proposal for improving state of alignment research
Iknownothing
Nov 6, 2023, 1:55 PM
2
points
0
comments
1
min read
LW
link
Are language models good at making predictions?
dynomight
Nov 6, 2023, 1:10 PM
76
points
14
comments
4
min read
LW
link
(dynomight.net)
Tips, tricks, lessons and thoughts on hosting hackathons
gergogaspar
Nov 6, 2023, 11:03 AM
3
points
0
comments
11
min read
LW
link
Announcing TAIS 2024
Blaine
Nov 6, 2023, 8:38 AM
23
points
0
comments
1
min read
LW
link
(tais2024.cc)
Taboo Wall
Screwtape
Nov 6, 2023, 3:51 AM
19
points
0
comments
3
min read
LW
link
When and why should you use the Kelly criterion?
Garrett Baker
,
philh
and
River
Nov 5, 2023, 11:26 PM
27
points
25
comments
16
min read
LW
link
On Overhangs and Technological Change
Roko
Nov 5, 2023, 10:58 PM
50
points
19
comments
2
min read
LW
link
xAI announces Grok, beats GPT-3.5
Nikola Jurkovic
Nov 5, 2023, 10:11 PM
10
points
6
comments
1
min read
LW
link
(x.ai)
Disentangling four motivations for acting in accordance with UDT
Julian Stastny
Nov 5, 2023, 9:26 PM
35
points
3
comments
7
min read
LW
link
AI as Super-Demagogue
RationalDino
Nov 5, 2023, 9:21 PM
11
points
12
comments
9
min read
LW
link
EA orgs’ legal structure inhibits risk taking and information sharing on the margin
Elizabeth
Nov 5, 2023, 7:13 PM
136
points
17
comments
4
min read
LW
link
Eric Schmidt on recursive self-improvement
Nikola Jurkovic
Nov 5, 2023, 7:05 PM
24
points
3
comments
1
min read
LW
link
(www.youtube.com)
Pivotal Acts might Not be what You Think they are
Johannes C. Mayer
Nov 5, 2023, 5:23 PM
41
points
13
comments
3
min read
LW
link
The Assumed Intent Bias
silentbob
Nov 5, 2023, 4:28 PM
51
points
13
comments
6
min read
LW
link
Go flash blinking lights at printed text right now
lemonhope
Nov 5, 2023, 7:29 AM
15
points
9
comments
1
min read
LW
link
Life of GPT
Odd anon
Nov 5, 2023, 4:55 AM
6
points
2
comments
5
min read
LW
link
Lightning Talks
Screwtape
Nov 5, 2023, 3:27 AM
6
points
3
comments
4
min read
LW
link
Utility is not the selection target
tailcalled
Nov 4, 2023, 10:48 PM
24
points
1
comment
1
min read
LW
link
Stuxnet, not Skynet: Humanity’s disempowerment by AI
Roko
Nov 4, 2023, 10:23 PM
107
points
24
comments
6
min read
LW
link
The 6D effect: When companies take risks, one email can be very powerful.
scasper
Nov 4, 2023, 8:08 PM
279
points
42
comments
3
min read
LW
link
Genetic fitness is a measure of selection strength, not the selection target
Kaj_Sotala
Nov 4, 2023, 7:02 PM
58
points
44
comments
18
min read
LW
link
The Soul Key
Richard_Ngo
Nov 4, 2023, 5:51 PM
112
points
10
comments
8
min read
LW
link
1
review
(www.narrativeark.xyz)
[Linkpost] Concept Alignment as a Prerequisite for Value Alignment
Bogdan Ionut Cirstea
Nov 4, 2023, 5:34 PM
27
points
0
comments
1
min read
LW
link
(arxiv.org)
We are already in a persuasion-transformed world and must take precautions
trevor
Nov 4, 2023, 3:53 PM
37
points
14
comments
6
min read
LW
link
Being good at the basics
dominicq
Nov 4, 2023, 2:18 PM
33
points
1
comment
3
min read
LW
link
If a little is good, is more better?
DanielFilan
Nov 4, 2023, 7:10 AM
25
points
16
comments
2
min read
LW
link
(danielfilan.com)
Untrusted smart models and trusted dumb models
Buck
Nov 4, 2023, 3:06 AM
87
points
17
comments
6
min read
LW
link
1
review
As Many Ideas
Screwtape
Nov 3, 2023, 10:47 PM
11
points
0
comments
4
min read
LW
link
Paul Christiano on Dwarkesh Podcast
ESRogs
Nov 3, 2023, 10:13 PM
19
points
0
comments
1
min read
LW
link
(www.dwarkeshpatel.com)
Deception Chess: Game #1
Zane
,
aphyer
,
Alex A
and
AdamYedidia
Nov 3, 2023, 9:13 PM
111
points
22
comments
8
min read
LW
link
1
review
8 examples informing my pessimism on uploading without reverse engineering
Steven Byrnes
Nov 3, 2023, 8:03 PM
118
points
12
comments
12
min read
LW
link
Integrity in AI Governance and Advocacy
habryka
and
OliviaJ
Nov 3, 2023, 7:52 PM
134
points
57
comments
23
min read
LW
link
Averaging samples from a population with log-normal distribution
CrimsonChin
Nov 3, 2023, 7:42 PM
8
points
2
comments
1
min read
LW
link
Securing Civilization Against Catastrophic Pandemics
jefftk
Nov 3, 2023, 7:33 PM
13
points
0
comments
1
min read
LW
link
(dam.gcsp.ch)
The Unavoidable Experience of Free Will in a Deterministic World
gmax
Nov 3, 2023, 5:55 PM
−12
points
0
comments
3
min read
LW
link
Thoughts on open source AI
Sam Marks
Nov 3, 2023, 3:35 PM
62
points
17
comments
10
min read
LW
link
Knowledge Base 6: Consensus theory of truth
iwis
Nov 3, 2023, 1:56 PM
−8
points
0
comments
1
min read
LW
link
[Question]
Shouldn’t we ‘Just’ Superimitate Low-Res Uploads?
lukemarks
Nov 3, 2023, 7:42 AM
15
points
2
comments
2
min read
LW
link
The other side of the tidal wave
KatjaGrace
Nov 3, 2023, 5:40 AM
189
points
86
comments
1
min read
LW
link
(worldspiritsockpuppet.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel