Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
GÖDEL GOING DOWN
Jimdrix_Hendri
Mar 6, 2023, 11:06 PM
−9
points
3
comments
1
min read
LW
link
Against ubiquitous alignment taxes
beren
Mar 6, 2023, 7:50 PM
57
points
10
comments
2
min read
LW
link
Addendum: basic facts about language models during training
beren
Mar 6, 2023, 7:24 PM
22
points
2
comments
5
min read
LW
link
Understanding The Roots Of Mathematics Before Finding The Roots Of A Function.
LiesLaris
Mar 6, 2023, 6:47 PM
2
points
0
comments
1
min read
LW
link
Discussion: LLaMA Leak & Whistleblowing in pre-AGI era
jirahim
Mar 6, 2023, 6:47 PM
1
point
4
comments
1
min read
LW
link
[Question]
Are we too confident about unaligned AGI killing off humanity?
RomanS
Mar 6, 2023, 4:19 PM
21
points
63
comments
1
min read
LW
link
Introducing Leap Labs, an AI interpretability startup
Jessica Rumbelow
Mar 6, 2023, 4:16 PM
103
points
12
comments
1
min read
LW
link
Monthly Roundup #4: March 2023
Zvi
Mar 6, 2023, 2:10 PM
31
points
0
comments
24
min read
LW
link
(thezvi.wordpress.com)
Fundamental Uncertainty: Chapter 6 - How can we be certain about the truth?
Gordon Seidoh Worley
Mar 6, 2023, 1:52 PM
11
points
19
comments
16
min read
LW
link
The idea
JNS
Mar 6, 2023, 1:42 PM
3
points
0
comments
9
min read
LW
link
Honesty, Openness, Trustworthiness, and Secrets
NormanPerlmutter
Mar 6, 2023, 9:03 AM
13
points
0
comments
9
min read
LW
link
EA & LW Forum Weekly Summary (27th Feb − 5th Mar 2023)
Zoe Williams
Mar 6, 2023, 3:18 AM
12
points
0
comments
LW
link
The Type II Inner-Compass Theorem
Tristan Miano
Mar 6, 2023, 2:35 AM
−16
points
0
comments
22
min read
LW
link
AGI’s Impact on Employment
TheUnkown
Mar 6, 2023, 1:56 AM
1
point
1
comment
1
min read
LW
link
(www.apricitas.io)
Why did you trash the old HPMOR.com?
AnnoyedReader
Mar 6, 2023, 1:55 AM
54
points
68
comments
2
min read
LW
link
Cap Model Size for AI Safety
research_prime_space
Mar 6, 2023, 1:11 AM
0
points
4
comments
1
min read
LW
link
What should we do about network-effect monopolies?
benkuhn
Mar 6, 2023, 12:50 AM
31
points
7
comments
1
min read
LW
link
(www.benkuhn.net)
Who Aligns the Alignment Researchers?
Ben Smith
Mar 5, 2023, 11:22 PM
48
points
0
comments
11
min read
LW
link
Startups are like firewood
Adam Zerner
Mar 5, 2023, 11:09 PM
26
points
2
comments
3
min read
LW
link
A concerning observation from media coverage of AI industry dynamics
Justin Olive
Mar 5, 2023, 9:38 PM
8
points
3
comments
3
min read
LW
link
Steven Pinker on ChatGPT and AGI (Feb 2023)
Evan R. Murphy
Mar 5, 2023, 9:34 PM
11
points
8
comments
1
min read
LW
link
(news.harvard.edu)
Is it time to talk about AI doomsday prepping yet?
bokov
Mar 5, 2023, 9:17 PM
0
points
8
comments
1
min read
LW
link
Coordination explosion before intelligence explosion...?
tailcalled
Mar 5, 2023, 8:48 PM
47
points
9
comments
2
min read
LW
link
The Ogdoad
Tristan Miano
Mar 5, 2023, 8:01 PM
−15
points
1
comment
37
min read
LW
link
[Question]
What are some good ways to heighten my emotions?
oh54321
Mar 5, 2023, 6:06 PM
5
points
5
comments
1
min read
LW
link
Research proposal: Leveraging Jungian archetypes to create values-based models
MiguelDev
Mar 5, 2023, 5:39 PM
5
points
2
comments
2
min read
LW
link
Abusing Snap Circuits IC
jefftk
Mar 5, 2023, 5:00 PM
19
points
3
comments
3
min read
LW
link
(www.jefftk.com)
Do humans derive values from fictitious imputed coherence?
TsviBT
Mar 5, 2023, 3:23 PM
45
points
8
comments
14
min read
LW
link
The Inner-Compass Theorem
Tristan Miano
Mar 5, 2023, 3:21 PM
−18
points
12
comments
16
min read
LW
link
Halifax Monthly Meetup: AI Safety Discussion
Ideopunk
Mar 5, 2023, 12:42 PM
10
points
0
comments
1
min read
LW
link
Why kill everyone?
arisAlexis
Mar 5, 2023, 11:53 AM
7
points
5
comments
2
min read
LW
link
Selective, Corrective, Structural: Three Ways of Making Social Systems Work
Said Achmiz
Mar 5, 2023, 8:45 AM
100
points
13
comments
2
min read
LW
link
Substitute goods for leisure are abundant
Adam Zerner
Mar 5, 2023, 3:45 AM
20
points
7
comments
5
min read
LW
link
[Question]
Does polyamory at a workplace turn nepotism up to eleven?
Viliam
Mar 5, 2023, 12:57 AM
45
points
11
comments
2
min read
LW
link
Why We MUST Build an (aligned) Artificial Superintelligence That Takes Over Human Society—A Thought Experiment
twkaiser
Mar 5, 2023, 12:47 AM
−13
points
12
comments
2
min read
LW
link
Forecasts on Moore v Harper from Samotsvety
gregjustice
Mar 5, 2023, 12:47 AM
7
points
0
comments
1
min read
LW
link
(samotsvety.org)
Why Not Just… Build Weak AI Tools For AI Alignment Research?
johnswentworth
Mar 5, 2023, 12:12 AM
184
points
18
comments
6
min read
LW
link
Consciousness is irrelevant—instead solve alignment by asking this question
Oliver Siegel
Mar 4, 2023, 10:06 PM
−10
points
6
comments
1
min read
LW
link
More money with less risk: sell services instead of model access
lemonhope
Mar 4, 2023, 8:51 PM
9
points
3
comments
1
min read
LW
link
Contra “Strong Coherence”
DragonGod
Mar 4, 2023, 8:05 PM
39
points
24
comments
1
min read
LW
link
The Practitioner’s Path 2.0: A new framework for structured self-improvement
Evenflair
Mar 4, 2023, 7:19 PM
32
points
2
comments
11
min read
LW
link
(guildoftherose.org)
The Benefits of Distillation in Research
Jonas Hallgren
Mar 4, 2023, 5:45 PM
15
points
2
comments
5
min read
LW
link
Optimal Music Choice
mbazzani
Mar 4, 2023, 5:26 PM
5
points
0
comments
1
min read
LW
link
Why don’t more people talk about ecological psychology?
Ppau
Mar 4, 2023, 5:03 PM
21
points
10
comments
7
min read
LW
link
Switching to Electric Mandolin
jefftk
Mar 4, 2023, 3:40 PM
16
points
1
comment
1
min read
LW
link
(www.jefftk.com)
Predictive Performance on Metaculus vs. Manifold Markets
nikos
Mar 4, 2023, 8:10 AM
18
points
0
comments
5
min read
LW
link
Contra Hanson on AI Risk
Liron
Mar 4, 2023, 8:02 AM
36
points
23
comments
8
min read
LW
link
Bite Sized Tasks
Johannes C. Mayer
Mar 4, 2023, 3:31 AM
18
points
2
comments
2
min read
LW
link
How popular is ChatGPT? Part 2: slower growth than Pokémon GO
Richard Korzekwa
Mar 3, 2023, 11:40 PM
42
points
4
comments
6
min read
LW
link
(aiimpacts.org)
Acausal normalcy
Andrew_Critch
Mar 3, 2023, 11:34 PM
195
points
36
comments
8
min read
LW
link
1
review
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel