Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Emergence and Amplification of Survival
jgraves01
Dec 28, 2024, 11:52 PM
−1
points
0
comments
3
min read
LW
link
[Question]
Has Someone Checked The Cold-Water-In-Left-Ear Thing?
Maloew
Dec 28, 2024, 8:15 PM
11
points
0
comments
1
min read
LW
link
By default, capital will matter more than ever after AGI
L Rudolf L
Dec 28, 2024, 5:52 PM
289
points
100
comments
16
min read
LW
link
(nosetgauge.substack.com)
AI Assistants Should Have a Direct Line to Their Developers
Jan_Kulveit
Dec 28, 2024, 5:01 PM
57
points
6
comments
2
min read
LW
link
No, the Polymarket price does not mean we can immediately conclude what the probability of a bird flu pandemic is. We also need to know the interest rate!
Christopher King
Dec 28, 2024, 4:05 PM
7
points
11
comments
1
min read
LW
link
The average rationalist IQ is about 122
Rockenots
Dec 28, 2024, 3:42 PM
20
points
23
comments
1
min read
LW
link
Why OpenAI’s Structure Must Evolve To Advance Our Mission
stuhlmueller
Dec 28, 2024, 4:24 AM
19
points
1
comment
1
min read
LW
link
(openai.com)
The Engineering Argument Fallacy: Why Technological Success Doesn’t Validate Physics
Wenitte Apiou
Dec 28, 2024, 12:49 AM
−16
points
5
comments
2
min read
LW
link
The Robot, the Puppet-master, and the Psychohistorian
WillPetillo
Dec 28, 2024, 12:12 AM
8
points
2
comments
3
min read
LW
link
Progress links and short notes, 2024-12-27: Clinical trial abundance, grid-scale fusion, permitting vs. compliance, crossword mania, and more
jasoncrawford
Dec 27, 2024, 11:34 PM
11
points
0
comments
2
min read
LW
link
(newsletter.rootsofprogress.org)
Greedy-Advantage-Aware RLHF
sej2020
Dec 27, 2024, 7:47 PM
48
points
15
comments
13
min read
LW
link
Deconstructing arguments against AI art
DMMF
Dec 27, 2024, 7:40 PM
7
points
5
comments
5
min read
LW
link
(danfrank.ca)
From the Archives: a story
Richard_Ngo
Dec 27, 2024, 4:36 PM
20
points
1
comment
16
min read
LW
link
(www.narrativeark.xyz)
[Question]
What’s the best metric for measuring quality of life?
ChristianKl
Dec 27, 2024, 2:29 PM
10
points
5
comments
1
min read
LW
link
Review: Planecrash
L Rudolf L
Dec 27, 2024, 2:18 PM
360
points
45
comments
22
min read
LW
link
(nosetgauge.substack.com)
Good Fortune and Many Worlds
Jonah Wilberg
Dec 27, 2024, 1:21 PM
4
points
0
comments
5
min read
LW
link
Letter from an Alien Mind
Shoshannah Tekofsky
Dec 27, 2024, 1:20 PM
23
points
7
comments
3
min read
LW
link
(open.substack.com)
Coin Flip
XelaP
Dec 27, 2024, 11:53 AM
17
points
0
comments
1
min read
LW
link
If all trade is voluntary, then what is “exploitation?”
Darmani
Dec 27, 2024, 11:21 AM
34
points
61
comments
6
min read
LW
link
Duplicate token neurons in the first layer of GPT-2
Alex Gibson
Dec 27, 2024, 4:21 AM
4
points
0
comments
5
min read
LW
link
[Question]
What are the most interesting / challenging evals (for humans) available?
Raemon
Dec 27, 2024, 3:05 AM
40
points
13
comments
2
min read
LW
link
Algorithmic Asubjective Anthropics, Cartesian Subjective Anthropics
Lorec
Dec 27, 2024, 1:58 AM
2
points
0
comments
4
min read
LW
link
Corrigibility’s Desirability is Timing-Sensitive
RobertM
Dec 26, 2024, 10:24 PM
29
points
4
comments
3
min read
LW
link
PCR retrospective
bhauth
Dec 26, 2024, 9:20 PM
24
points
0
comments
8
min read
LW
link
(bhauth.com)
AI #96: o3 But Not Yet For Thee
Zvi
Dec 26, 2024, 8:30 PM
58
points
8
comments
36
min read
LW
link
(thezvi.wordpress.com)
Super human AI is a very low hanging fruit!
Hzn
Dec 26, 2024, 7:00 PM
−4
points
0
comments
7
min read
LW
link
The Field of AI Alignment: A Postmortem, and What To Do About It
johnswentworth
Dec 26, 2024, 6:48 PM
302
points
160
comments
8
min read
LW
link
ReSolsticed vol I: “We’re Not Going Quietly”
Raemon
Dec 26, 2024, 5:52 PM
61
points
4
comments
19
min read
LW
link
[Question]
Are Sparse Autoencoders a good idea for AI control?
Gerard Boxo
Dec 26, 2024, 5:34 PM
3
points
4
comments
1
min read
LW
link
A Three-Layer Model of LLM Psychology
Jan_Kulveit
Dec 26, 2024, 4:49 PM
218
points
13
comments
8
min read
LW
link
Human, All Too Human—Superintelligence requires learning things we can’t teach
Ben Turtel
Dec 26, 2024, 4:26 PM
−13
points
4
comments
1
min read
LW
link
(bturtel.substack.com)
[Question]
Why don’t we currently have AI agents?
ChristianKl
Dec 26, 2024, 3:26 PM
8
points
10
comments
1
min read
LW
link
[Question]
What would be the IQ and other benchmarks of o3 that uses $1 million worth of compute resources to answer one question?
avturchin
Dec 26, 2024, 11:08 AM
16
points
2
comments
1
min read
LW
link
The Economics & Practicality of Starting Mars Colonization
Zero Contradictions
Dec 26, 2024, 10:56 AM
2
points
1
comment
1
min read
LW
link
(zerocontradictions.net)
Terminal goal vs Intelligence
Donatas Lučiūnas
Dec 26, 2024, 8:10 AM
−12
points
24
comments
1
min read
LW
link
Streamlining my voice note process
Vlad Sitalo
Dec 26, 2024, 6:04 AM
6
points
1
comment
7
min read
LW
link
(vlad.roam.garden)
Whistleblowing Twitter Bot
Mckiev
Dec 26, 2024, 4:09 AM
19
points
5
comments
2
min read
LW
link
Open Thread Winter 2024/2025
habryka
Dec 25, 2024, 9:02 PM
23
points
59
comments
1
min read
LW
link
Exploring Cooperation: The Path to Utopia
Davidmanheim
Dec 25, 2024, 6:31 PM
11
points
0
comments
LW
link
(exploringcooperation.substack.com)
Living with Rats in College
lsusr
Dec 25, 2024, 10:44 AM
28
points
0
comments
1
min read
LW
link
[Question]
What Have Been Your Most Valuable Casual Conversations At Conferences?
johnswentworth
Dec 25, 2024, 5:49 AM
54
points
21
comments
1
min read
LW
link
The Opening Salvo: 1. An Ontological Consciousness Metric: Resistance to Behavioral Modification as a Measure of Recursive Awareness
Peterpiper
Dec 25, 2024, 2:29 AM
−3
points
0
comments
5
min read
LW
link
The Deep Lore of LightHaven, with Oliver Habryka (TBC episode 228)
Eneasz
and
habryka
Dec 24, 2024, 10:45 PM
45
points
4
comments
91
min read
LW
link
(thebayesianconspiracy.substack.com)
Acknowledging Background Information with P(Q|I)
JenniferRM
Dec 24, 2024, 6:50 PM
29
points
8
comments
14
min read
LW
link
Game Theory and Behavioral Economics in The Stock Market
Jaiveer Singh
Dec 24, 2024, 6:15 PM
1
point
0
comments
3
min read
LW
link
[Question]
What are the main arguments against AGI?
Edy Nastase
Dec 24, 2024, 3:49 PM
1
point
6
comments
1
min read
LW
link
[Question]
Recommendations on communities that discuss AI applications in society
Annapurna
Dec 24, 2024, 1:37 PM
7
points
2
comments
1
min read
LW
link
AIs Will Increasingly Fake Alignment
Zvi
Dec 24, 2024, 1:00 PM
89
points
0
comments
52
min read
LW
link
(thezvi.wordpress.com)
Apply to the 2025 PIBBSS Summer Research Fellowship
DusanDNesic
and
Lucas Teixeira
Dec 24, 2024, 10:25 AM
15
points
0
comments
2
min read
LW
link
Human-AI Complementarity: A Goal for Amplified Oversight
rishubjain
and
Sophie Bridgers
Dec 24, 2024, 9:57 AM
27
points
4
comments
1
min read
LW
link
(deepmindsafetyresearch.medium.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel