Talk: “AI Would Be A Lot Less Alarming If We Understood Agents”
johnswentworth
Dec 17, 2023, 11:46 PM
58
points
3
comments
1
min read
LW
link
(www.youtube.com)
∀: a story
Richard_Ngo
Dec 17, 2023, 10:42 PM
38
points
1
comment
8
min read
LW
link
(www.narrativeark.xyz)
Reviving a 2015 MacBook
jefftk
Dec 17, 2023, 9:00 PM
11
points
0
comments
1
min read
LW
link
(www.jefftk.com)
A Common-Sense Case For Mutually-Misaligned AGIs Allying Against Humans
Thane Ruthenis
Dec 17, 2023, 8:28 PM
29
points
7
comments
11
min read
LW
link
The Limits of Artificial Consciousness: A Biology-Based Critique of Chalmers’ Fading Qualia Argument
Štěpán Los
Dec 17, 2023, 7:11 PM
−6
points
9
comments
17
min read
LW
link
What makes teaching math special
Viliam
Dec 17, 2023, 2:15 PM
45
points
27
comments
11
min read
LW
link
The predictive power of dissipative adaptation
dr_s
Dec 17, 2023, 2:01 PM
56
points
14
comments
19
min read
LW
link
Linkpost: Francesca v Harvard
Linch
Dec 17, 2023, 6:18 AM
5
points
5
comments
2
min read
LW
link
(www.francesca-v-harvard.org)
Lessons from massaging myself, others, dogs, and cats
Chipmonk
Dec 17, 2023, 4:28 AM
2
points
27
comments
5
min read
LW
link
(chipmonk.blog)
The Serendipity of Density
jefftk
Dec 17, 2023, 3:50 AM
40
points
4
comments
1
min read
LW
link
(www.jefftk.com)
Bounty: Diverse hard tasks for LLM agents
Beth Barnes
and
Megan Kinniment
Dec 17, 2023, 1:04 AM
49
points
31
comments
16
min read
LW
link
2022 (and All Time) Posts by Pingback Count
Raemon
Dec 16, 2023, 9:17 PM
53
points
14
comments
6
min read
LW
link
“Humanity vs. AGI” Will Never Look Like “Humanity vs. AGI” to Humanity
Thane Ruthenis
Dec 16, 2023, 8:08 PM
191
points
34
comments
5
min read
LW
link
A visual analogy for text generation by LLMs?
Bill Benzon
Dec 16, 2023, 5:58 PM
3
points
0
comments
1
min read
LW
link
Upgrading the AI Safety Community
trevor
and
Nicholas / Heather Kross
Dec 16, 2023, 3:34 PM
42
points
9
comments
42
min read
LW
link
cold aluminum for medicine
bhauth
Dec 16, 2023, 2:38 PM
42
points
4
comments
4
min read
LW
link
(www.bhauth.com)
Scalable Oversight and Weak-to-Strong Generalization: Compatible approaches to the same problem
Ansh Radhakrishnan
,
Buck
,
ryan_greenblatt
and
Fabien Roger
Dec 16, 2023, 5:49 AM
76
points
4
comments
6
min read
LW
link
1
review
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
leogao
Dec 16, 2023, 5:39 AM
55
points
5
comments
1
min read
LW
link
Pope Francis shares thoughts on responsible AI development
corruptedCatapillar
Dec 16, 2023, 3:49 AM
15
points
4
comments
1
min read
LW
link
(www.vatican.va)
Current AIs Provide Nearly No Data Relevant to AGI Alignment
Thane Ruthenis
Dec 15, 2023, 8:16 PM
132
points
157
comments
8
min read
LW
link
1
review
Agglomeration of ‘Ought’
DavidAndresBloom
Dec 15, 2023, 7:07 PM
1
point
1
comment
11
min read
LW
link
Predicting the future with the power of the Internet (and pissing off Rob Miles)
Writer
Dec 15, 2023, 5:37 PM
23
points
9
comments
4
min read
LW
link
(youtu.be)
Progress links digest, 2023-12-15: Vitalik on d/acc, $100M+ in prizes, and more
jasoncrawford
Dec 15, 2023, 3:52 PM
20
points
0
comments
12
min read
LW
link
(rootsofprogress.org)
“AI Alignment” is a Dangerously Overloaded Term
Roko
Dec 15, 2023, 2:34 PM
108
points
100
comments
3
min read
LW
link
[Valence series] 4. Valence & Social Status (deprecated)
Steven Byrnes
Dec 15, 2023, 2:24 PM
35
points
19
comments
11
min read
LW
link
Contra Scott on Abolishing the FDA
Maxwell Tabarrok
Dec 15, 2023, 2:00 PM
46
points
3
comments
6
min read
LW
link
(maximumprogress.substack.com)
[Paper] Trajectories through semantic spaces in schizophrenia and the relationship to ripple bursts
bvbvbvbvbvbvbvbvbvbvbv
Dec 15, 2023, 1:37 PM
3
points
0
comments
1
min read
LW
link
(www.pnas.org)
Takeaways from a Mechanistic Interpretability project on “Forbidden Facts”
Tony Wang
,
Miles Wang
and
kaivu
Dec 15, 2023, 11:05 AM
33
points
8
comments
10
min read
LW
link
Refinement of Active Inference agency ontology
Roman Leventov
Dec 15, 2023, 9:31 AM
16
points
0
comments
5
min read
LW
link
(arxiv.org)
EU policymakers reach an agreement on the AI Act
tlevin
Dec 15, 2023, 6:02 AM
78
points
7
comments
7
min read
LW
link
Where Does Adversarial Pressure Come From?
quetzal_rainbow
Dec 14, 2023, 10:31 PM
17
points
1
comment
2
min read
LW
link
Epoch wise critical periods, and singular learning theory
Garrett Baker
Dec 14, 2023, 8:55 PM
16
points
1
comment
5
min read
LW
link
OpenAI Superalignment: Weak-to-strong generalization
Dalmert
Dec 14, 2023, 7:47 PM
25
points
3
comments
1
min read
LW
link
(openai.com)
Applications for EA Global are still open!
Eli_Nathan
Dec 14, 2023, 7:10 PM
1
point
0
comments
1
min read
LW
link
Personal Development System: Winning Repeatedly and Growing Effectively With The BIG4
Paul Rohde
Dec 14, 2023, 6:49 PM
13
points
0
comments
33
min read
LW
link
(blog.paul-rohde.com)
Introducing The ‘From Big Ideas To Real-World Results’: A Series for Effective Personal Development
Paul Rohde
Dec 14, 2023, 6:49 PM
13
points
1
comment
8
min read
LW
link
(blog.paul-rohde.com)
Talking With People Who Speak to Congressional Staffers about AI risk
Eneasz
Dec 14, 2023, 5:55 PM
32
points
0
comments
1
min read
LW
link
(www.thebayesianconspiracy.com)
Bayesian Injustice
Kevin Dorst
Dec 14, 2023, 3:44 PM
124
points
10
comments
6
min read
LW
link
(kevindorst.substack.com)
AI #42: The Wrong Answer
Zvi
Dec 14, 2023, 2:50 PM
67
points
6
comments
54
min read
LW
link
(thezvi.wordpress.com)
Some for-profit AI alignment org ideas
Eric Ho
Dec 14, 2023, 2:23 PM
87
points
19
comments
9
min read
LW
link
Mapping the semantic void: Strange goings-on in GPT embedding spaces
mwatkins
Dec 14, 2023, 1:10 PM
114
points
31
comments
14
min read
LW
link
Categorical Organization in Memory: ChatGPT Organizes the 665 Topic Tags from My New Savanna Blog
Bill Benzon
Dec 14, 2023, 1:02 PM
0
points
6
comments
2
min read
LW
link
Moral Mountains
Adam Zerner
Dec 14, 2023, 10:40 AM
8
points
10
comments
2
min read
LW
link
Update on Chinese IQ-related gene panels
Lao Mein
Dec 14, 2023, 10:12 AM
70
points
7
comments
1
min read
LW
link
Red Line Ashmont Train is Now Approaching
jefftk
Dec 14, 2023, 2:50 AM
23
points
2
comments
1
min read
LW
link
(www.jefftk.com)
Various AI doom pathways (and how likely they are)
Logan Zoellner
Dec 14, 2023, 12:54 AM
1
point
1
comment
4
min read
LW
link
(midwitalignment.substack.com)
Are There Examples of Overhang for Other Technologies?
Jeffrey Heninger
Dec 13, 2023, 9:48 PM
59
points
50
comments
11
min read
LW
link
(blog.aiimpacts.org)
Is being sexy for your homies?
Valentine
Dec 13, 2023, 8:37 PM
195
points
100
comments
14
min read
LW
link
2
reviews
How bad is chlorinated water?
bhauth
Dec 13, 2023, 6:00 PM
43
points
18
comments
3
min read
LW
link
(www.bhauth.com)
[Question]
Suggestions for net positive LLM research
Cole Wyeth
Dec 13, 2023, 5:29 PM
13
points
6
comments
1
min read
LW
link