2022 (and All Time) Posts by Pingback Count
Raemon
Dec 16, 2023, 9:17 PM
53
points
14
comments
6
min read
LW
link
“Humanity vs. AGI” Will Never Look Like “Humanity vs. AGI” to Humanity
Thane Ruthenis
Dec 16, 2023, 8:08 PM
191
points
34
comments
5
min read
LW
link
A visual analogy for text generation by LLMs?
Bill Benzon
Dec 16, 2023, 5:58 PM
3
points
0
comments
1
min read
LW
link
Upgrading the AI Safety Community
trevor
and
Nicholas / Heather Kross
Dec 16, 2023, 3:34 PM
42
points
9
comments
42
min read
LW
link
cold aluminum for medicine
bhauth
Dec 16, 2023, 2:38 PM
42
points
4
comments
4
min read
LW
link
(www.bhauth.com)
Scalable Oversight and Weak-to-Strong Generalization: Compatible approaches to the same problem
Ansh Radhakrishnan
,
Buck
,
ryan_greenblatt
and
Fabien Roger
Dec 16, 2023, 5:49 AM
76
points
4
comments
6
min read
LW
link
1
review
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
leogao
Dec 16, 2023, 5:39 AM
55
points
5
comments
1
min read
LW
link
Pope Francis shares thoughts on responsible AI development
corruptedCatapillar
Dec 16, 2023, 3:49 AM
15
points
4
comments
1
min read
LW
link
(www.vatican.va)
Current AIs Provide Nearly No Data Relevant to AGI Alignment
Thane Ruthenis
Dec 15, 2023, 8:16 PM
132
points
157
comments
8
min read
LW
link
1
review
Agglomeration of ‘Ought’
DavidAndresBloom
Dec 15, 2023, 7:07 PM
1
point
1
comment
11
min read
LW
link
Predicting the future with the power of the Internet (and pissing off Rob Miles)
Writer
Dec 15, 2023, 5:37 PM
23
points
9
comments
4
min read
LW
link
(youtu.be)
Progress links digest, 2023-12-15: Vitalik on d/acc, $100M+ in prizes, and more
jasoncrawford
Dec 15, 2023, 3:52 PM
20
points
0
comments
12
min read
LW
link
(rootsofprogress.org)
“AI Alignment” is a Dangerously Overloaded Term
Roko
Dec 15, 2023, 2:34 PM
108
points
100
comments
3
min read
LW
link
[Valence series] 4. Valence & Social Status (deprecated)
Steven Byrnes
Dec 15, 2023, 2:24 PM
35
points
19
comments
11
min read
LW
link
Contra Scott on Abolishing the FDA
Maxwell Tabarrok
Dec 15, 2023, 2:00 PM
46
points
3
comments
6
min read
LW
link
(maximumprogress.substack.com)
[Paper] Trajectories through semantic spaces in schizophrenia and the relationship to ripple bursts
bvbvbvbvbvbvbvbvbvbvbv
Dec 15, 2023, 1:37 PM
3
points
0
comments
1
min read
LW
link
(www.pnas.org)
Takeaways from a Mechanistic Interpretability project on “Forbidden Facts”
Tony Wang
,
Miles Wang
and
kaivu
Dec 15, 2023, 11:05 AM
33
points
8
comments
10
min read
LW
link
Refinement of Active Inference agency ontology
Roman Leventov
Dec 15, 2023, 9:31 AM
16
points
0
comments
5
min read
LW
link
(arxiv.org)
EU policymakers reach an agreement on the AI Act
tlevin
Dec 15, 2023, 6:02 AM
78
points
7
comments
7
min read
LW
link
Where Does Adversarial Pressure Come From?
quetzal_rainbow
Dec 14, 2023, 10:31 PM
17
points
1
comment
2
min read
LW
link
Epoch wise critical periods, and singular learning theory
Garrett Baker
Dec 14, 2023, 8:55 PM
16
points
1
comment
5
min read
LW
link
OpenAI Superalignment: Weak-to-strong generalization
Dalmert
Dec 14, 2023, 7:47 PM
25
points
3
comments
1
min read
LW
link
(openai.com)
Applications for EA Global are still open!
Eli_Nathan
Dec 14, 2023, 7:10 PM
1
point
0
comments
1
min read
LW
link
Personal Development System: Winning Repeatedly and Growing Effectively With The BIG4
Paul Rohde
14 Dec 2023 18:49 UTC
13
points
0
comments
33
min read
LW
link
(blog.paul-rohde.com)
Introducing The ‘From Big Ideas To Real-World Results’: A Series for Effective Personal Development
Paul Rohde
14 Dec 2023 18:49 UTC
13
points
1
comment
8
min read
LW
link
(blog.paul-rohde.com)
Talking With People Who Speak to Congressional Staffers about AI risk
Eneasz
14 Dec 2023 17:55 UTC
32
points
0
comments
1
min read
LW
link
(www.thebayesianconspiracy.com)
Bayesian Injustice
Kevin Dorst
14 Dec 2023 15:44 UTC
124
points
10
comments
6
min read
LW
link
(kevindorst.substack.com)
AI #42: The Wrong Answer
Zvi
14 Dec 2023 14:50 UTC
67
points
6
comments
54
min read
LW
link
(thezvi.wordpress.com)
Some for-profit AI alignment org ideas
Eric Ho
14 Dec 2023 14:23 UTC
87
points
19
comments
9
min read
LW
link
Mapping the semantic void: Strange goings-on in GPT embedding spaces
mwatkins
14 Dec 2023 13:10 UTC
114
points
31
comments
14
min read
LW
link
Categorical Organization in Memory: ChatGPT Organizes the 665 Topic Tags from My New Savanna Blog
Bill Benzon
14 Dec 2023 13:02 UTC
0
points
6
comments
2
min read
LW
link
Moral Mountains
Adam Zerner
14 Dec 2023 10:40 UTC
8
points
10
comments
2
min read
LW
link
Update on Chinese IQ-related gene panels
Lao Mein
14 Dec 2023 10:12 UTC
70
points
7
comments
1
min read
LW
link
Red Line Ashmont Train is Now Approaching
jefftk
14 Dec 2023 2:50 UTC
23
points
2
comments
1
min read
LW
link
(www.jefftk.com)
Various AI doom pathways (and how likely they are)
Logan Zoellner
14 Dec 2023 0:54 UTC
1
point
1
comment
4
min read
LW
link
(midwitalignment.substack.com)
Are There Examples of Overhang for Other Technologies?
Jeffrey Heninger
13 Dec 2023 21:48 UTC
59
points
50
comments
11
min read
LW
link
(blog.aiimpacts.org)
Is being sexy for your homies?
Valentine
13 Dec 2023 20:37 UTC
195
points
100
comments
14
min read
LW
link
2
reviews
How bad is chlorinated water?
bhauth
13 Dec 2023 18:00 UTC
43
points
18
comments
3
min read
LW
link
(www.bhauth.com)
[Question]
Suggestions for net positive LLM research
Cole Wyeth
13 Dec 2023 17:29 UTC
13
points
6
comments
1
min read
LW
link
AI Control: Improving Safety Despite Intentional Subversion
Buck
,
Fabien Roger
,
ryan_greenblatt
and
Kshitij Sachan
13 Dec 2023 15:51 UTC
236
points
24
comments
10
min read
LW
link
4
reviews
The Busy Bee Brain
Bill Benzon
13 Dec 2023 13:10 UTC
11
points
0
comments
6
min read
LW
link
The Best of Don’t Worry About the Vase
Zvi
13 Dec 2023 12:50 UTC
55
points
4
comments
13
min read
LW
link
(thezvi.wordpress.com)
[Question]
Has anyone here investigated the occult community? It is curious to me that many magicians consider themselves empiricists.
SpectrumDT
13 Dec 2023 11:09 UTC
5
points
10
comments
1
min read
LW
link
AI Views Snapshots
Rob Bensinger
13 Dec 2023 0:45 UTC
142
points
61
comments
1
min read
LW
link
The convergent dynamic we missed
Remmelt
12 Dec 2023 23:19 UTC
2
points
2
comments
LW
link
A Kindness, or The Inevitable Consequence of Perfect Inference (a short story)
samhealy
12 Dec 2023 23:03 UTC
6
points
0
comments
9
min read
LW
link
Love, Reverence, and Life
Elizabeth
and
Tristan Williams
12 Dec 2023 21:49 UTC
36
points
9
comments
28
min read
LW
link
2
reviews
Taboo “procrastination”
Neil
12 Dec 2023 21:33 UTC
19
points
7
comments
1
min read
LW
link
Enhancing intelligence by banging your head on the wall
Bezzi
12 Dec 2023 21:00 UTC
38
points
26
comments
1
min read
LW
link
Yamaha P-Series Overview
jefftk
12 Dec 2023 20:30 UTC
10
points
1
comment
1
min read
LW
link
(www.jefftk.com)