Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
A car journey with conservative evangelicals—Understanding some British political-religious beliefs
Nathan Young
Dec 6, 2024, 11:22 AM
41
points
8
comments
6
min read
LW
link
(nathanpmyoung.substack.com)
Frontier Models are Capable of In-context Scheming
Marius Hobbhahn
,
AlexMeinke
,
Bronson Schoen
,
rusheb
,
Jérémy Scheurer
and
Mikita Balesni
Dec 5, 2024, 10:11 PM
203
points
24
comments
7
min read
LW
link
Should you be worried about H5N1?
gw
Dec 5, 2024, 9:11 PM
89
points
2
comments
5
min read
LW
link
(www.georgeyw.com)
o1 tried to avoid being shut down
Raelifin
Dec 5, 2024, 7:52 PM
10
points
5
comments
1
min read
LW
link
(www.transformernews.ai)
More Growth, Melancholy, and MindCraft @3QD [revised and updated]
Bill Benzon
Dec 5, 2024, 7:36 PM
4
points
0
comments
4
min read
LW
link
Expevolu, a laissez-faire approach to country creation
Fernando
Dec 5, 2024, 7:29 PM
4
points
4
comments
44
min read
LW
link
(expevolu.substack.com)
Are SAE features from the Base Model still meaningful to LLaVA?
Shan23Chen
Dec 5, 2024, 7:24 PM
5
points
2
comments
10
min read
LW
link
OpenAI o1 + ChatGPT Pro release
anaguma
Dec 5, 2024, 7:13 PM
5
points
0
comments
1
min read
LW
link
(openai.com)
Smart people should do biology
Haotian
Dec 5, 2024, 7:11 PM
11
points
2
comments
3
min read
LW
link
Announcement: AI for Math Fund
sarahconstantin
Dec 5, 2024, 6:33 PM
20
points
9
comments
2
min read
LW
link
(renaissancephilanthropy.org)
Detection of Asymptomatically Spreading Pathogens
jefftk
Dec 5, 2024, 6:20 PM
45
points
8
comments
7
min read
LW
link
(www.jefftk.com)
Model Integrity: MAI on Value Alignment
Jonas Hallgren
Dec 5, 2024, 5:11 PM
6
points
11
comments
1
min read
LW
link
(meaningalignment.substack.com)
Social Science in its epistemological context
Arturo Macias
Dec 5, 2024, 4:12 PM
3
points
0
comments
1
min read
LW
link
(www.theseedsofscience.pub)
Higher and lower pleasures
Chris_Leong
Dec 5, 2024, 1:13 PM
19
points
3
comments
1
min read
LW
link
Sam Harris’s Argument For Objective Morality
Zero Contradictions
Dec 5, 2024, 10:19 AM
7
points
5
comments
1
min read
LW
link
(thewaywardaxolotl.blogspot.com)
Morality as Cooperation Part III: Failure Modes
DeLesley Hutchins
Dec 5, 2024, 9:39 AM
4
points
0
comments
20
min read
LW
link
Morality as Cooperation Part II: Theory and Experiment
DeLesley Hutchins
Dec 5, 2024, 9:04 AM
2
points
0
comments
17
min read
LW
link
Morality as Cooperation Part I: Humans
DeLesley Hutchins
Dec 5, 2024, 8:16 AM
5
points
0
comments
19
min read
LW
link
I Finally Worked Through Bayes’ Theorem (Personal Achievement)
keltan
Dec 5, 2024, 2:04 AM
53
points
7
comments
9
min read
LW
link
The Dream Machine
sarahconstantin
Dec 5, 2024, 12:00 AM
117
points
6
comments
12
min read
LW
link
(sarahconstantin.substack.com)
Should you have children? A decision framework for a crucial life choice that affects yourself, your child and the world
Sherrinford
Dec 4, 2024, 11:14 PM
0
points
1
comment
20
min read
LW
link
CCing Mailing Lists on External Communication
jefftk
Dec 4, 2024, 10:00 PM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Picking favourites is hard
dkl9
Dec 4, 2024, 8:46 PM
11
points
3
comments
1
min read
LW
link
(dkl9.net)
[Question]
How can I convince my cryptobro friend that S&P500 is efficient?
AhmedNeedsATherapist
Dec 4, 2024, 8:04 PM
−7
points
10
comments
1
min read
LW
link
The 2023 LessWrong Review: The Basic Ask
Raemon
Dec 4, 2024, 7:52 PM
77
points
25
comments
9
min read
LW
link
Is the AI Doomsday Narrative the Product of a Big Tech Conspiracy?
garrison
Dec 4, 2024, 7:20 PM
35
points
1
comment
LW
link
(garrisonlovely.substack.com)
[Question]
AI box question
KvmanThinking
Dec 4, 2024, 7:03 PM
2
points
2
comments
1
min read
LW
link
The Polite Coup
Charlie Sanders
Dec 4, 2024, 2:03 PM
3
points
0
comments
3
min read
LW
link
(www.dailymicrofiction.com)
Analysis of Global AI Governance Strategies
Sammy Martin
,
Justin Bullock
and
Corin Katzke
Dec 4, 2024, 10:45 AM
49
points
10
comments
36
min read
LW
link
[Question]
Cryonics considerations: how big of a problem is ischemia?
kman
Dec 4, 2024, 4:45 AM
8
points
1
comment
1
min read
LW
link
AI #93: Happy Tuesday
Zvi
Dec 4, 2024, 12:30 AM
26
points
2
comments
23
min read
LW
link
(thezvi.wordpress.com)
A Qualitative Case for LTFF: Filling Critical Ecosystem Gaps
Linch
Dec 3, 2024, 9:57 PM
64
points
2
comments
LW
link
Deep Causal Transcoding: A Framework for Mechanistically Eliciting Latent Behaviors in Language Models
Andrew Mack
and
TurnTrout
Dec 3, 2024, 9:19 PM
106
points
8
comments
41
min read
LW
link
“Alignment at Large”: Bending the Arc of History Towards Life-Affirming Futures
welfvh
Dec 3, 2024, 9:17 PM
5
points
0
comments
4
min read
LW
link
Roots of Progress is hiring an event manager
jasoncrawford
Dec 3, 2024, 8:46 PM
10
points
0
comments
7
min read
LW
link
(rootsofprogress.notion.site)
Do simulacra dream of digital sheep?
EuanMcLean
Dec 3, 2024, 8:25 PM
16
points
36
comments
10
min read
LW
link
Orca communication project—seeking feedback (and collaborators)
Towards_Keeperhood
Dec 3, 2024, 5:29 PM
38
points
16
comments
2
min read
LW
link
Book a Time to Chat about Interp Research
Logan Riggs
Dec 3, 2024, 5:27 PM
47
points
3
comments
1
min read
LW
link
Balsa Research 2024 Update
Zvi
Dec 3, 2024, 12:30 PM
21
points
0
comments
5
min read
LW
link
(thezvi.wordpress.com)
First Solo Bus Ride
jefftk
Dec 3, 2024, 12:20 PM
28
points
1
comment
1
min read
LW
link
(www.jefftk.com)
How to make evals for the AISI evals bounty
TheManxLoiner
Dec 3, 2024, 10:44 AM
9
points
0
comments
5
min read
LW
link
Should there be just one western AGI project?
rosehadshar
and
Tom Davidson
Dec 3, 2024, 10:11 AM
78
points
75
comments
15
min read
LW
link
(www.forethought.org)
Cognitive Biases Contributing to AI X-risk — a deleted excerpt from my 2018 ARCHES draft
Andrew_Critch
Dec 3, 2024, 9:29 AM
48
points
2
comments
5
min read
LW
link
[Question]
What is your opinion of Dr. Angelo Dilullo(meditation)?
Suh_Prance_Alot
Dec 3, 2024, 5:54 AM
0
points
2
comments
1
min read
LW
link
Chemical Turing Machines
Yudhister Kumar
Dec 3, 2024, 5:26 AM
10
points
2
comments
4
min read
LW
link
(www.yudhister.me)
MIRI’s 2024 End-of-Year Update
Rob Bensinger
Dec 3, 2024, 4:33 AM
98
points
2
comments
4
min read
LW
link
Linkpost: Rat Traps by Sheon Han in Asterisk Mag
Chris_Leong
Dec 3, 2024, 3:22 AM
12
points
7
comments
1
min read
LW
link
(asteriskmag.com)
[Question]
Who are the worthwhile non-European pre-Industrial thinkers?
Lorec
Dec 3, 2024, 1:45 AM
12
points
4
comments
1
min read
LW
link
A Paradox of Simulated Suffering
arusarda
Dec 2, 2024, 11:44 PM
−3
points
3
comments
1
min read
LW
link
Levels of Thought: from Points to Fields
HNX
Dec 2, 2024, 8:25 PM
4
points
2
comments
23
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel