[Question]
What are the primary drivers that caused selection pressure for intelligence in humans?
Towards_Keeperhood
Nov 7, 2024, 9:40 AM
8
points
15
comments
1
min read
LW
link
The Logistics of Distribution of Meaning: Against Epistemic Bureaucratization
Sahil
Nov 7, 2024, 5:27 AM
27
points
7
comments
12
min read
LW
link
SAEs are highly dataset dependent: a case study on the refusal direction
Connor Kissane, robertzk, Neel Nanda and Arthur Conmy
Nov 7, 2024, 5:22 AM
66
points
4
comments
14
min read
LW
link
Should CA, TX, OK, and LA merge into a giant swing state, just for elections?
Thomas Kwa
Nov 6, 2024, 11:01 PM
115
points
35
comments
1
min read
LW
link
New Funding Category Open in Foresight’s AI Safety Grants
Allison Duettmann
Nov 6, 2024, 10:59 PM
15
points
0
comments
1
min read
LW
link
Scattered thoughts on what it means for an LLM to believe
TheManxLoiner
Nov 6, 2024, 10:10 PM
5
points
4
comments
5
min read
LW
link
The Bayesian Conspiracy Live Recording
Eneasz
Nov 6, 2024, 4:25 PM
9
points
0
comments
1
min read
LW
link
Anthropic: Three Sketches of ASL-4 Safety Case Components
Zach Stein-Perlman
Nov 6, 2024, 4:00 PM
95
points
33
comments
1
min read
LW
link
(alignment.anthropic.com)
Meme Talking Points
ymeskhout
Nov 6, 2024, 3:27 PM
34
points
0
comments
3
min read
LW
link
Advisors for Smaller Major Donors?
jefftk
Nov 6, 2024, 2:30 PM
18
points
2
comments
3
min read
LW
link
(www.jefftk.com)
Scissors Statements for President?
AnnaSalamon
Nov 6, 2024, 10:38 AM
118
points
32
comments
1
min read
LW
link
[Question]
How to cite LessWrong as an academic source?
PhilosophicalSoul
Nov 6, 2024, 8:28 AM
6
points
6
comments
1
min read
LW
link
How to put California and Texas on the campaign trail!
Yair Halberstadt
Nov 6, 2024, 6:08 AM
25
points
4
comments
1
min read
LW
link
LDT (and everything else) can be irrational
Christopher King
Nov 6, 2024, 4:05 AM
10
points
15
comments
2
min read
LW
link
Join my new subscriber chat
sarahconstantin
Nov 6, 2024, 2:30 AM
7
points
0
comments
1
min read
LW
link
(sarahconstantin.substack.com)
Graceful Degradation
Screwtape
Nov 5, 2024, 11:57 PM
83
points
8
comments
4
min read
LW
link
An alternative approach to superbabies
Towards_Keeperhood
Nov 5, 2024, 10:56 PM
48
points
19
comments
3
min read
LW
link
Apply to be a mentor in SPAR!
agucova
Nov 5, 2024, 9:32 PM
5
points
0
comments
LW
link
Going Beyond “immaturity”
moisentinel
Nov 5, 2024, 8:51 PM
−3
points
2
comments
2
min read
LW
link
Intent alignment as a stepping-stone to value alignment
Seth Herd
Nov 5, 2024, 8:43 PM
37
points
8
comments
3
min read
LW
link
Why Recursion Pharmaceuticals abandoned cell painting for brightfield imaging
Abhishaike Mahajan
Nov 5, 2024, 2:51 PM
29
points
1
comment
18
min read
LW
link
(www.owlposting.com)
Winning isn’t enough
JesseClifton and Anthony DiGiovanni
Nov 5, 2024, 11:37 AM
40
points
18
comments
9
min read
LW
link
Anthropic—The case for targeted regulation
anaguma
Nov 5, 2024, 7:07 AM
11
points
0
comments
2
min read
LW
link
(www.anthropic.com)
The Shallow Bench
Karl Faulks
Nov 5, 2024, 5:07 AM
48
points
5
comments
3
min read
LW
link
Using Narrative Prompting to Extract Policy Forecasts from LLMs
Max Ghenis
Nov 5, 2024, 4:37 AM
5
points
0
comments
1
min read
LW
link
ML4Good (AI Safety Bootcamp) - Experience report
JanEbbing
Nov 5, 2024, 1:18 AM
13
points
0
comments
3
min read
LW
link
Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities
Jonathan N, abra, Connor Axiotes and Esben Kran
Nov 5, 2024, 1:01 AM
8
points
0
comments
6
min read
LW
link
(www.apartresearch.com)
[Question]
Could orcas be (trained to be) smarter than humans?
Towards_Keeperhood
Nov 4, 2024, 11:29 PM
56
points
23
comments
1
min read
LW
link
Metastatic Cancer Treatment Since 2010: The Success Stories
sarahconstantin
Nov 4, 2024, 10:50 PM
51
points
2
comments
6
min read
LW
link
(sarahconstantin.substack.com)
Bay Winter Solstice 2024: Speech Auditions
ozymandias
Nov 4, 2024, 10:31 PM
32
points
1
comment
1
min read
LW
link
Empathy/Systemizing Quotient is a poor/biased model for the autism/sex link
tailcalled
Nov 4, 2024, 9:11 PM
43
points
0
comments
7
min read
LW
link
Distributed espionage
margetmagenta
Nov 4, 2024, 7:43 PM
3
points
0
comments
1
min read
LW
link
GPT-8 may not be ASI
rvzlxax409
Nov 4, 2024, 7:31 PM
−2
points
1
comment
3
min read
LW
link
AI timelines don’t account for base rate of tech progress
rvzlxax409
Nov 4, 2024, 7:31 PM
−10
points
2
comments
1
min read
LW
link
Update on the Mysterious Trump Buyers on Polymarket
Annapurna
Nov 4, 2024, 7:22 PM
19
points
9
comments
1
min read
LW
link
(jorgevelez.substack.com)
[Intuitive self-models] 8. Rooting Out Free Will Intuitions
Steven Byrnes
Nov 4, 2024, 6:16 PM
70
points
19
comments
24
min read
LW
link
Option control
Joe Carlsmith
Nov 4, 2024, 5:54 PM
28
points
0
comments
54
min read
LW
link
[Question]
Noticing the World
EvolutionByDesign
Nov 4, 2024, 4:41 PM
4
points
1
comment
1
min read
LW
link
The current state of RSPs
Zach Stein-Perlman
Nov 4, 2024, 4:00 PM
23
points
2
comments
9
min read
LW
link
[Question]
Does the “ancient wisdom” argument have any validity? If a particular teaching or tradition is old, to what extent does this make it more trustworthy?
SpectrumDT
Nov 4, 2024, 3:20 PM
18
points
49
comments
1
min read
LW
link
A brief history of the automated corporation
owencb
Nov 4, 2024, 2:35 PM
26
points
1
comment
5
min read
LW
link
(strangecities.substack.com)
Abstractions are not Natural
Alfred Harwood
Nov 4, 2024, 11:10 AM
25
points
21
comments
11
min read
LW
link
[Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Gunnar_Zarncke
Nov 4, 2024, 10:15 AM
13
points
0
comments
1
min read
LW
link
(arxiv.org)
Context-dependent consequentialism
Jeremy Gillen and mattmacdermott
Nov 4, 2024, 9:29 AM
31
points
6
comments
27
min read
LW
link
Survival without dignity
L Rudolf L
Nov 4, 2024, 2:29 AM
369
points
29
comments
15
min read
LW
link
(nosetgauge.substack.com)
Drug development costs can range over two orders of magnitude
rossry
Nov 3, 2024, 11:13 PM
38
points
0
comments
11
min read
LW
link
Redefining Tolerance: Beyond Popper’s Paradox
mindprison
Nov 3, 2024, 10:23 PM
−1
points
0
comments
3
min read
LW
link
Goal: Understand Intelligence
Johannes C. Mayer
Nov 3, 2024, 9:20 PM
14
points
19
comments
1
min read
LW
link
Current safety training techniques do not fully transfer to the agent setting
Simon Lermen and Govind Pimpale
Nov 3, 2024, 7:24 PM
158
points
9
comments
5
min read
LW
link
Why our politicians aren’t Median
Yair Halberstadt
Nov 3, 2024, 2:03 PM
62
points
15
comments
3
min read
LW
link