Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
Page
1
An EA used deceptive messaging to advance their project; we need mechanisms to avoid deontologically dubious plans
Mikhail Samin
Feb 13, 2024, 11:15 PM
24
points
1
comment
LW
link
Useful starting code for interpretability
eggsyntax
Feb 13, 2024, 11:13 PM
26
points
2
comments
1
min read
LW
link
Masterpiece
Richard_Ngo
Feb 13, 2024, 11:10 PM
166
points
21
comments
4
min read
LW
link
(www.narrativeark.xyz)
A Bridge Between Utilitarianism & Stoicism
Jonathan Moregård
Feb 13, 2024, 10:46 PM
5
points
0
comments
5
min read
LW
link
(honestliving.substack.com)
The “context window” analogy for human minds
Ruby
Feb 13, 2024, 7:29 PM
38
points
0
comments
2
min read
LW
link
More on the Apple Vision Pro
Zvi
Feb 13, 2024, 5:40 PM
33
points
5
comments
8
min read
LW
link
(thezvi.wordpress.com)
Linear White
Teja Prabhu
Feb 13, 2024, 4:31 PM
−3
points
3
comments
3
min read
LW
link
(krez.expert)
Causality is Everywhere
silentbob
Feb 13, 2024, 1:44 PM
26
points
12
comments
8
min read
LW
link
Technologies and Terminology: AI isn’t Software, it’s… Deepware?
Davidmanheim
and
abramdemski
Feb 13, 2024, 1:37 PM
40
points
10
comments
8
min read
LW
link
[Question]
LessWrong Is Very Wrong: Ultimately All Social Media Platforms Are The Same
Amritesh Kumar
Feb 13, 2024, 6:53 AM
−16
points
2
comments
1
min read
LW
link
Lsusr’s Rationality Dojo
lsusr
Feb 13, 2024, 5:52 AM
103
points
17
comments
2
min read
LW
link
[Question]
Where is the Town Square?
Gretta Duleba
Feb 13, 2024, 3:53 AM
46
points
8
comments
1
min read
LW
link
My cover story in Jacobin on AI capitalism and the x-risk debates
garrison
Feb 12, 2024, 11:34 PM
98
points
5
comments
LW
link
(jacobin.com)
What is Ontology?
martinkunev
Feb 12, 2024, 11:01 PM
4
points
0
comments
4
min read
LW
link
Thank you for triggering me
Cissy
Feb 12, 2024, 8:09 PM
6
points
1
comment
6
min read
LW
link
(www.moremyself.xyz)
Interpreting Quantum Mechanics in Infra-Bayesian Physicalism
Yegreg
Feb 12, 2024, 6:56 PM
30
points
6
comments
43
min read
LW
link
I played the AI box game as the Gatekeeper — and lost
datawitch
Feb 12, 2024, 6:39 PM
33
points
54
comments
4
min read
LW
link
The Last Laugh: Exploring the Role of Humor as a Benchmark for Large Language Models
Greg Robison
Feb 12, 2024, 6:34 PM
4
points
6
comments
11
min read
LW
link
Natural abstractions are observer-dependent: a conversation with John Wentworth
Martín Soto
Feb 12, 2024, 5:28 PM
39
points
13
comments
7
min read
LW
link
Tort Law Can Play an Important Role in Mitigating AI Risk
Gabriel Weil
Feb 12, 2024, 5:17 PM
39
points
9
comments
5
min read
LW
link
On the Proposed California SB 1047
Zvi
Feb 12, 2024, 4:40 PM
46
points
18
comments
12
min read
LW
link
(thezvi.wordpress.com)
Thoughts on “The Offense-Defense Balance Rarely Changes”
Cullen
Feb 12, 2024, 3:26 AM
46
points
4
comments
LW
link
Skepticism About DeepMind’s “Grandmaster-Level” Chess Without Search
Arjun Panickssery
Feb 12, 2024, 12:56 AM
57
points
13
comments
3
min read
LW
link
[Question]
What are the known difficulties with this alignment approach?
tailcalled
Feb 11, 2024, 10:52 PM
18
points
24
comments
1
min read
LW
link
[Question]
What are the deciding factors of human cognitive endurance?
koratkar
Feb 11, 2024, 9:56 PM
22
points
3
comments
1
min read
LW
link
Carl Shulman On Dwarkesh Podcast June 2023
Moonicker
Feb 11, 2024, 9:02 PM
18
points
0
comments
159
min read
LW
link
How do you actually obtain and report a likelihood function for scientific research?
Peter Berggren
Feb 11, 2024, 5:42 PM
55
points
4
comments
1
min read
LW
link
The entropy maxim for binary questions
dkl9
Feb 11, 2024, 5:17 PM
2
points
1
comment
1
min read
LW
link
(dkl9.net)
GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks
MiguelDev
Feb 11, 2024, 11:03 AM
16
points
4
comments
14
min read
LW
link
[Question]
What’s the theory of impact for activation vectors?
Chris_Leong
Feb 11, 2024, 7:34 AM
61
points
12
comments
1
min read
LW
link
Experimenting With Footboard Piezos
jefftk
Feb 11, 2024, 3:00 AM
11
points
2
comments
2
min read
LW
link
(www.jefftk.com)
The Core Values of Life—A proposal for a universal theory of ethics
Thomas Gjøstøl
Feb 10, 2024, 9:48 PM
2
points
4
comments
18
min read
LW
link
And All the Shoggoths Merely Players
Zack_M_Davis
Feb 10, 2024, 7:56 PM
170
points
57
comments
12
min read
LW
link
Sam Altman’s Chip Ambitions Undercut OpenAI’s Safety Strategy
garrison
Feb 10, 2024, 7:52 PM
198
points
52
comments
LW
link
(garrisonlovely.substack.com)
The lattice of partial updatelessness
Martín Soto
Feb 10, 2024, 5:34 PM
23
points
5
comments
5
min read
LW
link
A Strange ACH Corner Case
jefftk
Feb 10, 2024, 3:00 AM
27
points
2
comments
2
min read
LW
link
(www.jefftk.com)
Dreams of AI alignment: The danger of suggestive names
TurnTrout
Feb 10, 2024, 1:22 AM
103
points
59
comments
4
min read
LW
link
Scenario planning for AI x-risk
Corin Katzke
Feb 10, 2024, 12:14 AM
24
points
12
comments
14
min read
LW
link
(forum.effectivealtruism.org)
Close the Gates to an Inhuman Future: How and why we should choose to not develop superhuman general-purpose artificial intelligence
aaguirre
Feb 9, 2024, 8:25 PM
13
points
0
comments
1
min read
LW
link
(arxiv.org)
[Crosspost] Deep Dive: The Coming Technological Singularity—How to survive in a Post-human Era
simulacra.exe
Feb 9, 2024, 6:49 PM
2
points
2
comments
9
min read
LW
link
The Ideal Speech Situation as a Tool for AI Ethical Reflection: A Framework for Alignment
kenneth myers
Feb 9, 2024, 6:40 PM
6
points
12
comments
3
min read
LW
link
What’s ChatGPT’s Favorite Ice Cream Flavor? An Investigation Into Synthetic Respondents
Greg Robison
Feb 9, 2024, 6:38 PM
19
points
4
comments
15
min read
LW
link
OpenAI wants to raise 5-7 trillion
O O
Feb 9, 2024, 4:15 PM
13
points
29
comments
1
min read
LW
link
(decrypt.co)
[Question]
Constituency-sized AI congress?
Nathan Helm-Burger
Feb 9, 2024, 4:01 PM
11
points
5
comments
1
min read
LW
link
One True Love
Zvi
Feb 9, 2024, 3:10 PM
34
points
7
comments
10
min read
LW
link
(thezvi.wordpress.com)
[Question]
Executive function advice from people who are good at it?
TeaTieAndHat
Feb 9, 2024, 10:11 AM
7
points
1
comment
1
min read
LW
link
[Question]
Do you want to make an AI Alignment song?
Kabir Kumar
Feb 9, 2024, 8:22 AM
4
points
0
comments
1
min read
LW
link
Skills I’d like my collaborators to have
Raemon
Feb 9, 2024, 8:20 AM
106
points
9
comments
8
min read
LW
link
Transfer learning and generalization-qua-capability in Babbage and Davinci (or, why division is better than Spanish)
RP
and
agg
Feb 9, 2024, 7:00 AM
50
points
6
comments
3
min read
LW
link
Biden-Harris Administration Announces First-Ever Consortium Dedicated to AI Safety
Ben Smith
Feb 9, 2024, 6:40 AM
22
points
0
comments
LW
link
(www.nist.gov)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel