Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
Page
1
Searching for Searching for Search
Rubi J. Hudson
Feb 14, 2024, 11:51 PM
21
points
4
comments
7
min read
LW
link
Some questions for the people at 80,000 Hours
yanni kyriacos
Feb 14, 2024, 11:15 PM
1
point
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Disrupting malicious uses of AI by state-affiliated threat actors
agucova
Feb 14, 2024, 9:28 PM
11
points
2
comments
LW
link
(openai.com)
Critiques of the AI control agenda
Jozdien
Feb 14, 2024, 7:25 PM
48
points
14
comments
9
min read
LW
link
Bad business advice
Logan Kieller
Feb 14, 2024, 5:01 PM
12
points
2
comments
3
min read
LW
link
(logankieller.substack.com)
Examples of governments doing good in house (or contracted) technical research
NathanBarnard
Feb 14, 2024, 4:22 PM
12
points
2
comments
2
min read
LW
link
[Question]
How can we legally/illegally enhance the progress of the law of accelerating returns in AI learning?
Gabi QUENE
Feb 14, 2024, 11:06 AM
−25
points
0
comments
1
min read
LW
link
[Question]
What experiment settles the Gary Marcus vs Geoffrey Hinton debate?
Valentin Baltadzhiev
Feb 14, 2024, 9:06 AM
12
points
8
comments
1
min read
LW
link
[Question]
Optimizing for Agency?
Michael Soareverix
Feb 14, 2024, 8:31 AM
10
points
9
comments
2
min read
LW
link
Requirements for a Basin of Attraction to Alignment
RogerDearnaley
Feb 14, 2024, 7:10 AM
41
points
12
comments
31
min read
LW
link
FTX expects to return all customer money; clawbacks may go away
Mikhail Samin
Feb 14, 2024, 3:43 AM
33
points
1
comment
LW
link
(www.nytimes.com)
Scale Was All We Needed, At First
Gabe M
Feb 14, 2024, 1:49 AM
295
points
34
comments
8
min read
LW
link
(aiacumen.substack.com)
CFAR Takeaways: Andrew Critch
Raemon
Feb 14, 2024, 1:37 AM
217
points
64
comments
5
min read
LW
link
Meetup In a Box: Year In Review
Czynski
Feb 14, 2024, 1:18 AM
26
points
1
comment
4
min read
LW
link
An EA used deceptive messaging to advance their project; we need mechanisms to avoid deontologically dubious plans
Mikhail Samin
Feb 13, 2024, 11:15 PM
24
points
1
comment
LW
link
Useful starting code for interpretability
eggsyntax
Feb 13, 2024, 11:13 PM
26
points
2
comments
1
min read
LW
link
Masterpiece
Richard_Ngo
Feb 13, 2024, 11:10 PM
166
points
21
comments
4
min read
LW
link
(www.narrativeark.xyz)
A Bridge Between Utilitarianism & Stoicism
Jonathan Moregård
Feb 13, 2024, 10:46 PM
5
points
0
comments
5
min read
LW
link
(honestliving.substack.com)
The “context window” analogy for human minds
Ruby
Feb 13, 2024, 7:29 PM
38
points
0
comments
2
min read
LW
link
More on the Apple Vision Pro
Zvi
Feb 13, 2024, 5:40 PM
33
points
5
comments
8
min read
LW
link
(thezvi.wordpress.com)
Linear White
Teja Prabhu
Feb 13, 2024, 4:31 PM
−3
points
3
comments
3
min read
LW
link
(krez.expert)
Causality is Everywhere
silentbob
Feb 13, 2024, 1:44 PM
26
points
12
comments
8
min read
LW
link
Technologies and Terminology: AI isn’t Software, it’s… Deepware?
Davidmanheim
and
abramdemski
Feb 13, 2024, 1:37 PM
40
points
10
comments
8
min read
LW
link
[Question]
LessWrong Is Very Wrong: Ultimately All Social Media Platforms Are The Same
Amritesh Kumar
Feb 13, 2024, 6:53 AM
−16
points
2
comments
1
min read
LW
link
Lsusr’s Rationality Dojo
lsusr
Feb 13, 2024, 5:52 AM
103
points
17
comments
2
min read
LW
link
[Question]
Where is the Town Square?
Gretta Duleba
Feb 13, 2024, 3:53 AM
46
points
8
comments
1
min read
LW
link
My cover story in Jacobin on AI capitalism and the x-risk debates
garrison
Feb 12, 2024, 11:34 PM
98
points
5
comments
LW
link
(jacobin.com)
What is Ontology?
martinkunev
Feb 12, 2024, 11:01 PM
4
points
0
comments
4
min read
LW
link
Thank you for triggering me
Cissy
Feb 12, 2024, 8:09 PM
6
points
1
comment
6
min read
LW
link
(www.moremyself.xyz)
Interpreting Quantum Mechanics in Infra-Bayesian Physicalism
Yegreg
Feb 12, 2024, 6:56 PM
30
points
6
comments
43
min read
LW
link
I played the AI box game as the Gatekeeper — and lost
datawitch
Feb 12, 2024, 6:39 PM
33
points
54
comments
4
min read
LW
link
The Last Laugh: Exploring the Role of Humor as a Benchmark for Large Language Models
Greg Robison
Feb 12, 2024, 6:34 PM
4
points
6
comments
11
min read
LW
link
Natural abstractions are observer-dependent: a conversation with John Wentworth
Martín Soto
Feb 12, 2024, 5:28 PM
39
points
13
comments
7
min read
LW
link
Tort Law Can Play an Important Role in Mitigating AI Risk
Gabriel Weil
Feb 12, 2024, 5:17 PM
39
points
9
comments
5
min read
LW
link
On the Proposed California SB 1047
Zvi
Feb 12, 2024, 4:40 PM
46
points
18
comments
12
min read
LW
link
(thezvi.wordpress.com)
Thoughts on “The Offense-Defense Balance Rarely Changes”
Cullen
Feb 12, 2024, 3:26 AM
46
points
4
comments
LW
link
Skepticism About DeepMind’s “Grandmaster-Level” Chess Without Search
Arjun Panickssery
Feb 12, 2024, 12:56 AM
57
points
13
comments
3
min read
LW
link
[Question]
What are the known difficulties with this alignment approach?
tailcalled
Feb 11, 2024, 10:52 PM
18
points
24
comments
1
min read
LW
link
[Question]
What are the deciding factors of human cognitive endurance?
koratkar
Feb 11, 2024, 9:56 PM
22
points
3
comments
1
min read
LW
link
Carl Shulman On Dwarkesh Podcast June 2023
Moonicker
Feb 11, 2024, 9:02 PM
18
points
0
comments
159
min read
LW
link
How do you actually obtain and report a likelihood function for scientific research?
Peter Berggren
Feb 11, 2024, 5:42 PM
55
points
4
comments
1
min read
LW
link
The entropy maxim for binary questions
dkl9
Feb 11, 2024, 5:17 PM
2
points
1
comment
1
min read
LW
link
(dkl9.net)
GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks
MiguelDev
Feb 11, 2024, 11:03 AM
16
points
4
comments
14
min read
LW
link
[Question]
What’s the theory of impact for activation vectors?
Chris_Leong
Feb 11, 2024, 7:34 AM
61
points
12
comments
1
min read
LW
link
Experimenting With Footboard Piezos
jefftk
Feb 11, 2024, 3:00 AM
11
points
2
comments
2
min read
LW
link
(www.jefftk.com)
The Core Values of Life—A proposal for a universal theory of ethics
Thomas Gjøstøl
Feb 10, 2024, 9:48 PM
2
points
4
comments
18
min read
LW
link
And All the Shoggoths Merely Players
Zack_M_Davis
10 Feb 2024 19:56 UTC
170
points
57
comments
12
min read
LW
link
Sam Altman’s Chip Ambitions Undercut OpenAI’s Safety Strategy
garrison
10 Feb 2024 19:52 UTC
198
points
52
comments
LW
link
(garrisonlovely.substack.com)
The lattice of partial updatelessness
Martín Soto
10 Feb 2024 17:34 UTC
23
points
5
comments
5
min read
LW
link
A Strange ACH Corner Case
jefftk
10 Feb 2024 3:00 UTC
27
points
2
comments
2
min read
LW
link
(www.jefftk.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel