Cleo Nardo
Karma: 2,717
DMs open.
Can SAE steering reveal sandbagging?
jordine, Hoang Khiem, Felix Hofstätter and Cleo Nardo
Apr 15, 2025, 12:33 PM
35
points
3
comments
4
min read
Rethinking Laplace’s Rule of Succession
Cleo Nardo
Nov 22, 2024, 6:46 PM
11
points
5
comments
2
min read
Appraising aggregativism and utilitarianism
Cleo Nardo
Jun 21, 2024, 11:10 PM
27
points
10
comments
19
min read
Aggregative principles approximate utilitarian principles
Cleo Nardo
Jun 12, 2024, 4:27 PM
28
points
3
comments
23
min read
Aggregative Principles of Social Justice
Cleo Nardo
Jun 5, 2024, 1:44 PM
29
points
10
comments
37
min read
Shortform
Cleo Nardo
Mar 1, 2024, 6:20 PM
5
points
101
comments
1
min read
Uncertainty in all its flavours
Cleo Nardo
Jan 9, 2024, 4:21 PM
34
points
6
comments
35
min read
Game Theory without Argmax [Part 2]
Cleo Nardo
Nov 11, 2023, 4:02 PM
31
points
14
comments
13
min read
Game Theory without Argmax [Part 1]
Cleo Nardo
Nov 11, 2023, 3:59 PM
70
points
18
comments
19
min read
MetaAI: less is less for alignment.
Cleo Nardo
Jun 13, 2023, 2:08 PM
71
points
17
comments
5
min read
Rishi Sunak mentions “existential threats” in talk with OpenAI, DeepMind, Anthropic CEOs
Arjun Panickssery, Baldassare Castiglione and Cleo Nardo
May 24, 2023, 9:06 PM
34
points
1
comment
1
min read
(www.gov.uk)
List of requests for an AI slowdown/halt.
Cleo Nardo
Apr 14, 2023, 11:55 PM
46
points
6
comments
1
min read
Excessive AI growth-rate yields little socio-economic benefit.
Cleo Nardo
Apr 4, 2023, 7:13 PM
27
points
22
comments
4
min read
AI Summer Harvest
Cleo Nardo
Apr 4, 2023, 3:35 AM
130
points
10
comments
1
min read
The 0.2 OOMs/year target
Cleo Nardo
Mar 30, 2023, 6:15 PM
84
points
24
comments
5
min read
Wittgenstein and ML — parameters vs architecture
Cleo Nardo
Mar 24, 2023, 4:54 AM
44
points
9
comments
5
min read
Remarks 1–18 on GPT (compressed)
Cleo Nardo
Mar 20, 2023, 10:27 PM
145
points
35
comments
31
min read
The algorithm isn’t doing X, it’s just doing Y.
Cleo Nardo
Mar 16, 2023, 11:28 PM
53
points
43
comments
5
min read
Want to predict/explain/control the output of GPT-4? Then learn about the world, not about transformers.
Cleo Nardo
Mar 16, 2023, 3:08 AM
107
points
26
comments
5
min read
The Waluigi Effect (mega-post)
Cleo Nardo
Mar 3, 2023, 3:22 AM
629
points
188
comments
16
min read