Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
David Udell
Karma:
2,347
All
Posts
Comments
New
Top
Old
Page
1
Shard Theory: An Overview
David Udell
11 Aug 2022 5:44 UTC
161
points
34
comments
10
min read
LW
link
Gato as the Dawn of Early AGI
David Udell
15 May 2022 6:52 UTC
85
points
29
comments
12
min read
LW
link
Dath Ilan’s Views on Stopgap Corrigibility
David Udell
22 Sep 2022 16:16 UTC
77
points
19
comments
13
min read
LW
link
(www.glowfic.com)
Linear Algebra Done Right, Axler
David Udell
2 Jan 2023 22:54 UTC
56
points
6
comments
9
min read
LW
link
Consequentialists: One-Way Pattern Traps
David Udell
16 Jan 2023 20:48 UTC
54
points
3
comments
14
min read
LW
link
Acceptability Verification: A Research Agenda
David Udell
and
evhub
12 Jul 2022 20:11 UTC
50
points
0
comments
1
min read
LW
link
(docs.google.com)
The “Adults in the Room”
David Udell
17 May 2022 4:03 UTC
49
points
2
comments
4
min read
LW
link
The Shard Theory Alignment Scheme
David Udell
25 Aug 2022 4:52 UTC
47
points
32
comments
2
min read
LW
link
Sparse Coding, for Mechanistic Interpretability and Activation Engineering
David Udell
23 Sep 2023 19:16 UTC
42
points
7
comments
34
min read
LW
link
Your Utility Function is Your Utility Function
David Udell
6 May 2022 7:15 UTC
39
points
17
comments
2
min read
LW
link
Team Shard Status Report
David Udell
9 Aug 2022 5:33 UTC
38
points
8
comments
3
min read
LW
link
Framing AI Childhoods
David Udell
6 Sep 2022 23:40 UTC
37
points
8
comments
4
min read
LW
link
On Defecting On Yourself
David Udell
18 Mar 2022 2:21 UTC
35
points
6
comments
4
min read
LW
link
Dath Ilan vs. Sid Meier’s Alpha Centauri: Pareto Improvements
David Udell
28 Apr 2022 19:26 UTC
34
points
16
comments
2
min read
LW
link
Finding Skeletons on Rashomon Ridge
David Udell
,
Peter S. Park
and
NickyP
24 Jul 2022 22:31 UTC
30
points
2
comments
7
min read
LW
link
Probability Theory: The Logic of Science, Jaynes
David Udell
16 Feb 2023 21:57 UTC
29
points
0
comments
18
min read
LW
link
Guidelines for Mad Entrepreneurs
David Udell
16 Sep 2022 6:33 UTC
26
points
0
comments
11
min read
LW
link
Agency and Coherence
David Udell
26 Mar 2022 19:25 UTC
25
points
2
comments
3
min read
LW
link
But What’s Your *New Alignment Insight,* out of a Future-Textbook Paragraph?
David Udell
7 May 2022 3:10 UTC
25
points
18
comments
5
min read
LW
link
Negotiating Up and Down the Simulation Hierarchy: Why We Might Survive the Unaligned Singularity
David Udell
4 May 2022 4:21 UTC
25
points
14
comments
2
min read
LW
link
Back to top
Next