RSS

David Udell

Karma: 2,347

Shard The­ory: An Overview

David Udell11 Aug 2022 5:44 UTC
161 points
34 comments10 min readLW link

Gato as the Dawn of Early AGI

David Udell15 May 2022 6:52 UTC
85 points
29 comments12 min readLW link

Dath Ilan’s Views on Stop­gap Corrigibility

David Udell22 Sep 2022 16:16 UTC
77 points
19 comments13 min readLW link
(www.glowfic.com)

Lin­ear Alge­bra Done Right, Axler

David Udell2 Jan 2023 22:54 UTC
56 points
6 comments9 min readLW link

Con­se­quen­tial­ists: One-Way Pat­tern Traps

David Udell16 Jan 2023 20:48 UTC
54 points
3 comments14 min readLW link

Ac­cept­abil­ity Ver­ifi­ca­tion: A Re­search Agenda

12 Jul 2022 20:11 UTC
50 points
0 comments1 min readLW link
(docs.google.com)

The “Adults in the Room”

David Udell17 May 2022 4:03 UTC
49 points
2 comments4 min readLW link

The Shard The­ory Align­ment Scheme

David Udell25 Aug 2022 4:52 UTC
47 points
32 comments2 min readLW link

Sparse Cod­ing, for Mechanis­tic In­ter­pretabil­ity and Ac­ti­va­tion Engineering

David Udell23 Sep 2023 19:16 UTC
42 points
7 comments34 min readLW link

Your Utility Func­tion is Your Utility Function

David Udell6 May 2022 7:15 UTC
39 points
17 comments2 min readLW link

Team Shard Sta­tus Report

David Udell9 Aug 2022 5:33 UTC
38 points
8 comments3 min readLW link

Fram­ing AI Childhoods

David Udell6 Sep 2022 23:40 UTC
37 points
8 comments4 min readLW link

On Defect­ing On Yourself

David Udell18 Mar 2022 2:21 UTC
35 points
6 comments4 min readLW link

Dath Ilan vs. Sid Meier’s Alpha Cen­tauri: Pareto Improvements

David Udell28 Apr 2022 19:26 UTC
34 points
16 comments2 min readLW link

Find­ing Skele­tons on Rashomon Ridge

24 Jul 2022 22:31 UTC
30 points
2 comments7 min readLW link

Prob­a­bil­ity The­ory: The Logic of Science, Jaynes

David Udell16 Feb 2023 21:57 UTC
29 points
0 comments18 min readLW link

Guidelines for Mad Entrepreneurs

David Udell16 Sep 2022 6:33 UTC
26 points
0 comments11 min readLW link

Agency and Coherence

David Udell26 Mar 2022 19:25 UTC
25 points
2 comments3 min readLW link

But What’s Your *New Align­ment In­sight,* out of a Fu­ture-Text­book Para­graph?

David Udell7 May 2022 3:10 UTC
25 points
18 comments5 min readLW link

Ne­go­ti­at­ing Up and Down the Si­mu­la­tion Hier­ar­chy: Why We Might Sur­vive the Unal­igned Singularity

David Udell4 May 2022 4:21 UTC
25 points
14 comments2 min readLW link