Tamsin Leake

Karma: 2,589

I’m Tamsin Leake, co-founder and head of research at Orthogonal, doing agent foundations.

Orthogonal: A new agent foundations alignment organization

Tamsin Leake · 19 Apr 2023 20:17 UTC
207 points
4 comments · 1 min read · LW link
(orxl.org)

the QACI alignment plan: table of contents

Tamsin Leake · 21 Mar 2023 20:22 UTC
102 points
0 comments · 1 min read · LW link
(carado.moe)

everything is okay

Tamsin Leake · 23 Aug 2022 9:20 UTC
98 points
22 comments · 7 min read · LW link · 2 reviews
(carado.moe)

continue working on hard alignment! don’t give up!

Tamsin Leake · 24 Mar 2023 0:14 UTC
82 points
45 comments · 1 min read · LW link
(carado.moe)

publishing alignment research and exfohazards

Tamsin Leake · 31 Oct 2022 18:02 UTC
80 points
12 comments · 1 min read · LW link · 1 review
(carado.moe)

How LDT helps reduce the AI arms race

Tamsin Leake · 10 Dec 2023 16:21 UTC
70 points
13 comments · 4 min read · LW link
(carado.moe)

We’re all in this together

Tamsin Leake · 5 Dec 2023 13:57 UTC
68 points
65 comments · 2 min read · LW link
(carado.moe)

Orthogonal’s Formal-Goal Alignment theory of change

Tamsin Leake · 5 May 2023 22:36 UTC
68 points
12 comments · 4 min read · LW link
(carado.moe)

So you want to save the world? An account in paladinhood

Tamsin Leake · 22 Nov 2023 17:40 UTC
65 points
19 comments · 15 min read · LW link
(carado.moe)

my current outlook on AI risk mitigation

Tamsin Leake · 3 Oct 2022 20:06 UTC
63 points
6 comments · 11 min read · LW link
(carado.moe)

your terminal values are complex and not objective

Tamsin Leake · 13 Mar 2023 13:34 UTC
60 points
6 comments · 2 min read · LW link
(carado.moe)

so you think you’re not qualified to do technical alignment research?

Tamsin Leake · 7 Feb 2023 1:54 UTC
55 points
7 comments · 1 min read · LW link
(carado.moe)

formal alignment: what it is, and some proposals

Tamsin Leake · 29 Jan 2023 11:32 UTC
53 points
3 comments · 1 min read · LW link
(carado.moe)

a narrative explanation of the QACI alignment plan

Tamsin Leake · 15 Feb 2023 3:28 UTC
53 points
29 comments · 6 min read · LW link
(carado.moe)

formalizing the QACI alignment formal-goal

10 Jun 2023 3:28 UTC
53 points
6 comments · 14 min read · LW link
(carado.moe)

state of my alignment research, and what needs work

Tamsin Leake · 3 Mar 2023 10:28 UTC
51 points
0 comments · 2 min read · LW link
(carado.moe)

PreDCA: vanessa kosoy’s alignment protocol

Tamsin Leake · 20 Aug 2022 10:03 UTC
50 points
8 comments · 7 min read · LW link
(carado.moe)

ethics and anthropics of homomorphically encrypted computations

Tamsin Leake · 9 Sep 2022 10:49 UTC
47 points
49 comments · 3 min read · LW link
(carado.moe)

an Evangelion dialogue explaining the QACI alignment plan

Tamsin Leake · 10 Jun 2023 3:28 UTC
45 points
15 comments · 43 min read · LW link
(carado.moe)

the Insulated Goal-Program idea

Tamsin Leake · 13 Aug 2022 9:57 UTC
43 points
4 comments · 2 min read · LW link
(carado.moe)