Tamsin Leake

Karma: 2,589

I’m Tamsin Leake, co-founder and head of research at Orthogonal, doing agent foundations.

Orthogonal: A new agent foundations alignment organization

Tamsin Leake19 Apr 2023 20:17 UTC

207 points

4 comments1 min readLW link

(orxl.org)

the QACI alignment plan: table of contents

Tamsin Leake21 Mar 2023 20:22 UTC

102 points

0 comments1 min readLW link

(carado.moe)

everything is okay

Tamsin Leake23 Aug 2022 9:20 UTC

98 points

22 comments7 min readLW link 2 reviews

(carado.moe)

continue working on hard alignment! don’t give up!

Tamsin Leake24 Mar 2023 0:14 UTC

82 points

45 comments1 min readLW link

(carado.moe)

publishing alignment research and exfohazards

Tamsin Leake31 Oct 2022 18:02 UTC

80 points

12 comments1 min readLW link 1 review

(carado.moe)

How LDT helps reduce the AI arms race

Tamsin Leake10 Dec 2023 16:21 UTC

70 points

13 comments4 min readLW link

(carado.moe)

We’re all in this together

Tamsin Leake5 Dec 2023 13:57 UTC

68 points

65 comments2 min readLW link

(carado.moe)

Orthogonal’s Formal-Goal Alignment theory of change

Tamsin Leake5 May 2023 22:36 UTC

68 points

12 comments4 min readLW link

(carado.moe)

So you want to save the world? An account in paladinhood

Tamsin Leake22 Nov 2023 17:40 UTC

65 points

19 comments15 min readLW link

(carado.moe)

my current outlook on AI risk mitigation

Tamsin Leake3 Oct 2022 20:06 UTC

63 points

6 comments11 min readLW link

(carado.moe)

your terminal values are complex and not objective

Tamsin Leake13 Mar 2023 13:34 UTC

60 points

6 comments2 min readLW link

(carado.moe)

so you think you’re not qualified to do technical alignment research?

Tamsin Leake7 Feb 2023 1:54 UTC

55 points

7 comments1 min readLW link

(carado.moe)

formal alignment: what it is, and some proposals

Tamsin Leake29 Jan 2023 11:32 UTC

53 points

3 comments1 min readLW link

(carado.moe)

a narrative explanation of the QACI alignment plan

Tamsin Leake15 Feb 2023 3:28 UTC

53 points

29 comments6 min readLW link

(carado.moe)

formalizing the QACI alignment formal-goal

Tamsin Leake and JuliaHP

10 Jun 2023 3:28 UTC

53 points

6 comments14 min readLW link

(carado.moe)

state of my alignment research, and what needs work

Tamsin Leake3 Mar 2023 10:28 UTC

51 points

0 comments2 min readLW link

(carado.moe)

PreDCA: vanessa kosoy’s alignment protocol

Tamsin Leake20 Aug 2022 10:03 UTC

50 points

8 comments7 min readLW link

(carado.moe)

ethics and anthropics of homomorphically encrypted computations

Tamsin Leake9 Sep 2022 10:49 UTC

47 points

49 comments3 min readLW link

(carado.moe)

an Evangelion dialogue explaining the QACI alignment plan

Tamsin Leake10 Jun 2023 3:28 UTC

45 points

15 comments43 min readLW link

(carado.moe)

the Insulated Goal-Program idea

Tamsin Leake13 Aug 2022 9:57 UTC

43 points

4 comments2 min readLW link

(carado.moe)