RSS

tom4everitt(Tom Everitt)

Karma: 432

Research Scientist at DeepMind

tomeveritt.se

Re­ward Hack­ing from a Causal Perspective

21 Jul 2023 18:27 UTC
29 points
5 comments7 min readLW link

In­cen­tives from a causal perspective

10 Jul 2023 17:16 UTC
27 points
0 comments6 min readLW link

Agency from a causal perspective

30 Jun 2023 17:37 UTC
38 points
5 comments6 min readLW link

Causal­ity: A Brief Introduction

20 Jun 2023 15:01 UTC
48 points
18 comments6 min readLW link

In­tro­duc­tion to Towards Causal Foun­da­tions of Safe AGI

12 Jun 2023 17:55 UTC
67 points
6 comments4 min readLW link

Progress on Causal In­fluence Diagrams

tom4everitt30 Jun 2021 15:34 UTC
73 points
6 comments9 min readLW link

Speci­fi­ca­tion gam­ing: the flip side of AI ingenuity

6 May 2020 23:51 UTC
65 points
9 comments6 min readLW link

CIRL Wireheading

tom4everitt8 Aug 2017 6:33 UTC
3 points
4 comments2 min readLW link

Se­quen­tial Ex­ten­sions of Causal and Ev­i­den­tial De­ci­sion Theory

tom4everitt15 Oct 2015 23:45 UTC
2 points
0 comments1 min readLW link
(arxiv.org)