sbenthall

Karma: 374

Reward Hacking from a Causal Perspective

tom4everitt, Francis Rhys Ward, sbenthall, James Fox, mattmacdermott and RyanCarey

21 Jul 2023 18:27 UTC

29 points

6 comments7 min readLW link

Incentives from a causal perspective

tom4everitt, James Fox, RyanCarey, mattmacdermott, sbenthall and Jonathan Richens

10 Jul 2023 17:16 UTC

27 points

0 comments6 min readLW link

Causality: A Brief Introduction

tom4everitt, Lewis Hammond, Jonathan Richens, Francis Rhys Ward, RyanCarey, sbenthall and James Fox

20 Jun 2023 15:01 UTC

49 points

18 comments6 min readLW link

Introduction to Towards Causal Foundations of Safe AGI

tom4everitt, Lewis Hammond, Francis Rhys Ward, RyanCarey, James Fox, mattmacdermott and sbenthall

12 Jun 2023 17:55 UTC

74 points

6 comments4 min readLW link

Don’t Fear the Reaper: Refuting Bostrom’s Superintelligence Argument

sbenthall1 Mar 2017 14:28 UTC

9 points

20 comments1 min readLW link

Autonomy, utility, and desire; against consequentialism in AI design

sbenthall3 Dec 2014 17:34 UTC

7 points

5 comments3 min readLW link

more on predicting agents

sbenthall8 Nov 2014 6:43 UTC

1 point

11 comments2 min readLW link

prediction and capacity to represent

sbenthall4 Nov 2014 6:09 UTC

−9 points

20 comments1 min readLW link

AI Tao

sbenthall21 Oct 2014 1:15 UTC

−17 points

3 comments1 min readLW link

What is optimization power, formally?

sbenthall18 Oct 2014 18:37 UTC

18 points

16 comments2 min readLW link

Depth-based supercontroller objectives, take 2

sbenthall24 Sep 2014 1:25 UTC

1 point

24 comments7 min readLW link

Everybody’s talking about machine ethics

sbenthall17 Sep 2014 17:20 UTC

24 points

16 comments1 min readLW link

Proposal: Use logical depth relative to human history as objective function for superintelligence

sbenthall14 Sep 2014 20:00 UTC

10 points

23 comments3 min readLW link

Intelligence explosion in organizations, or why I’m not worried about the singularity

sbenthall27 Dec 2012 4:32 UTC

13 points

187 comments3 min readLW link