Toying With Goal-Directedness

11 Jun 2020 18:27 UTC

We (Adam Shimi, Joe Collman, Michele Campolo and Sabrina Tang) are studying both how to formalize the intuitions behind goal-directedness and how it is relevant to AI Safety. This sequence collects posts that are not fully polished and that represent the ideas of their specific author more than a consensus within the group.

Goal-directed = Model-based RL?

adamShimi 20 Feb 2020 19:13 UTC

21 points

10 comments, 3 min read

Focus: you are allowed to be bad at accomplishing your goals

adamShimi 3 Jun 2020 21:04 UTC

19 points

17 comments, 3 min read

Goal-directedness is behavioral, not structural

adamShimi 8 Jun 2020 23:05 UTC

6 points

12 comments, 3 min read

Locality of goals

adamShimi 22 Jun 2020 21:56 UTC

16 points

8 comments, 6 min read

Goals and short descriptions

Michele Campolo 2 Jul 2020 17:41 UTC

14 points

8 comments, 5 min read

Goal-Directedness: What Success Looks Like

adamShimi 16 Aug 2020 18:33 UTC

9 points

0 comments, 2 min read