Experiments in instrumental convergence

12 Oct 2022 21:15 UTC

This sequence investigates instrumental convergence and power-seeking through a series of experiments in multi-agent RL.

The key question we explore: If humans build AIs that learn faster than we do, will those AIs compete with us by default?

Instrumental convergence in single-agent systems

12 Oct 2022 12:24 UTC

33 points

(www.gladstone.ai)

13 Oct 2022 15:38 UTC

21 points

(www.gladstone.ai)

14 Oct 2022 15:50 UTC

22 points

(www.gladstone.ai)

Edouard Harris24 Oct 2022 20:03 UTC

29 points

(github.com)