RSS

Francis Rhys Ward

Karma: 302

On Agent In­cen­tives to Ma­nipu­late Hu­man Feed­back in Multi-Agent Re­ward Learn­ing Scenarios

Francis Rhys Ward3 Apr 2022 18:20 UTC
27 points
10 comments8 min readLW link

For ev­ery choice of AGI difficulty, con­di­tion­ing on grad­ual take-off im­plies shorter timelines.

Francis Rhys Ward21 Apr 2022 7:44 UTC
31 points
13 comments3 min readLW link