Cooperation, Conflict, and Transformative Artificial Intelligence: A Research Agenda

This is a sequence version of the Effective Altruism Foundation’s research agenda on Cooperation, Conflict, and Transformative Artificial Intelligence. The agenda outlines what we think are the most promising avenues for developing technical and governance interventions aimed at avoiding conflict between transformative AI systems. We draw on international relations, game theory, behavioral economics, machine learning, decision theory, and formal epistemology.

Preface to EAF’s Research Agenda on Cooperation, Conflict, and TAI

Sections 1 & 2: Introduction, Strategy and Governance

Sections 3 & 4: Credibility, Peaceful Bargaining Mechanisms

Sections 5 & 6: Contemporary Architectures, Humans in the Loop

Section 7: Foundations of Rational Agency

Acknowledgements & References