List: value-alignment subjects

Wiki (last edit: 12 Feb 2026 10:38 UTC by Mateusz Bagiński)

Safety paradigm for advanced agents

Foreseen difficulties

Reflectivity problems

Foreseen normal difficulties

General agent theory

Value theory

Larger research agendas

Possible future use-cases

Possible escape routes

Background

Strategy