formal alignment: what it is, and some proposals


what i call “formal alignment” is an approach to solving AI alignment that consists of:

  • designing a formal goal, utility function, or decision process, which actually leads to desirable outcomes when pursued

  • building an AI that pursues such a goal, utility function, or decision process

those two points correspond to formal alignment’s notions of outer and inner alignment, respectively: determining what formal thing to align the AI to, and figuring out how to build something that is indeed aligned to it without running into inner misalignment issues.

for reasons why i think this is the least hopeless path to saving the world, see my outlook on AI risk mitigation. the core motivation for formal alignment, for me, is that a working solution is at least eventually aligned: there is an objective answer to the question “will maximizing this with arbitrary capabilities produce desirable outcomes?”, and that answer does not depend, at the limit, on what is doing the maximizing. the fact that such a formal thing is aligned in the limit makes it robust to sharp left turns. what remains then is just “bridging the gap”: getting from eventual to continuous alignment, perhaps by ensuring the right ordering of attained capabilities.
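to illustrate the shape of this claim (a toy sketch of my own, not taken from any of the proposals below): a formal goal is just a function from outcomes to values, and “aligned in the limit” means the best outcome is determined by that function alone, regardless of which optimizer finds it. here two maximizers with different search strategies necessarily agree on how good their best outcome is:

```python
# toy sketch (illustrative only): a formal goal as a utility function
# over tiny "worlds" (bitstrings). a real proposal would define it over
# much richer mathematical objects.
from itertools import product

def utility(world: tuple) -> float:
    # hypothetical stand-in for a formal goal
    return sum(world) - 2 * (world[0] * world[1])

worlds = list(product([0, 1], repeat=4))

# two different "maximizers" with different search orders:
best_brute = max(worlds, key=utility)
best_greedy = max(sorted(worlds, reverse=True), key=utility)

# both reach an outcome of the same value: the formal goal, not the
# optimizer, determines what counts as a best outcome.
assert utility(best_brute) == utility(best_greedy)
```

the point of the sketch is only the last line: the question “is the argmax of this function desirable?” has a fixed answer before any particular optimizer is specified.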

potential formal alignment ideas include:

  • June Ku’s metaethical AI (MAI): describing ethics directly, i think?

  • plex’s universal alignment test (UAT): throwing a weird simulation hypothesis at the AI which encourages it to align itself

  • Vanessa Kosoy’s PreDCA: making the AI implement its human predecessor’s values (as i understand PreDCA is not designed to be used as a formal alignment goal, but it seems like it might be able to fill that role)

  • my insulated goal-programs (IGP): aligning the AI to the simple goal of running a program which we’d expect to eventually contain desirable worlds

  • my question-answer counterfactual interval (QACI): using the AI’s past user’s counterfactual answers to various questions as its signal for aligned decisions (see also my attempt at formalizing QACI)

if there are formal alignment ideas i’m missing, please tell me about them and i’ll add them here.

because these various proposals consist of putting together a formal mathematical expression, they rely on finding various true names. for example: PreDCA tries to put together the true names for causality, agency, and the AI’s predecessor; IGP requires the true name for computing a program forwards; QACI requires a true name for identifying pieces of data in causal worlds, and replacing them with counterfactual alternatives; UAT requires the true names for parent universe/simulation, control over resources, and comparing amounts of resources with those in the AI’s future lightcone.
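to give a flavor of what a true name can look like in the easiest case: of the examples above, IGP’s “computing a program forwards” is plausibly the most tractable, because it is already close to a standard mathematical object, the n-fold iteration of a transition function over machine states. a minimal sketch (my own toy formalization, not from the IGP post):

```python
# toy sketch: a candidate "true name" for running a program forwards,
# as the n-fold iteration of a step (transition) function over states.
from typing import Callable, TypeVar

State = TypeVar("State")

def run(step: Callable[[State], State], state: State, n: int) -> State:
    # the formal object: step applied n times, i.e. step^n(state)
    for _ in range(n):
        state = step(state)
    return state

# example: a trivial "machine" whose state is an integer counter
assert run(lambda s: s + 1, 0, 5) == 5
```

the other true names listed (agency, causality, counterfactuals) are much harder precisely because they have no such off-the-shelf mathematical counterpart.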

see also: clarifying formal alignment implementation