Open Agency Architecture

TagLast edit: 10 Apr 2024 22:53 UTC by Chris Lakin

The Open Agency Architecture (“OAA”) is an AI alignment proposal by (among others) @davidad and @Eric Drexler.

See Davidad’s Provably Safe AI Architecture—ARIA’s Programme Thesis for the most up-to-date (2024 Feb 1) and quite readable explanation.

A shorter but older explanation is also available here.

New better link: https://www.aria.org.uk/programme-safeguarded-ai/

Atlas Computing is the org intended to house OAA.
Gaia Network is a variant of an open agency architecture. Gaia Network is related to (the) OAA, but not directly descending from it.

Davidad’s Bold Plan for Alignment: An In-Depth Explanation

Charbel-Raphaël and Gabin

19 Apr 2023 16:09 UTC

167 points

40 comments21 min readLW link 2 reviews

An Open Agency Architecture for Safe Transformative AI

davidad20 Dec 2022 13:04 UTC

80 points

22 comments4 min readLW link

A list of core AI safety problems and how I hope to solve them

davidad26 Aug 2023 15:12 UTC

165 points

29 comments5 min readLW link

The Open Agency Model

Eric Drexler22 Feb 2023 10:35 UTC

114 points

19 comments4 min readLW link

Gaia Network: a practical, incremental pathway to Open Agency Architecture

Roman Leventov and Rafael Kaufmann Nedal

20 Dec 2023 17:11 UTC

23 points

8 comments16 min readLW link

SociaLLM: proposal for a language model design for personalised apps, social science, and AI safety research

Roman Leventov19 Dec 2023 16:49 UTC

17 points

5 comments3 min readLW link

AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them

Roman Leventov27 Dec 2023 14:51 UTC

33 points

9 comments4 min readLW link

Roadmap for a collaborative prototype of an Open Agency Architecture

Deger Turan10 May 2023 17:41 UTC

31 points

0 comments12 min readLW link

Safety First: safety before full alignment. The deontic sufficiency hypothesis.

Chris Lakin3 Jan 2024 17:55 UTC

48 points

3 comments3 min readLW link

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Joar Skalse17 May 2024 19:13 UTC

67 points

10 comments2 min readLW link

Apply to the Conceptual Boundaries Workshop for AI Safety

Chris Lakin27 Nov 2023 21:04 UTC

50 points

0 comments3 min readLW link

Exploring a Vision for AI as Compassionate, Emotionally Intelligent Partners — Seeking Collaboration and Insights

theophilos14 Jul 2025 23:22 UTC

1 point

0 comments1 min readLW link

What does davidad want from «boundaries»?

Chris Lakin and davidad

6 Feb 2024 17:45 UTC

46 points

1 comment5 min readLW link

Provably Safe AI: Worldview and Projects

Ben Goldhaber and Steve_Omohundro

9 Aug 2024 23:21 UTC

58 points

44 comments7 min readLW link

Why I find Davidad’s plan interesting

Paul W20 May 2024 8:13 UTC

18 points

0 comments6 min readLW link

«Boundaries/Membranes» and AI safety compilation

Chris Lakin3 May 2023 21:41 UTC

56 points

17 comments8 min readLW link

Davidad’s Provably Safe AI Architecture—ARIA’s Programme Thesis

simeon_c1 Feb 2024 21:30 UTC

69 points

17 comments1 min readLW link

(www.aria.org.uk)

Announcing Atlas Computing

miyazono11 Apr 2024 15:56 UTC

45 points

4 comments4 min readLW link

No comments.