Open Agency Architecture

Tag, last edit: 10 Apr 2024 22:53 UTC by Chipmonk

The Open Agency Architecture (“OAA”) is an AI alignment proposal by (among others) @davidad and @Eric Drexler.

See Davidad’s Provably Safe AI Architecture—ARIA’s Programme Thesis for the most up-to-date (2024 Feb 1) and quite readable explanation.

A shorter but older explanation is also available here.

Newer, better link: https://www.aria.org.uk/programme-safeguarded-ai/

Related:

Davidad’s Bold Plan for Alignment: An In-Depth Explanation

19 Apr 2023 16:09 UTC
157 points
34 comments, 21 min read, LW link

An Open Agency Architecture for Safe Transformative AI

davidad, 20 Dec 2022 13:04 UTC
79 points
22 comments, 4 min read, LW link

A list of core AI safety problems and how I hope to solve them

davidad, 26 Aug 2023 15:12 UTC
165 points
29 comments, 5 min read, LW link

Gaia Network: a practical, incremental pathway to Open Agency Architecture

20 Dec 2023 17:11 UTC
22 points
8 comments, 16 min read, LW link

The Open Agency Model

Eric Drexler, 22 Feb 2023 10:35 UTC
114 points
18 comments, 4 min read, LW link

SociaLLM: proposal for a language model design for personalised apps, social science, and AI safety research

Roman Leventov, 19 Dec 2023 16:49 UTC
17 points
5 comments, 3 min read, LW link

AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them

Roman Leventov, 27 Dec 2023 14:51 UTC
33 points
9 comments, 4 min read, LW link

Provably Safe AI: Worldview and Projects

9 Aug 2024 23:21 UTC
51 points
43 comments, 7 min read, LW link

Roadmap for a collaborative prototype of an Open Agency Architecture

Deger Turan, 10 May 2023 17:41 UTC
31 points
0 comments, 12 min read, LW link

Safety First: safety before full alignment. The deontic sufficiency hypothesis.

Chipmonk, 3 Jan 2024 17:55 UTC
48 points
3 comments, 3 min read, LW link

Apply to the Conceptual Boundaries Workshop for AI Safety

Chipmonk, 27 Nov 2023 21:04 UTC
50 points
0 comments, 3 min read, LW link

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Joar Skalse, 17 May 2024 19:13 UTC
65 points
10 comments, 2 min read, LW link

«Boundaries/Membranes» and AI safety compilation

Chipmonk, 3 May 2023 21:41 UTC
57 points
17 comments, 8 min read, LW link

Davidad’s Provably Safe AI Architecture—ARIA’s Programme Thesis

simeon_c, 1 Feb 2024 21:30 UTC
69 points
17 comments, 1 min read, LW link
(www.aria.org.uk)

What does davidad want from «boundaries»?

6 Feb 2024 17:45 UTC
44 points
1 comment, 5 min read, LW link

Announcing Atlas Computing

miyazono, 11 Apr 2024 15:56 UTC
44 points
4 comments, 4 min read, LW link

Why I find Davidad’s plan interesting

Paul W, 20 May 2024 8:13 UTC
18 points
0 comments, 6 min read, LW link