RSS

Tiling Agents

TagLast edit: 16 Jul 2024 10:54 UTC by Mateusz Bagiński

An agent might have the ability to create similar or slightly better versions of itself. These new agents can in turn create similar /​ better versions of themselves, and so on in a repeating pattern. This is referred to as an agent tiling itself.

This leads to the question: How can the original agent trust that these recursively generated agents maintain goals that are similar to the original agent’s objective?

In a deterministic logical system, assuming that all agents will share the same axioms, “trust” arises from being able to formally prove that the conclusions reached by any subsequently generated agents will be true. The possibility to be able to have this form of trust is influenced by Löb’s theorem. The inability to form this trust is called the Löbian obstacle.

See Also: Löbian obstacle, Löbs theorem, Vingean Agents, Vingean Reflection

References :

Tiling Agents for Self-Mod­ify­ing AI (OPFAI #2)

Eliezer Yudkowsky6 Jun 2013 20:24 UTC
88 points
259 comments3 min readLW link

Walk­through of the Tiling Agents for Self-Mod­ify­ing AI paper

So8res13 Dec 2013 3:23 UTC
29 points
18 comments21 min readLW link

Vingean Reflec­tion: Reli­able Rea­son­ing for Self-Im­prov­ing Agents

So8res15 Jan 2015 22:47 UTC
37 points
5 comments9 min readLW link

The Löbian Ob­sta­cle, And Why You Should Care

lukemarks7 Sep 2023 23:59 UTC
18 points
6 comments2 min readLW link

Log­i­cal In­duc­tor Tiling and Why it’s Hard

Diffractor14 Jun 2018 6:34 UTC
4 points
0 comments12 min readLW link

Prob­a­bil­is­tic Tiling (Pre­limi­nary At­tempt)

Diffractor7 Aug 2018 1:14 UTC
14 points
8 comments9 min readLW link

Leav­ing MIRI, Seek­ing Funding

abramdemski8 Aug 2024 18:32 UTC
264 points
19 comments2 min readLW link

Seek­ing Collaborators

abramdemski1 Nov 2024 17:13 UTC
61 points
15 comments7 min readLW link

Tiling agents with trans­finite para­met­ric polymorphism

Squark9 May 2014 17:32 UTC
6 points
11 comments2 min readLW link

The al­ign­ment sta­bil­ity problem

Seth Herd26 Mar 2023 2:10 UTC
35 points
15 comments4 min readLW link

Para­con­sis­tent Tiling Agents (Very Early Draft)

IAFF-User-42 Apr 2015 7:27 UTC
8 points
5 comments1 min readLW link
(github.com)

Lec­ture Series on Tiling Agents #2

abramdemski20 Jan 2025 21:02 UTC
16 points
0 comments1 min readLW link

The Pando Prob­lem: Re­think­ing AI Individuality

Jan_Kulveit28 Mar 2025 21:03 UTC
127 points
13 comments13 min readLW link

Lec­ture Series on Tiling Agents

abramdemski14 Jan 2025 21:34 UTC
38 points
14 comments1 min readLW link

Vingean Reflec­tion: Open Problems

abramdemski3 Jul 2015 18:44 UTC
5 points
3 comments5 min readLW link

Higher Di­men­sion Carte­sian Ob­jects and Align­ing ‘Tiling Si­mu­la­tors’

lukemarks11 Jun 2023 0:13 UTC
22 points
0 comments5 min readLW link

Ra­tional Effec­tive Utopia & Nar­row Way There: Mul­tiver­sal AI Align­ment, Place AI, New Ethico­physics… (Up­dated)

ank11 Feb 2025 3:21 UTC
13 points
8 comments35 min readLW link
No comments.