Things that can go wrong with decomposed modules

Some thoughts on what could go wrong with the decomposed modules that Eric Drexler is currently analysing:

There are systems that are unsafe if:

a) Every module is a superintelligent agent. b) One module is a superintelligent agent. c) No modules are superintelligent agents, but their input-output behaviour is not fully understood. d) No modules are superintelligent agents, but their input-output behaviour are fully understood.

Now, d) seems unrealistic; my main objections are relevant to b), but they can shade into c) too.

Consider a system withe at least two modules, module P which produces plans for bridges, and module C which checks these plans. Because of imperfections in P, it is not only motivated to produce the best bridge plans it can, but it’s also motivated to have C accept the plans.

Except that C is on a timer, and, if it can’t check the plans within a set time, it rejects them. The complicated plans that P is motivated to produce are likely to be rejected on these grounds. Therefore P is not only motivated to produce complicated plans, but to “phrase” them in ways that C can quickly parse. Maybe including executive summaries, or pre-computing part of what C needs to do, and so on.

How would it acquire the ability to do so? Well, if we assume b), then P is already a superintetelligent agent, so that’s easy. If not, it might acquire knowledge of C through testing some plans with it. Or there might be a module C’ in the system, that also checks plans, but in a way that P doesn’t care about. If P has access to C’, it can rapidly test a lot of plans with it, and pass those on to C.

And, of course, there is but one step from “making your plans much easier to check” to “fooling the checker into accepting an imperfect plan”.

A last element that relevant to this is that modules may come to realise the capabilities of other modules, and thus gain them by passing that message on. If module A can demonstrate to module B that a certain message came from module C, then module A effectively has the capabilities of C, and the fact that C and B are not connected ceases to matter.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer