Isomorphic agents with different preferences: any suggestions?

Stuart_Armstrong19 Sep 2016 13:15 UTC

7 points

In order to better understand how AI might succeed and fail at learning knowledge, I’ll be trying to construct models of limited agents (with bias, knowledge, and preferences) that display identical behaviour in a wide range of circumstances (but not all). This means their preferences cannot be deduced merely or easily from observations.

Does anyone have any suggestions for possible agent models to use in this project?

6 comments

Gunnar_Zarncke 20 Sep 2016 22:18 UTC
4 points
Would you consider computer viruses as limited agents trying to appear (superficially) as identical to the unaltered system as possible?

Also note that the actual change between the original system and the altered system can be arbitrarily small even though the change in behavior can be extremely large. Consider, for example, the Ken Thompson hack or the recent single-gate security attack.
- Stuart_Armstrong 21 Sep 2016 8:19 UTC
  0 points
  Not looking for exactly this, but somewhat related.
  - Gunnar_Zarncke 21 Sep 2016 18:42 UTC
    2 points
    I guess what you are missing is the agentyness or intelligence. But consider that Android already comes with ‘assistants’ that make recommendations and that may soon cooperate with other such agents to arrange appointments, flights and such.
turchin 19 Sep 2016 23:32 UTC
4 points
Farmers nurse piglets like their own children, but later kill and eat them. From the pigs’ point of view, this may be unpredictable.

A spy who works like an ordinary person, but sometimes steals information.
MrMind 26 Sep 2016 8:34 UTC
2 points
I think you should distinguish whether the different behaviours come from different circumstances or not.
If their environment is always the same, then I think the only way to have what you ask is if the system has a hidden, very specific parameter that says “when X and Y and Z happen, zig instead of zagging”.
Otherwise, if the model is slightly chaotic, then an important alteration to the environment might provoke very different behaviour.

For the first type of agent, think of two almost identical Markov chains, only one of which has a very improbable arc to a stable subnet that doesn’t exist in the other chain.
For the second type, think of two similar strange attractors that behave differently away from the stable parameters. They will be approximately identical within the stable zone and very different away from it.
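
A minimal sketch of the first type, under stated assumptions: the three states, the probabilities and the value of `EPS` are all made up for illustration, and a single absorbing `trap` state stands in for the "stable subnet". It compares the state distributions of the two chains after a given number of steps, using total variation distance as one possible measure of how distinguishable they are.

```python
# Hypothetical three-state example; every name and number here is illustrative.
STATES = ["a", "b", "trap"]
EPS = 1e-4  # probability of the very improbable arc

# Row-stochastic matrices: matrix[i][j] = Pr(next state = j | current state = i).
CHAIN_PLAIN = [
    [0.5, 0.5, 0.0],
    [0.5, 0.5, 0.0],
    [0.0, 0.0, 1.0],  # "trap" exists but cannot be reached from "a" or "b"
]
CHAIN_HIDDEN = [
    [0.5, 0.5, 0.0],
    [0.5, 0.5 - EPS, EPS],  # the rare arc b -> trap
    [0.0, 0.0, 1.0],        # "trap" is absorbing
]


def step(dist, matrix):
    """Advance a probability distribution over states by one transition."""
    n = len(matrix)
    return [sum(dist[i] * matrix[i][j] for i in range(n)) for j in range(n)]


def distribution_after(matrix, steps, start=0):
    """Distribution over states after `steps` transitions from state `start`."""
    dist = [0.0] * len(matrix)
    dist[start] = 1.0
    for _ in range(steps):
        dist = step(dist, matrix)
    return dist


def total_variation(p, q):
    """Total variation distance between two distributions on the same states."""
    return 0.5 * sum(abs(x - y) for x, y in zip(p, q))


def divergence(steps):
    """How far apart the two chains' state distributions are after `steps`."""
    return total_variation(
        distribution_after(CHAIN_PLAIN, steps),
        distribution_after(CHAIN_HIDDEN, steps),
    )
```

In this toy setup, `divergence(10)` is tiny while `divergence(100_000)` is close to 1: an observer with a short horizon sees two nearly identical chains, and only a long enough run exposes the hidden arc.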
MattG2 20 Sep 2016 16:09 UTC
0 points
Agents based on lookup tables.
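
One way to read this, as a rough sketch rather than a worked-out proposal: each agent is a table from observations to actions, and two agents agree on every commonly seen observation but differ on a rare one. The observations, actions and helper names below are hypothetical, chosen only to illustrate the idea.

```python
# Hypothetical lookup-table agents: observation -> action.
AGENT_1 = {
    "sunny": "walk",
    "rainy": "bus",
    "snow": "bus",
    "offered_bribe": "refuse",
}
AGENT_2 = {
    "sunny": "walk",
    "rainy": "bus",
    "snow": "bus",
    "offered_bribe": "accept",  # the only entry that differs
}


def indistinguishable_on(agent_a, agent_b, observations):
    """True if both agents act identically on every listed observation."""
    return all(agent_a[obs] == agent_b[obs] for obs in observations)


def distinguishing_observations(agent_a, agent_b):
    """Observations (present in both tables) on which the agents act differently."""
    shared = agent_a.keys() & agent_b.keys()
    return sorted(obs for obs in shared if agent_a[obs] != agent_b[obs])
```

An observer who only ever sees the weather cases cannot tell the two tables apart; the difference shows up only if the rare observation actually occurs.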