You seem to assume that the world is indifferent to the agent’s goals. But if there is another agent, that may not be the case.
Let G1 be “tile the universe with still life” and G2 be “tile the upper half of the universe with still life”.
If agent A has goal G1, it will provably be destroyed by agent B; if A changes its goal to G2, then B will not interfere.
A and B have full information about the world’s state.
Should A modify its goal?
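A minimal sketch of the decision, assuming (hypothetically) that A scores outcomes by the fraction of the universe tiled with still life, i.e. by its original goal G1, and that B behaves exactly as described above:

```python
def fraction_tiled(goal: str) -> float:
    """Outcome if A keeps or adopts the given goal, per the scenario above."""
    if goal == "G1":
        return 0.0   # B provably destroys A, so nothing gets tiled
    if goal == "G2":
        return 0.5   # B does not interfere, so A tiles the upper half
    raise ValueError(f"unknown goal: {goal}")

# Evaluated by A's *original* goal G1 (maximize tiled fraction),
# self-modifying to pursue G2 is strictly better than keeping G1:
assert fraction_tiled("G2") > fraction_tiled("G1")
```

So by its own original lights, A should self-modify: half the universe tiled beats none. The function name and the 0.0/0.5 payoffs are illustrative assumptions, not part of the original model.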
Edit: Goal stability != value stability. So my point isn’t valid.
No, I don’t need that assumption. What conclusion in the post depends on it, in your opinion?
It’s an error on my part: I assumed that goal stability equals value stability. But then it seems it can be impossible to reconstruct an agent’s values given only its current state.
I’m afraid I still don’t understand your reasoning. How are “goals” different from “values”, in your terms?
A goal is what an agent optimizes for at a given point in time. A value is the initial goal of the agent (in your toy model, at least).
In my root post it seems to be optimal for agent A to self-modify into agent A’, which optimizes for G2; thus agent A’ succeeds in optimizing the world according to its values (the goal of agent A). But the original goal no longer influences its optimization procedure. So if we analyze agent A’ (without knowledge of the world’s history), we will be unable to infer its values (its original goal).
Yes, that seems to be correct.