Humans are (weak) evidence for the instrumental utility for mind-designers to design terminal-goal-construction mechanisms.
Evolution couldn’t directly encode IGF into humans. So what was it supposed to do? One answer would be to make us vibe-machines: You bop around, you satisfy your immediate needs, you hang out with people, etc. And that is sort of what you are. But also there are the Unreasonable, who think and plan long-term, who investigate secrets, who build things for 10 or 100 or 1000 years—why? Maybe it’s because having terminal-like goals (here meaning, aims that are fairly fixed and fairly ambitious) is so useful that you want to have them anyway even if you can’t make them be the right ones. Instead you build machines to guess / make up terminal goals (https://tsvibt.blogspot.com/2022/11/do-humans-derive-values-from-fictitious.html).
I think a more correct picture is that it’s useful to have programmable behavior, and then the programmable system suddenly becomes a Turing-complete weird machine. Some of the resulting programs are terminal-goal-oriented, and these are favored by selection pressures: terminal goals are self-preserving.
Humans in their native environment have programmable behavior in the form of social regulation, information exchange, and communicated instructions; if you add a sufficient amount of computational power to this system, you can get a very wide spectrum of behaviors.
I think this is the general picture of inner misalignment.
That seems like part of the picture, but far from all of it. Manufactured stone tools have been around for well over 2 million years. That’s the sort of thing you do when you already have a significant amount of “hold a weeks-long goal in mind long and strong enough that you put in a couple days’ effort towards it” (or something like that). Another example is Richard Alexander’s hypothesis: warfare --> strong pressure toward cognitive mechanisms for group-goal-construction. Neither of these is mainly about programmability (though the latter maybe somewhat is). I don’t think we see “random self-preserving terminal goals installed exogenously”; I think we see goals being self-constructed and then flung into long-termness.