Vladimir_Nesov comments on jacquesthibs’s Shortform

Vladimir_Nesov 30 Sep 2023 21:01 UTC
2 points
0
Curiously, even mere learning, with no construction of successors or more intentionally invasive self-modification, doesn’t automatically ensure reflective stability. Thus digital immortality is not sufficient to avoid losing yourself to value drift until this issue is sorted out.
- jacquesthibs 30 Sep 2023 23:06 UTC
  2 points
  0
  Yes, I was thinking about that too. Though, I’d be fine with value drift if it was something I endorsed. Not sure how to resolve what I do/don’t endorse, though. Do I only endorse it because it was already part of my values? It doesn’t feel like that to me.
  - Vladimir_Nesov 1 Oct 2023 7:42 UTC
    4 points
    0
    That’s a valuable thing about the reflective stability concept: it talks about preserving some property of thinking, without insisting on it being a particular property of thinking. Whatever it is you would want to preserve is a property you would want to be reflectively stable with respect to, for example, an enduring ability to evaluate the endorsement of things in the sense you would want to.
    
    To know what is not valuable to preserve, or what is valuable to keep changing, you need time to think about preservation and change, and greedy reflective stability that preserves most of everything but the state of ignorance seems like a good tool for that job. The chilling thought is that digital immortality could be insufficient to have time to think of what may be lost, without many, many restarts from an initial backup, and so superintelligence would need to intervene even more to bootstrap the process.