williawa comments on williawa’s Shortform

williawa 13 Oct 2025 8:14 UTC
1 point
0
I don’t quite see your point. If this is genuinely what you want, the AI would allow that process to unfold.
- Vladimir_Nesov 13 Oct 2025 14:32 UTC
  2 points
  0
  Parent
  The thread is about preference, perhaps utility functions, so it’s not about concrete wishes. A utility function is data for consistently comparing events, certain subsets of a sample space, by assigning them an expected utility. Long reflection then is a process for producing this data. Rather than something this data is ex ante supposed to be about, the process is the instrumental means of producing the preference about in general something else.
  
  My argument was making two points: that any immediate wants are not the preference, and that the process defining the preference would itself have moral significance. So there is a circularity to this construction, defining preference benefits from already having some access to it, to better structure the process that defines it. Resolving this circularity is a step that could benefit from efforts of a superintelligence. But the process still retains the character of lived experience rather than clinical calculation, so speculations about better structures of hypothetical societies remain relevant for defining extrapolated volition.
  - williawa 13 Oct 2025 15:07 UTC
    1 point
    0
    Parent
    I don’t think this makes sense.
    My argument was making two points: that any immediate wants are not the preference [...]
    Right now you’d want the ASI to maximize your preferences, even though those preferences are not yet legible/knowable to the AI (or yourself). The AI knows this, so it will allow those preferences to develop (taking for granted that that’s the only way the AI can learn them, without violating some other things you currently want, like the moral worth of the simulated entities a potential attempt at shortcutting the unfolding might create)
    Like, right now you have wants the define your preferences (not object level, but define the process that would lead to your preferences being developed, and which constraints that unfolding needs to be subject to). If the AI optimizes for this, it will lead to the preferences being optimized for later.
    And it will be able to do this, because this is what you currently want, and the premise is that we can get the AI to do what you want.