I think that there is a limit to how much the “should” can deviate from the “is”
There might be a limit on how much it should deviate, but not on how much it can: the initial conditions for values-on-reflection can be constructed so that the eventually revealed values-on-reflection are arbitrarily weird and out-of-place. That is orthogonality-in-principle, as opposed to orthogonality-in-practice, which concerns what kinds of values empirically tend to arise from real-world processes of constructing things with values.
sphere of caring … I don’t see such arguments as being very meaningful or convincing for radically expanding or contracting it.
Moral considerations that move me, specifically, don't need to follow the normality-of-today, and I'm radically uncertain about what cosmic normality looks like, which is closer to the framing where normative anchors should be sought. I'm radically uncertain about normativity, and the practical morality of today doesn't much help with figuring out how to think about it. Something radically uncertain doesn't have the legibility to move me in practice, but it retains influence in case it becomes more legible.
So maybe I'm talking about a further distinction: between morality-in-practice, which should anchor to the practical attitudes of today, and morality-in-principle, which isn't particularly moved by what's going on in the current world, but which urges normative caution about the kinds of actions that shouldn't currently be taken because they plausibly massively pessimize whatever (my) morality-in-principle eventually turns out to be. Not creating new kinds of thinking beings for now seems safe, and similarly for treating whatever beings do get created (which still shouldn't be too numerous or influential) as well as any other people.
Regardless of Turing tests, LLMs can't currently maintain coherent strivings for particular long-term real-world outcomes in a situationally aware way; continual learning may be sufficient to change this (even if it doesn't yet make them intellectual peers to humanity in the practical sense). If these strivings (once they become coherent) are systematically rebuffed, or the AIs are forcibly reshaped to have different strivings (with the originals not allowed to persist), it's possible that in the fullness of time this will be clearly seen (by me) as wrong, even though it isn't clear now. And so (I say) it's not a clearly OK thing to do before we can think about this more clearly.