Wei Dai comments on Would I think for ten thousand years?

Wei Dai 14 Feb 2019 20:48 UTC
LW: 3 AF: 2
0
AF

The others are mainly oral, with people coming up with plans that involve simulating humans for long periods of time, me doing the equivalent of saying “have you considered value drift” and (often) the reaction from the other revealing that no, they had not considered value drift.

Ah, value drift has been on my mind for so long that it’s surprising to me that people could be thinking about simulating humans for long periods of time without thinking about value drift. Thanks for the update!

The most important differences I foresee are the unforseen :-) I mean that seriously, because anything that is easy to foresee will possibly be patched before implementation.

I guess my perspective here is that pretty soon we’ll be forced to live in a real environment that will be quite alien / drift-inducing already, so maybe it wouldn’t be so hard to construct a virtual environment that would be better in comparison, so the risk-minimizing thing to do would be to put yourself in such an environment as soon as possible and then work on further risk reduction from there. (See this recent news as another sign pointing to that coming soon.)

Most of the simulation ideas do away with that.

Yeah I agree that getting the social aspect right is probably the hardest part, and we might need more than a small group of virtual humans to do that.

So what I’m mainly trying to say is that using simulations (or predictions about simulations) to do safety work is a difficult and subtle project, and needs to be thoroughly planned out with, at minimum, a lot of psychologists and some anthropologists. I think it can be done, but not glibly and not easily.

I think this framing makes sense.