I think it’s worth explicitly discussing the assumption that people won’t do “the dumbest possible thing”. It’s a reasonable assumption, but it’s probably a little more complicated than that. If alignment taxes are non-zero, there will be some pull between different motivations.
Yeah, it kinda depends on how small the alignment tax is. If it’s small but not 0, as I unfortunately suspect, then there is a small risk of extinction. I definitely plan to discuss that when I delete the post and reupload it.
Thanks for talking with me today!