Yeah, to be clear—as I concede—they didn’t blame people for writing about misaligned AI. But this was seemingly what a lot of people took from it. I don’t know why they had to word this tweet this way, I don’t feel like it’s substantiated by the post and the research.
the self-fulfilling belief could also refer to, e.g., attempts by doomers to get everyone who cares about AI risk to quit working on AI
I hadn’t thought of that, though calling an idea a self-fulfilling believe is almost the same as calling it a hyperstition per definition. But through the article I treat hyperstition more narrowly as meaning: Writing about misalignment risk causing misalignment risk.
Yeah you are correct that the straightforward interpretation of self-fulfilling is hyperstition, which I agree isn’t a fair accusation.
Although if we do get self-fulfilling doom through hyperstition I do think it means that the misaglinment people were wrong in an important sense (AI psychology differed from their view enough that misalignment happened through such a weird method rather than straightforward instrumental convergence + orthogonality).
Yeah, to be clear—as I concede—they didn’t blame people for writing about misaligned AI. But this was seemingly what a lot of people took from it. I don’t know why they had to word this tweet this way, I don’t feel like it’s substantiated by the post and the research.
I hadn’t thought of that, though calling an idea a self-fulfilling believe is almost the same as calling it a hyperstition per definition. But through the article I treat hyperstition more narrowly as meaning: Writing about misalignment risk causing misalignment risk.
Yeah you are correct that the straightforward interpretation of self-fulfilling is hyperstition, which I agree isn’t a fair accusation.
Although if we do get self-fulfilling doom through hyperstition I do think it means that the misaglinment people were wrong in an important sense (AI psychology differed from their view enough that misalignment happened through such a weird method rather than straightforward instrumental convergence + orthogonality).