Commentless downvoting is not a good way to fight infohazards

Last night, I posted a question seeking advice on publishing an AI research project, using a program that generates causal DAGs in an LLM-readable format along with test questions designed to evaluate the LLM’s ability to do causal inference. As a hobbyist, I am not too worried about whether it’s novel or interesting research, although it would be good to know. My main concern is that it might be infohazardous research, and my goal was to get some advice on whether or not others think it might be.

Unfortunately, it appears that the response was to downvote it without commenting or PMing me to explain why. If I posted on other topics and got downvoted, I normally wouldn’t worry about it—it might mean others found my post wrong, annoying, offensive, or contrary to ingroup common sense, and I would then have to decide what to do next in light of those possibilities.

But in this specific case, it means that the possibility that my research project is seen as infohazardous and is being downvoted for that reason is mixed in with all those other possibilities. If it’s being downvoted on infohazard grounds, it deprives me of an opportunity to learn why others think as they do. If it’s being downvoted on other grounds, then I’m left in a state where I have to make a judgment call on whether to publish a topic of interest to me despite a few downvotes, or whether to second-guess and silence myself.

My personal belief is that, unless you have good reason to think a topic is infohazardous, you should go ahead and publish—there are far too many examples of politically motivated or simply reactive people silencing things they disagree with on trumped-up charges of infohazards, even if not expressed using that specific term, to silence oneself without good reason. Even if LessWrong would downvote the final product as well, there are plenty of other outlets. So me posting this question here on LessWrong is me giving this community a privileged opportunity to weigh in on my publishing behavior. When I receive downvotes and no comments or PMs in exchange for doing that, it makes me feel like in the future, I should just publish what I want in an outlet I expect to be receptive to it.

So I am going to give some advice, and announce my publishing policy on infohazards going forward.

  1. If AI safety researchers feel they have some skill in evaluating potential research for infohazardous content and how to manage it, they should publicly offer to evaluate research proposals and make it easy to submit and get a constructive, explicit evaluation via a private message.

  2. If nobody is willing to perform this service, then a second-best alternative is for people to publish and debate self-evaluation guidelines.

  3. If nobody is willing to publish such guidelines, then when somebody like myself asks for advice, that advice should be delivered explicitly in a comment or PM, and the karma reaction should be neutral-to-positive unless there’s a clear reason to downvote it—in which case that reason should be communicated explicitly to the poster. Otherwise, this creates a disincentive to ask for advice about potential infohazards.

  4. In general, if you are a believer in a principle of conformity to prevent the unilateralist’s curse and in the need to take infohazards seriously, and if you believe you are capable of judging infohazardous content, then you have an ethical obligation to find a transparent and informative way to help the authors of the content you are judging improve their own models about infohazards, and to minimize the level of silencing that you impose in your quest to reduce infohazards. Those who cannot or will not accept that obligation have no business stepping into the role of evaluating or enforcing policy against infohazards.

First, I am once again requesting explicit feedback in comment or PM about the research project I linked above. If I don’t receive any infohazard-related feedback, I will find an outlet to publish it when I’m done. If I do receive it, I will consider it thoughtfully and consider whether or not it should modify or cancel my decision to publish. If I receive net downvotes on this post here and insufficient comments explaining their reasoning, or if that reasoning and the downvotes seem excessive, harsh, and nonconstructive, then I’ll stop asking LessWrong for feedback about infohazards in the future and make my own independent decisions about whether and where to publish.

Note that although I’m quite defensive about potential harsh or negative reactions to this post, and although this post contains an ultimatum, I’m very receptive to a constructive and friendly debate on any of these points. I don’t feel any urgency about publishing my project, and I’d much rather have my final decision be the result of a debate carried through to consensus, or at least crux-identification, than to unilaterally take action.