Mech Interp Wiki Page and Why You Should Edit Wikipedia

TL;DR:
A couple months ago, we (Jo and Noah) wrote the first Wikipedia article on Mechanistic Interpretability. It was oddly missing despite Mech Interp’s visibility in alignment circles. We think Wikipedia is a top-of-funnel resource for journalists, policy staffers, and curious students, so filling that gap is cheap field-building. Seeing that the gap existed in one of the most important subfields in AIS, we suspect that there are probably many others. Below, we (1) list other alignment /​ EA topics that need pages or can be upgraded and (2) share some notes about editing Wikipedia. If you know the literature, an afternoon of edits may be surprisingly high-impact.

PS: We also think that there existing a wiki page for the field that one is working in increases one’s credibility to outsiders—i.e. if you tell someone that you’re working in AI Control, and the only pages linked are from LessWrong and Arxiv, this might not be a good look.

Wikipedia pages worth (re)writing

Below are topics with plentiful secondary literature yet their wiki pages are either missing or a stub.

Wikipedia Tips

  • If you want to work on a new page, discuss with the community first by going to the talk page of a related topic or meta-page.

  • When making substantive updates, prepare the update in your user sandbox first.

  • Attribute claims (Nature says x, or expert y says) rather than advocate (z is important because...).

  • Special:WantedPages lets you filter red-links for terms like “alignment”, “Goodhart”, “interpretability”.

  • In general, you shouldn’t post before you understand Wikipedia rules, norms, and guidelines. We recommend consulting the following pages:[1]

Moral of the post: Work on Wikipedia sites! It doesn’t take that long, there is plenty of low hanging fruit, and it could be great for field-building. At the very least, trying it seems like a good experiment.

  1. ^

    Thanks to gjm in the comments for recommending this edit.