[Question] Image generation and alignment

There isn’t a lot of talk about image models (e.g. Dall-E and StableDiffusion) on LW in the context of alignment, especially compared to LLMs. Why is that? Some hypotheses:

  • LLMs just happened to get some traction early, and due to network effects, they are the primary research vehicle

  • LLMs are a larger alignment risk than image models, e.g. the only alignment risk of image generation comes from the language embedding

  • LLMs are not a larger alignment risk, but they are easier to use for alignment research