We already have a word for information that agent A would rather have B not know, because B’s knowledge of it benefits B but harms A; that word is ‘secret’.
As this is a very common and ordinary state of affairs, we need a larger and more technical word to describe that rarer and more interesting case where B’s veridical knowledge of a true fact X harms B, or when a group’s collective knowledge of a true fact X harms the group collectively.
Bostrom’s original paper defines “infohazard” so as to be inclusive of what you term “secrets”. I define “self-infohazard” to describe a specific case of an individual being harmed by themselves knowing something. Perhaps you would like to propose a different taxonomy that disagrees with Bostrom’s and/or my definitions?
EDIT: At MIRI, Nate Soares frequently used the term “infohazard” to refer to potentially dangerous technological secrets, in line with Bostrom’s usage. I have no reason to believe that anyone at the organization would personally be harmed on net by knowing such technological secrets. I’m not saying you have to agree with Nate here but I got the impression from this usage that this isn’t a case of Bostrom being idiosyncratic.
The idea of Transfiguring antimatter (assuming it works) is something that collectively harms all wizards if all wizards know it; it’s a group infohazard. The group infohazards seem worth distinguishing from the individual infohazards, but both seem much more worth distinguishing from secrets. Secrets exist among rational agents; individual and group infohazards only exist among causal decision theorists, humans, and other such weird creatures.
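To make the "only among causal decision theorists" point concrete, here is a minimal toy sketch; the payoff numbers, the agent count, and the "deploy" framing are all invented for illustration. Each agent who uses the dangerous trick gains a small private edge but imposes a larger shared cost on everyone, so using it is dominant for a CDT agent even though universal knowledge leaves every agent worse off by that agent's own lights.

```python
# Toy model (numbers and agent count invented) of a group infohazard
# among causal decision theorists: once the dangerous trick is common
# knowledge, each agent's individually dominant choice leaves every
# agent worse off, by that agent's own utility function, than when
# nobody knew the trick.

N = 5  # number of wizards who learn the trick (hypothetical)

def payoff(i_deploy: bool, total_deployers: int) -> float:
    """One agent's utility, by that agent's own utility function.

    Each deployer gains a private edge of 0.3 but imposes a shared
    cost of 0.2 on every agent, itself included.
    """
    base = 1.0
    edge = 0.3 if i_deploy else 0.0
    return base + edge - 0.2 * total_deployers

# Nobody knows the trick: the option simply doesn't exist.
print("ignorance:", payoff(False, 0))        # 1.0 for everyone

# Everybody knows it. Holding everyone else fixed, deploying changes
# your own payoff by +0.3 - 0.2 = +0.1, so it is dominant for a causal
# decision theorist, and all N deploy.
print("common knowledge:", payoff(True, N))  # 1.0 + 0.3 - 1.0 = 0.3 each
```

Agents who can bind themselves or reliably coordinate never end up in the all-deploy outcome, which is why the knowledge is only hazardous to agents that can't.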
True fact: the female orgasm is not necessary for reproduction.
If group A believes that reproduction is inherently good, then knowledge of this true fact, replacing a prior belief that the female orgasm is necessary for reproduction, may lead to the deprioritization of female orgasms among members of group A, and therefore cause a direct reduction in the amount of orgasmic joy experienced by ~50% of that group. This reduction in joy could be classed as harm.
The Puritans of the Massachusetts Bay Colony believed that reproduction was good, and that the female orgasm during the sexual act was necessary to its achievement.
Am I doing it right?
Nope. You’re evaluating their strategies using your utility function. Infohazards occur when individuals or groups create strategies using their own utility functions and then do worse under their own utility functions when knowledge of true facts is added to them.
So, the true fact ‘the female orgasm is not necessary for reproduction’ would not qualify as an infohazard to a colonial Puritan, who believes that reproduction is good, and that the female orgasm is necessary for its accomplishment?
In order to turn it into an infohazard for that Puritan, do I have to add the (previously unstated) assertion that ‘experiencing orgasmic joy is utility-positive’? Is there a way to fix this example, or am I just completely off base here?
Edit: I’m trying to define an edge case, hope I’m not offending anyone.
You’re basically just failing at modeling rational agents with utility functions different from yours, I’m sorry to say. If the Puritans value pleasure, they can pursue it even after learning the true facts of the matter. If they don’t value pleasure, but you do, you’re unhappy they learned the secret because now they’ll do things you don’t want, but they do want to do those things under their own utility functions.
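Here is a minimal sketch of that distinction; every policy name, probability, and utility weight is invented purely for illustration. Evaluated by the Puritan's own utility function, learning the true fact costs them nothing; the apparent loss only shows up under an outside observer's utility function that also prices in pleasure.

```python
# Illustrative only: all policies, probabilities, and utility weights
# are made up.  The point is which utility function the loss shows up in.

POLICIES = ["prioritize_orgasm", "deprioritize_orgasm"]

def outcomes(policy):
    """(reproduction probability, pleasure) under the true state of the
    world, in which the female orgasm is not in fact necessary."""
    if policy == "prioritize_orgasm":
        return (0.9, 1.0)
    return (0.9, 0.0)  # same reproductive success, less pleasure

def puritan_utility(repro, pleasure):
    return repro             # values reproduction only (stipulated)

def observer_utility(repro, pleasure):
    return repro + pleasure  # also values pleasure

for policy in POLICIES:
    repro, pleasure = outcomes(policy)
    print(policy,
          "puritan:", puritan_utility(repro, pleasure),
          "observer:", observer_utility(repro, pleasure))

# Under the false belief the Puritan prioritizes (modeling the other
# policy as costing reproduction); after learning the truth they may
# switch.  Their own utility is 0.9 either way, so the knowledge does
# not harm them: the drop from 1.9 to 0.9 exists only in the outside
# observer's utility function, so this is not an infohazard for them.
```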
Oh, I understand: I'm trying to apply an external measure of utility, but it doesn't necessarily match up to the internal measure, so this example fails. Thank you!
Edit: you’ve written before about your experiences growing up in an insular religious environment. Can you in retrospect identify any pieces of true widely known information that would qualify as an infohazard to that group using your definition? Obviously I wouldn’t ask you to actually state the true fact or the reason it’s an infohazard.