Vladimir_Nesov comments on Simulators

Vladimir_Nesov 6 Sep 2022 15:44 UTC
23 points
13
Please don’t put ML opt-out strings on other people’s writings. They might want the Future to keep them around. The apparent intent is better conveyed by linking to instructions for doing this, rather than doing it unilaterally.
- Gunnar_Zarncke 6 Sep 2022 18:24 UTC
  3 points
  0
  Commenters seem to agree with you here, and I followed the recommendation by removing the code and adding instructions instead.
  But I wonder whether this convention means that I can’t use the code to prevent my comment from being added to a corpus. I think it would be better if comments were scraped separately. Does anybody know how the scraping works?
  - janus 7 Sep 2022 16:21 UTC
    3 points
    0
    Idk how others do it, but you can see how LW/AF/EAF comments are scraped for the alignment research dataset here (as you can see, we don’t check for the uuid).
    - Gunnar_Zarncke 8 Sep 2022 14:06 UTC
      2 points
      0
      Yeah, I guess it is a hopeless endeavor to hide things from web scrapers and, by extension, GPT-N.