Zac Hatfield-Dodds comments on LW Team is adjusting moderation policy

Zac Hatfield-Dodds 4 Apr 2023 22:55 UTC
9 points
4
It’s also quite plausible to me that carefully prompted language models, with a few dozen carefully explained examples and detailed instructions on the decision criteria, would do a good job at this specific moderation task. Less clear what the payoff period of such an investment would be so I’m not actually recommending it, but it’s an option worth considering IMO.
- Ruby 5 Apr 2023 0:26 UTC
  8 points
  0
  Parent
  Agree, I’ve said to the team that I think we could get some mileage out of this kind of thing.