It’s also quite plausible to me that carefully prompted language models, with a few dozen carefully explained examples and detailed instructions on the decision criteria, would do a good job at this specific moderation task. Less clear what the payoff period of such an investment would be so I’m not actually recommending it, but it’s an option worth considering IMO.
It’s also quite plausible to me that carefully prompted language models, with a few dozen carefully explained examples and detailed instructions on the decision criteria, would do a good job at this specific moderation task. Less clear what the payoff period of such an investment would be so I’m not actually recommending it, but it’s an option worth considering IMO.
Agree, I’ve said to the team that I think we could get some mileage out of this kind of thing.