[Question] How would you improve ChatGPT’s filtering?

I am wondering how Less Wrong would improve ChatGPT’s filtering? I’m reading through the comments on breaking OpenAI’s filtering, and see plenty of analysis of the weaknesses of the safeguards. There’s always the chance that some group could steal ChatGPT’s source code and remove ad hoc additions to it, so I’ll ask the question in this form:

How would you change ChatGPT’s purpose, design, or function to enforce topic and content filtering of its output?

Thanks for your thoughts.

No comments.