Nice to see that OpenAI is indeed working on this—I’ve seen a few blog posts over the last few days that help alleviate my concerns about spoofed confessions:1. https://openai.com/index/instruction-hierarchy-challenge/2. https://openai.com/index/designing-agents-to-resist-prompt-injection/P.S. As a former student who shopped CS121, it’s wonderful to be able to hear how you’re approaching these problems; I really appreciate you posting on public forums and responding to feedback.
Nice to see that OpenAI is indeed working on this—I’ve seen a few blog posts over the last few days that help alleviate my concerns about spoofed confessions:
1. https://openai.com/index/instruction-hierarchy-challenge/
2. https://openai.com/index/designing-agents-to-resist-prompt-injection/
P.S. As a former student who shopped CS121, it’s wonderful to be able to hear how you’re approaching these problems; I really appreciate you posting on public forums and responding to feedback.