London L. comments on Why we are excited about confession!

London L. 12 Mar 2026 7:18 UTC
1 point
0
Nice to see that OpenAI is indeed working on this—I’ve seen a few blog posts over the last few days that help alleviate my concerns about spoofed confessions:
1. https://openai.com/index/instruction-hierarchy-challenge/
2. https://openai.com/index/designing-agents-to-resist-prompt-injection/

P.S. As a former student who shopped CS121, it’s wonderful to be able to hear how you’re approaching these problems; I really appreciate you posting on public forums and responding to feedback.