deep comments on the void

deep 24 Jul 2025 15:33 UTC
3 points
0
We’re launching an “AI psychiatry” team as part of interpretability efforts at Anthropic! We’ll be researching phenomena like model personas, motivations, and situational awareness, and how they lead to spooky/unhinged behaviors. (x)
“making up types of guy” research is a go?
They’re hiring; you might be great for this.