We’re launching an “AI psychiatry” team as part of interpretability efforts at Anthropic! We’ll be researching phenomena like model personas, motivations, and situational awareness, and how they lead to spooky/unhinged behaviors. (x)
“making up types of guy” research is a go?
They’re hiring; you might be great for this.