evhub

Karma: 14,536

Evan Hubinger (he/him/his) (evanjhub@gmail.com)

Head of Alignment Stress-Testing at Anthropic. My posts and comments are my own and do not represent Anthropic’s positions, policies, strategies, or opinions.

Previously: MIRI, OpenAI

See: “Why I’m joining Anthropic”

Selected work:

Building and evaluating alignment auditing agents

Jul 24, 2025, 7:22 PM
44 points
1 comment
5 min read
LW link

Agentic Misalignment: How LLMs Could be Insider Threats

Jun 20, 2025, 10:34 PM
72 points
12 comments
6 min read
LW link