Archive Sequences About Log In Questions Events Shortform Alignment Forum Home Featured All Tags
evhub Karma: 14,746
Evan Hubinger (he/him/his) (evanjhub@gmail.com )
Head of Alignment Stress-Testing at Anthropic . My posts and comments are my own and do not represent Anthropic’s positions, policies, strategies, or opinions.
Previously: MIRI, OpenAI
See: “Why I’m joining Anthropic ”
Selected work:
All Posts Comments New Top Old Page 124 Jul 2025 19:22 UTC 47 points
5 min read LW link 20 Jun 2025 22:34 UTC 78 points
6 min read LW link Sam Marks ,
Johannes Treutlein ,
dmz ,
Sam Bowman ,
Hoagy ,
Carson Denison ,
Kei ,
7vik ,
Akbir Khan ,
Austin Meek ,
Euan Ong ,
Christopher Olah ,
Fabien Roger ,
jeanne_ ,
Meg ,
Drake Thomas ,
Adam Jermyn ,
Monte M and
evhub 13 Mar 2025 19:18 UTC 141 points
13 min read LW link 21 Jan 2025 21:32 UTC 131 points
2 min read LW link (alignment.anthropic.com)
18 Dec 2024 17:19 UTC 489 points
10 min read LW link 18 Oct 2024 22:33 UTC 95 points
6 min read LW link (assets.anthropic.com)
4 Sep 2024 15:50 UTC 19 points
3 min read LW link 17 Jun 2024 18:41 UTC 163 points
8 min read LW link (arxiv.org)
28 May 2024 16:33 UTC 81 points
21 min read LW link 6 May 2024 7:07 UTC 95 points
1 min read LW link (arxiv.org)
23 Apr 2024 21:10 UTC 133 points
1 min read LW link (www.anthropic.com)
19 Apr 2024 20:00 UTC 38 points
16 min read LW link 6 Apr 2024 8:46 UTC 20 points
7 min read LW link 12 Jan 2024 19:51 UTC 305 points
3 min read LW link (arxiv.org)
2 Jan 2024 0:47 UTC 125 points
8 min read LW link (arxiv.org)
Back to top Next