Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
gasteigerjo
Karma:
243
Working on Alignment Science at Anthropic
All
Posts
Comments
New
Top
Old
AI Safety at the Frontier: Paper Highlights, August ’25
gasteigerjo
2 Sep 2025 20:29 UTC
12
points
0
comments
7
min read
LW
link
(open.substack.com)
AI Safety at the Frontier: Paper Highlights, July ’25
gasteigerjo
10 Aug 2025 12:49 UTC
7
points
0
comments
9
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, June ’25
gasteigerjo
7 Jul 2025 18:17 UTC
4
points
0
comments
7
min read
LW
link
(open.substack.com)
AI Safety at the Frontier: Paper Highlights, May ’25
gasteigerjo
17 Jun 2025 17:16 UTC
6
points
0
comments
8
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, April ’25
gasteigerjo
6 May 2025 14:22 UTC
4
points
0
comments
7
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, March ’25
gasteigerjo
7 Apr 2025 20:17 UTC
9
points
0
comments
9
min read
LW
link
(aisafetyfrontier.substack.com)
Automated Researchers Can Subtly Sandbag
gasteigerjo
,
Akbir Khan
,
Sam Bowman
,
Vlad Mikulik
,
Ethan Perez
and
Fabien Roger
26 Mar 2025 19:13 UTC
44
points
0
comments
4
min read
LW
link
(alignment.anthropic.com)
AI Safety at the Frontier: Paper Highlights, February ’25
gasteigerjo
3 Mar 2025 22:09 UTC
7
points
0
comments
7
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, January ’25
gasteigerjo
11 Feb 2025 16:14 UTC
7
points
0
comments
8
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, December ’24
gasteigerjo
11 Jan 2025 22:54 UTC
7
points
2
comments
7
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, November ’24
gasteigerjo
7 Dec 2024 19:15 UTC
7
points
0
comments
8
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, October ’24
gasteigerjo
31 Oct 2024 0:09 UTC
3
points
0
comments
9
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, September ’24
gasteigerjo
2 Oct 2024 9:49 UTC
13
points
0
comments
7
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, August ’24
gasteigerjo
3 Sep 2024 19:17 UTC
28
points
0
comments
6
min read
LW
link
(aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, July ’24
gasteigerjo
5 Aug 2024 13:00 UTC
8
points
0
comments
7
min read
LW
link
(aisafetyfrontier.substack.com)
Discussion: Challenges with Unsupervised LLM Knowledge Discovery
Seb Farquhar
,
Vikrant Varma
,
zac_kenton
,
gasteigerjo
,
Vlad Mikulik
and
Rohin Shah
18 Dec 2023 11:58 UTC
149
points
21
comments
10
min read
LW
link
Back to top