Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
DanielFilan
Karma:
8,882
All
Posts
Comments
New
Top
Old
Page
1
AXRP Episode 46 - Tom Davidson on AI-enabled Coups
DanielFilan
7 Aug 2025 5:10 UTC
11
points
0
comments
68
min read
LW
link
AXRP Episode 45 - Samuel Albanie on DeepMind’s AGI Safety Approach
DanielFilan
6 Jul 2025 23:00 UTC
31
points
0
comments
40
min read
LW
link
AXRP Episode 44 - Peter Salib on AI Rights for Human Safety
DanielFilan
28 Jun 2025 1:40 UTC
12
points
0
comments
103
min read
LW
link
AXRP Episode 43 - David Lindner on Myopic Optimization with Non-myopic Approval
DanielFilan
15 Jun 2025 1:20 UTC
12
points
0
comments
56
min read
LW
link
AXRP Episode 42 - Owain Evans on LLM Psychology
DanielFilan
6 Jun 2025 20:20 UTC
13
points
0
comments
66
min read
LW
link
AXRP Episode 41 - Lee Sharkey on Attribution-based Parameter Decomposition
DanielFilan
3 Jun 2025 3:40 UTC
28
points
1
comment
61
min read
LW
link
Consider not donating under $100 to political candidates
DanielFilan
11 May 2025 3:20 UTC
138
points
32
comments
1
min read
LW
link
(danielfilan.com)
AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability
DanielFilan
28 Mar 2025 18:40 UTC
26
points
0
comments
89
min read
LW
link
AXRP Episode 38.8 - David Duvenaud on Sabotage Evaluations and the Post-AGI Future
DanielFilan
1 Mar 2025 1:20 UTC
13
points
0
comments
13
min read
LW
link
AXRP Episode 38.7 - Anthony Aguirre on the Future of Life Institute
DanielFilan
9 Feb 2025 1:10 UTC
10
points
0
comments
12
min read
LW
link
AXRP Episode 38.6 - Joel Lehman on Positive Visions of AI
DanielFilan
24 Jan 2025 23:00 UTC
10
points
0
comments
9
min read
LW
link
AXRP Episode 38.5 - Adrià Garriga-Alonso on Detecting AI Scheming
DanielFilan
20 Jan 2025 0:40 UTC
9
points
0
comments
16
min read
LW
link
MATS mentor selection
DanielFilan
and
Ryan Kidd
10 Jan 2025 3:12 UTC
44
points
12
comments
6
min read
LW
link
AXRP Episode 38.4 - Shakeel Hashim on AI Journalism
DanielFilan
5 Jan 2025 0:20 UTC
11
points
0
comments
12
min read
LW
link
AXRP Episode 38.3 - Erik Jenner on Learned Look-Ahead
DanielFilan
12 Dec 2024 5:40 UTC
20
points
0
comments
16
min read
LW
link
AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment
DanielFilan
1 Dec 2024 6:00 UTC
41
points
0
comments
67
min read
LW
link
AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory
DanielFilan
27 Nov 2024 6:30 UTC
34
points
0
comments
10
min read
LW
link
AXRP Episode 38.1 - Alan Chan on Agent Infrastructure
DanielFilan
16 Nov 2024 23:30 UTC
12
points
0
comments
14
min read
LW
link
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems
DanielFilan
14 Nov 2024 7:00 UTC
14
points
0
comments
12
min read
LW
link
MATS AI Safety Strategy Curriculum v2
DanielFilan
and
Ryan Kidd
7 Oct 2024 22:44 UTC
43
points
6
comments
13
min read
LW
link
Back to top
Next