Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
DanielFilan
Karma:
8,775
All
Posts
Comments
New
Top
Old
Page
1
Consider not donating under $100 to political candidates
DanielFilan
May 11, 2025, 3:20 AM
130
points
31
comments
1
min read
LW
link
(danielfilan.com)
AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability
DanielFilan
Mar 28, 2025, 6:40 PM
23
points
0
comments
89
min read
LW
link
AXRP Episode 38.8 - David Duvenaud on Sabotage Evaluations and the Post-AGI Future
DanielFilan
Mar 1, 2025, 1:20 AM
13
points
0
comments
13
min read
LW
link
AXRP Episode 38.7 - Anthony Aguirre on the Future of Life Institute
DanielFilan
Feb 9, 2025, 1:10 AM
10
points
0
comments
12
min read
LW
link
AXRP Episode 38.6 - Joel Lehman on Positive Visions of AI
DanielFilan
Jan 24, 2025, 11:00 PM
10
points
0
comments
9
min read
LW
link
AXRP Episode 38.5 - Adrià Garriga-Alonso on Detecting AI Scheming
DanielFilan
Jan 20, 2025, 12:40 AM
9
points
0
comments
16
min read
LW
link
MATS mentor selection
DanielFilan
and
Ryan Kidd
Jan 10, 2025, 3:12 AM
44
points
12
comments
6
min read
LW
link
AXRP Episode 38.4 - Shakeel Hashim on AI Journalism
DanielFilan
Jan 5, 2025, 12:20 AM
11
points
0
comments
12
min read
LW
link
AXRP Episode 38.3 - Erik Jenner on Learned Look-Ahead
DanielFilan
Dec 12, 2024, 5:40 AM
20
points
0
comments
16
min read
LW
link
AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment
DanielFilan
Dec 1, 2024, 6:00 AM
41
points
0
comments
67
min read
LW
link
AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory
DanielFilan
Nov 27, 2024, 6:30 AM
34
points
0
comments
10
min read
LW
link
AXRP Episode 38.1 - Alan Chan on Agent Infrastructure
DanielFilan
Nov 16, 2024, 11:30 PM
12
points
0
comments
14
min read
LW
link
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems
DanielFilan
Nov 14, 2024, 7:00 AM
14
points
0
comments
12
min read
LW
link
MATS AI Safety Strategy Curriculum v2
DanielFilan
and
Ryan Kidd
Oct 7, 2024, 10:44 PM
43
points
6
comments
13
min read
LW
link
AXRP Episode 37 - Jaime Sevilla on Forecasting AI
DanielFilan
Oct 4, 2024, 9:00 PM
21
points
3
comments
56
min read
LW
link
AXRP Episode 36 - Adam Shai and Paul Riechers on Computational Mechanics
DanielFilan
Sep 29, 2024, 5:50 AM
25
points
0
comments
55
min read
LW
link
AXRP Episode 35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization
DanielFilan
Aug 24, 2024, 10:30 PM
21
points
0
comments
74
min read
LW
link
AXRP Episode 34 - AI Evaluations with Beth Barnes
DanielFilan
Jul 28, 2024, 3:30 AM
23
points
0
comments
69
min read
LW
link
Why keep a diary, and why wish for large language models
DanielFilan
Jun 14, 2024, 4:10 PM
9
points
1
comment
2
min read
LW
link
(danielfilan.com)
AXRP Episode 33 - RLHF Problems with Scott Emmons
DanielFilan
Jun 12, 2024, 3:30 AM
34
points
0
comments
56
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel