DanielFilan
Karma: 8,778
Page 3
AXRP Episode 24 - Superalignment with Jan Leike
DanielFilan | Jul 27, 2023, 4:00 AM | 55 points | 3 comments | 69 min read
AXRP Episode 23 - Mechanistic Anomaly Detection with Mark Xu
DanielFilan | Jul 27, 2023, 1:50 AM | 22 points | 0 comments | 72 min read
AXRP announcement: Survey, Store Closing, Patreon
DanielFilan | Jun 28, 2023, 11:40 PM | 14 points | 0 comments | 1 min read
AXRP Episode 22 - Shard Theory with Quintin Pope
DanielFilan | Jun 15, 2023, 7:00 PM | 52 points | 11 comments | 93 min read
[Linkpost] Interpretability Dreams
DanielFilan | May 24, 2023, 9:08 PM | 39 points | 2 comments | 2 min read | (transformer-circuits.pub)
Difficulties in making powerful aligned AI
DanielFilan | May 14, 2023, 8:50 PM | 41 points | 1 comment | 10 min read | (danielfilan.com)
AXRP Episode 21 - Interpretability for Engineers with Stephen Casper
DanielFilan | May 2, 2023, 12:50 AM | 12 points | 1 comment | 66 min read
Podcast with Divia Eden and Ronny Fernandez on the strong orthogonality thesis
DanielFilan | Apr 28, 2023, 1:30 AM | 18 points | 1 comment | 1 min read | (youtu.be)
AXRP Episode 20 - ‘Reform’ AI Alignment with Scott Aaronson
DanielFilan | Apr 12, 2023, 9:30 PM | 22 points | 2 comments | 68 min read
[Link] A community alert about Ziz
DanielFilan | Feb 24, 2023, 12:06 AM | 180 points | 166 comments | 2 min read | 4 reviews | (medium.com)
Video/animation: Neel Nanda explains what mechanistic interpretability is
DanielFilan | Feb 22, 2023, 10:42 PM | 24 points | 7 comments | 1 min read | (youtu.be)
[linkpost] Better Without AI
DanielFilan | Feb 14, 2023, 5:30 PM | 47 points | 13 comments | 1 min read | (betterwithout.ai)
AXRP: Store, Patreon, Video
DanielFilan | Feb 7, 2023, 4:50 AM | 12 points | 0 comments | 1 min read
Podcast with Oli Habryka on LessWrong / Lightcone Infrastructure
DanielFilan | Feb 5, 2023, 2:52 AM | 88 points | 20 comments | 1 min read | (thefilancabinet.com)
AXRP Episode 19 - Mechanistic Interpretability with Neel Nanda
DanielFilan | Feb 4, 2023, 3:00 AM | 45 points | 0 comments | 117 min read
First Three Episodes of The Filan Cabinet
DanielFilan | Jan 18, 2023, 7:20 PM | 17 points | 1 comment | 1 min read
Podcast with Divia Eden on operant conditioning
DanielFilan | Jan 15, 2023, 2:44 AM | 14 points | 0 comments | 1 min read | (youtu.be)
On Blogging and Podcasting
DanielFilan | Jan 9, 2023, 12:40 AM | 18 points | 6 comments | 11 min read | (danielfilan.com)
Things I carry almost every day, as of late December 2022
DanielFilan | Dec 30, 2022, 7:40 AM | 38 points | 9 comments | 5 min read | (danielfilan.com)
Announcing The Filan Cabinet
DanielFilan | Dec 30, 2022, 3:10 AM | 21 points | 2 comments | 1 min read | (danielfilan.com)