Sonia Joseph
Karma: 120
Pursuing a PhD in multimodal interpretability and alignment at Mila.
Twitter: @soniajoseph_
Litigate-for-Impact: Preparing Legal Action against an AGI Frontier Lab Leader
Sonia Joseph · 7 Dec 2024 21:42 UTC · 38 points · 8 comments · 2 min read
Bridging the VLM and mech interp communities for multimodal interpretability
Sonia Joseph · 28 Oct 2024 14:41 UTC · 19 points · 5 comments · 15 min read
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent (arxiv.org)
Karolis Jucys, george_adams and Sonia Joseph · 18 Jul 2024 17:02 UTC · 9 points · 0 comments · 1 min read
Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems
Sonia Joseph and Neel Nanda · 13 Mar 2024 17:09 UTC · 44 points · 13 comments · 14 min read