Can
Karma: 69
Safety of Self-Assembled Neuromorphic Hardware
Can · 26 Dec 2022 18:51 UTC · 15 points · 2 comments · 10 min read · LW link (forum.effectivealtruism.org)
An adversarial example for Direct Logit Attribution: memory management in gelu-4l
Can, Yeu-Tong Lau, James Dao and Jett · 30 Aug 2023 17:36 UTC · 17 points · 0 comments · 8 min read · LW link (arxiv.org)
Past Tense Features
Can · 20 Apr 2024 14:34 UTC · 11 points · 0 comments · 4 min read · LW link