Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Rareș Baron
Karma:
15
All
Posts
Comments
New
Top
Old
Minor interpretability exploration #4: LayerNorm and the learning coefficient
Rareș Baron
20 Mar 2025 16:18 UTC
4
points
0
comments
1
min read
LW
link
Minor interpretability exploration #3: Extending superposition to different activation functions (loss landscape)
Rareș Baron
14 Mar 2025 15:45 UTC
5
points
0
comments
3
min read
LW
link
Minor interpretability exploration #2: Extending superposition to different activation functions
Rareș Baron
6 Mar 2025 11:22 UTC
3
points
0
comments
4
min read
LW
link
Minor interpretability exploration #1: Grokking of modular addition, subtraction, multiplication, for different activation functions
Rareș Baron
26 Feb 2025 11:35 UTC
5
points
13
comments
4
min read
LW
link
Back to top