Fernando Avalos
Karma: 18
Wubbalubbaudbdub
Approximating Human Preferences Using a Multi-Judge Learned System
JoseFaustino, eitan sprejer, Fernando Avalos and Augusto Bernardi
31 Jul 2025 18:01 UTC
19 points, 0 comments, 13 min read
[Linkpost] Interpretable Analysis of Features Found in Open-source Sparse Autoencoder (partial replication)
Fernando Avalos
9 Sep 2024 3:33 UTC
6 points, 1 comment, 1 min read
(forum.effectivealtruism.org)