Fernando Avalos

Karma: 23

Wubbalubbaudbdub

Approximating Human Preferences Using a Multi-Judge Learned System

JoseFaustino, eitan sprejer, Fernando Avalos and Augusto Bernardi

31 Jul 2025 18:01 UTC

19 points

0 comments · 13 min read · LW link

[Linkpost] Interpretable Analysis of Features Found in Open-source Sparse Autoencoder (partial replication)

Fernando Avalos

9 Sep 2024 3:33 UTC

6 points

1 comment · 1 min read · LW link

(forum.effectivealtruism.org)