RSS

Adrià Garriga-alonso

Karma: 1,354

A scheme to credit hack policy gra­di­ent training

Adrià Garriga-alonso7 Nov 2025 6:24 UTC
15 points
0 comments5 min readLW link

An­thropic’s JumpReLU train­ing method is re­ally good

3 Oct 2025 15:23 UTC
27 points
0 comments2 min readLW link

A re­cur­rent CNN finds maze paths by filling dead-ends

Adrià Garriga-alonso15 Sep 2025 20:49 UTC
19 points
0 comments2 min readLW link

The “Spar­sity vs Re­con­struc­tion Trade­off” Illusion

26 Aug 2025 4:39 UTC
21 points
0 comments4 min readLW link

L0 is not a neu­tral hyperparameter

19 Jul 2025 13:51 UTC
24 points
3 comments5 min readLW link

Can We Change the Goals of a Toy RL Agent?

15 Jun 2025 20:34 UTC
20 points
0 comments9 min readLW link

Spar­sity is the en­emy of fea­ture ex­trac­tion (ft. ab­sorp­tion)

3 May 2025 10:13 UTC
32 points
0 comments6 min readLW link