
Kaarel

Karma: 510

kaarelh AT gmail DOT com

personal website

A starting point for making sense of task structure (in machine learning)

24 Feb 2024 1:51 UTC
37 points
2 comments · 12 min read · LW link

Toward A Mathematical Framework for Computation in Superposition

18 Jan 2024 21:06 UTC
182 points
17 comments · 73 min read · LW link

Grokking, memorization, and generalization — a discussion

29 Oct 2023 23:17 UTC
63 points
10 comments · 23 min read · LW link

Crystal Healing — or the Origins of Expected Utility Maximizers

25 Jun 2023 3:18 UTC
49 points
11 comments · 5 min read · LW link

Searching for a model’s concepts by their shape – a theoretical framework

23 Feb 2023 20:14 UTC
50 points
0 comments · 19 min read · LW link

[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.

25 Jan 2023 19:03 UTC
47 points
6 comments · 12 min read · LW link

A gentle primer on caring, including in strange senses, with applications

Kaarel · 30 Aug 2022 8:05 UTC
9 points
4 comments · 18 min read · LW link

kh’s Shortform

Kaarel · 6 Jul 2022 21:48 UTC
2 points
6 comments · 1 min read · LW link

[Question] Transferring credence without transferring evidence?

Kaarel · 4 Feb 2022 8:11 UTC
11 points
6 comments · 3 min read · LW link