
Adam Karvonen

Karma: 591

Frontier AI Models Still Fail at Basic Physical Tasks: A Manufacturing Case Study

Adam Karvonen · 14 Apr 2025 17:38 UTC
154 points
42 comments · 7 min read · LW link
(adamkarvonen.github.io)

Adam Karvonen’s Shortform

Adam Karvonen · 18 Jan 2025 17:11 UTC
4 points
1 comment · LW link

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders

11 Dec 2024 6:30 UTC
82 points
6 comments · 2 min read · LW link
(www.neuronpedia.org)

Evaluating Sparse Autoencoders with Board Game Models

2 Aug 2024 19:50 UTC
38 points
1 comment · 9 min read · LW link

Using an LLM perplexity filter to detect weight exfiltration

Adam Karvonen · 21 Jul 2024 18:18 UTC
25 points
11 comments · 2 min read · LW link

OthelloGPT learned a bag of heuristics

2 Jul 2024 9:12 UTC
111 points
10 comments · 9 min read · LW link

An Intuitive Explanation of Sparse Autoencoders for Mechanistic Interpretability of LLMs

Adam Karvonen · 25 Jun 2024 15:57 UTC
27 points
0 comments · 9 min read · LW link
(adamkarvonen.github.io)

A Chess-GPT Linear Emergent World Representation

Adam Karvonen · 8 Feb 2024 4:25 UTC
105 points
14 comments · 7 min read · LW link
(adamkarvonen.github.io)