RSS

CallumMcDougall

Karma: 2,369

ARENA 8.0 - Call for Applicants

20 Feb 2026 18:28 UTC
17 points
0 comments6 min readLW link

An­nounc­ing Gemma Scope 2

22 Dec 2025 21:56 UTC
94 points
1 comment2 min readLW link

Trans­mit­ting Misal­ign­ment with Sublimi­nal Learn­ing via Paraphrasing

17 Dec 2025 19:34 UTC
38 points
0 comments10 min readLW link

How Can In­ter­pretabil­ity Re­searchers Help AGI Go Well?

1 Dec 2025 13:05 UTC
66 points
1 comment14 min readLW link

A Prag­matic Vi­sion for Interpretability

1 Dec 2025 13:05 UTC
131 points
39 comments27 min readLW link

ARENA 7.0 - Call for Applicants

30 Sep 2025 14:54 UTC
27 points
1 comment6 min readLW link

ARENA 6.0 - Call for Applicants

4 Jun 2025 10:19 UTC
26 points
3 comments6 min readLW link

New Cause Area Proposal

CallumMcDougall1 Apr 2025 7:12 UTC
110 points
4 comments1 min readLW link

Nega­tive Re­sults for SAEs On Down­stream Tasks and Depri­ori­tis­ing SAE Re­search (GDM Mech In­terp Team Progress Up­date #2)

26 Mar 2025 19:07 UTC
116 points
15 comments29 min readLW link
(deepmindsafetyresearch.medium.com)

ARENA 5.0 - Call for Applicants

30 Jan 2025 13:18 UTC
35 points
2 comments6 min readLW link

Scal­ing Sparse Fea­ture Cir­cuit Find­ing to Gemma 9B

10 Jan 2025 11:08 UTC
88 points
11 comments17 min readLW link

SAEBench: A Com­pre­hen­sive Bench­mark for Sparse Autoencoders

11 Dec 2024 6:30 UTC
82 points
6 comments2 min readLW link
(www.neuronpedia.org)

AI Align­ment Re­search Eng­ineer Ac­cel­er­a­tor (ARENA): Call for ap­pli­cants v4.0

6 Jul 2024 11:34 UTC
57 points
7 comments6 min readLW link