Jessica Rumbelow

Karma: 1,239

AI researcher

An Introduction to Exemplar Partitioning for Mechanistic Interpretability

Jessica Rumbelow16 May 2026 3:58 UTC

68 points

7 comments11 min readLW link

(www.leap-labs.com)

Scientific Discovery in the Age of Artificial Intelligence

Jessica Rumbelow29 Jun 2025 20:45 UTC

42 points

3 comments10 min readLW link

Why did ChatGPT say that? Prompt engineering and more, with PIZZA.

Jessica Rumbelow3 Aug 2024 12:07 UTC

43 points

2 comments4 min readLW link

Introducing Leap Labs, an AI interpretability startup

Jessica Rumbelow6 Mar 2023 16:16 UTC

104 points

12 comments1 min readLW link

SolidGoldMagikarp III: Glitch token archaeology

mwatkins and Jessica Rumbelow

14 Feb 2023 10:17 UTC

92 points

36 comments16 min readLW link

SolidGoldMagikarp II: technical details and more recent findings

mwatkins and Jessica Rumbelow

6 Feb 2023 19:09 UTC

114 points

45 comments13 min readLW link

SolidGoldMagikarp (plus, prompt generation)

Jessica Rumbelow and mwatkins

5 Feb 2023 22:02 UTC

677 points

208 comments12 min readLW link 1 review

Guardian AI (Misaligned systems are all around us.)

Jessica Rumbelow25 Nov 2022 15:55 UTC

15 points

6 comments2 min readLW link

The Ground Truth Problem (Or, Why Evaluating Interpretability Methods Is Hard)

Jessica Rumbelow17 Nov 2022 11:06 UTC

27 points

2 comments2 min readLW link

Why I’m Working On Model Agnostic Interpretability

Jessica Rumbelow11 Nov 2022 9:24 UTC

27 points

9 comments2 min readLW link