Marius Hobbhahn

Karma: 2,890

I recently founded Apollo Research: https://www.apolloresearch.ai/

I was previously doing a Ph.D. in ML at the International Max-Planck research school in Tübingen, worked part-time with Epoch and did independent AI safety research.

For more see https://www.mariushobbhahn.com/aboutme/

I subscribe to Crocker’s Rules

How to Sleep Better

Marius Hobbhahn16 Jul 2021 0:00 UTC

48 points

49 comments15 min readLW link

A Guide for Productivity

Marius Hobbhahn23 Jul 2021 7:03 UTC

33 points

7 comments41 min readLW link

What makes us happy and depressed?

Marius Hobbhahn3 Oct 2021 6:25 UTC

16 points

6 comments19 min readLW link

What are red flags for Neural Network suffering?

Marius Hobbhahn8 Nov 2021 12:51 UTC

29 points

15 comments12 min readLW link

How to measure FLOP/s for Neural Networks empirically?

Marius Hobbhahn29 Nov 2021 15:18 UTC

16 points

5 comments7 min readLW link

What’s the backward-forward FLOP ratio for Neural Networks?

Marius Hobbhahn and Jsevillamol

13 Dec 2021 8:54 UTC

19 points

12 comments10 min readLW link

Causality, Transformative AI and alignment—part I

Marius Hobbhahn27 Jan 2022 16:18 UTC

14 points

11 comments8 min readLW link

Nuclear Energy—Good but not the silver bullet we were hoping for

Marius Hobbhahn30 Apr 2022 15:41 UTC

64 points

33 comments15 min readLW link 1 review

The limits of AI safety via debate

Marius Hobbhahn10 May 2022 13:33 UTC

29 points

7 comments10 min readLW link

Eliciting Latent Knowledge (ELK) - Distillation/Summary

Marius Hobbhahn8 Jun 2022 13:18 UTC

69 points

2 comments21 min readLW link

Investigating causal understanding in LLMs

Marius Hobbhahn and Tom Lieberum

14 Jun 2022 13:57 UTC

28 points

6 comments13 min readLW link

Our mental building blocks are more different than I thought

Marius Hobbhahn15 Jun 2022 11:07 UTC

44 points

11 comments14 min readLW link

Reflection Mechanisms as an Alignment target: A survey

Marius Hobbhahn, elandgre and Beth Barnes

22 Jun 2022 15:05 UTC

32 points

1 comment14 min readLW link

What success looks like

Marius Hobbhahn, MaxRa, JasperGeh and Yannick_Muehlhaeuser

28 Jun 2022 14:38 UTC

19 points

4 comments1 min readLW link

(forum.effectivealtruism.org)

Trends in GPU price-performance

Marius Hobbhahn and Tamay

1 Jul 2022 15:51 UTC

85 points

12 comments1 min readLW link 1 review

(epochai.org)

The Defender’s Advantage of Interpretability

Marius Hobbhahn14 Sep 2022 14:05 UTC

41 points

4 comments6 min readLW link

Why deceptive alignment matters for AGI safety

Marius Hobbhahn15 Sep 2022 13:38 UTC

57 points

13 comments13 min readLW link

Paper+Summary: OMNIGROK: GROKKING BEYOND ALGORITHMIC DATA

Marius Hobbhahn4 Oct 2022 7:22 UTC

46 points

11 comments1 min readLW link

(arxiv.org)

Reflection Mechanisms as an Alignment target: A follow-up survey

Marius Hobbhahn, elandgre and Beth Barnes

5 Oct 2022 14:03 UTC

15 points

2 comments7 min readLW link

Lessons learned from talking to >100 academics about AI safety

Marius Hobbhahn10 Oct 2022 13:16 UTC

214 points

17 comments12 min readLW link 1 review