Johannes Treutlein(Johannes Treutlein)

Karma: 784

johannestreutlein.com

Proper scoring rules don’t guarantee predicting fixed points

Johannes Treutlein, Rubi J. Hudson and Caspar Oesterheld

16 Dec 2022 18:22 UTC

68 points

8 comments21 min readLW link

Report on modeling evidential cooperation in large worlds

Johannes Treutlein12 Jul 2023 16:37 UTC

44 points

3 comments1 min readLW link

(arxiv.org)

Stop-gradients lead to fixed point predictions

Johannes Treutlein, Caspar Oesterheld, Rubi J. Hudson and Emery Cooper

28 Jan 2023 22:47 UTC

36 points

2 comments24 min readLW link

Training goals for large language models

Johannes Treutlein18 Jul 2022 7:09 UTC

28 points

5 comments19 min readLW link

Did EDT get it right all along? Introducing yet another medical Newcomb problem

Johannes Treutlein24 Jan 2017 11:43 UTC

22 points

21 comments8 min readLW link

Request for input on multiverse-wide superrationality (MSR)

Johannes Treutlein14 Aug 2018 17:29 UTC

18 points

3 comments1 min readLW link

(effective-altruism.com)

Anthropic uncertainty in the Evidential Blackmail problem

Johannes Treutlein14 May 2017 16:43 UTC

10 points

1 comment1 min readLW link

(casparoesterheld.com)

“Betting on the Past” – a decision problem by Arif Ahmed

Johannes Treutlein7 Feb 2017 21:14 UTC

7 points

6 comments1 min readLW link

(casparoesterheld.com)

A behaviorist approach to building phenomenological bridges

Johannes Treutlein20 Nov 2017 19:36 UTC

4 points

0 comments1 min readLW link

(casparoesterheld.com)