Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Johannes Treutlein
(Johannes Treutlein)
Karma:
784
johannestreutlein.com
All
Posts
Comments
New
Top
Old
Proper scoring rules don’t guarantee predicting fixed points
Johannes Treutlein
,
Rubi J. Hudson
and
Caspar Oesterheld
16 Dec 2022 18:22 UTC
68
points
8
comments
21
min read
LW
link
Report on modeling evidential cooperation in large worlds
Johannes Treutlein
12 Jul 2023 16:37 UTC
44
points
3
comments
1
min read
LW
link
(arxiv.org)
Stop-gradients lead to fixed point predictions
Johannes Treutlein
,
Caspar Oesterheld
,
Rubi J. Hudson
and
Emery Cooper
28 Jan 2023 22:47 UTC
36
points
2
comments
24
min read
LW
link
Training goals for large language models
Johannes Treutlein
18 Jul 2022 7:09 UTC
28
points
5
comments
19
min read
LW
link
Did EDT get it right all along? Introducing yet another medical Newcomb problem
Johannes Treutlein
24 Jan 2017 11:43 UTC
22
points
21
comments
8
min read
LW
link
Request for input on multiverse-wide superrationality (MSR)
Johannes Treutlein
14 Aug 2018 17:29 UTC
18
points
3
comments
1
min read
LW
link
(effective-altruism.com)
Anthropic uncertainty in the Evidential Blackmail problem
Johannes Treutlein
14 May 2017 16:43 UTC
10
points
1
comment
1
min read
LW
link
(casparoesterheld.com)
“Betting on the Past” – a decision problem by Arif Ahmed
Johannes Treutlein
7 Feb 2017 21:14 UTC
7
points
6
comments
1
min read
LW
link
(casparoesterheld.com)
A behaviorist approach to building phenomenological bridges
Johannes Treutlein
20 Nov 2017 19:36 UTC
4
points
0
comments
1
min read
LW
link
(casparoesterheld.com)
Back to top