David Matolcsi
Karma: 1,366
David Matolcsi’s Shortform
David Matolcsi · 15 May 2025 22:22 UTC · 6 points · 13 comments · 1 min read
Obstacles in ARC’s agenda: Low Probability Estimation
David Matolcsi · 2 May 2025 19:38 UTC · 44 points · 0 comments · 6 min read
Obstacles in ARC’s agenda: Mechanistic Anomaly Detection
David Matolcsi · 1 May 2025 20:51 UTC · 42 points · 1 comment · 11 min read
Obstacles in ARC’s agenda: Finding explanations
David Matolcsi · 30 Apr 2025 23:03 UTC · 123 points · 10 comments · 17 min read
Don’t over-update on FrontierMath results
David Matolcsi · 11 Mar 2025 20:44 UTC · 51 points · 7 comments · 9 min read
“The Solomonoff Prior is Malign” is a special case of a simpler argument
David Matolcsi · 17 Nov 2024 21:32 UTC · 131 points · 46 comments · 12 min read
You can, in fact, bamboozle an unaligned AI into sparing your life
David Matolcsi · 29 Sep 2024 16:59 UTC · 113 points · 173 comments · 27 min read
A very non-technical explanation of the basics of infra-Bayesianism
David Matolcsi · 26 Apr 2023 22:57 UTC · 62 points · 9 comments · 9 min read
Infra-Bayesianism naturally leads to the monotonicity principle, and I think this is a problem
David Matolcsi · 26 Apr 2023 21:39 UTC · 22 points · 6 comments · 4 min read
A mostly critical review of infra-Bayesianism
David Matolcsi · 28 Feb 2023 18:37 UTC · 108 points · 9 comments · 29 min read
Performance guarantees in classical learning theory and infra-Bayesianism
David Matolcsi · 28 Feb 2023 18:37 UTC · 9 points · 4 comments · 31 min read