Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Paul Bricman
Karma:
190
All
Posts
Comments
New
Top
Old
Sealed Computation: Towards Low-Friction Proof of Locality
Paul Bricman
29 Apr 2025 15:26 UTC
4
points
0
comments
10
min read
LW
link
(noemaresearch.com)
Reverse engineering the memory layout of GPU inference
Paul Bricman
9 Apr 2025 15:40 UTC
5
points
0
comments
6
min read
LW
link
(noemaresearch.com)
Join the $10K AutoHack 2024 Tournament
Paul Bricman
25 Sep 2024 11:54 UTC
5
points
0
comments
1
min read
LW
link
(noemaresearch.com)
Towards AI Safety Infrastructure: Talk & Outline
Paul Bricman
7 Jan 2024 9:31 UTC
11
points
0
comments
2
min read
LW
link
(www.youtube.com)
Hashmarks: Privacy-Preserving Benchmarks for High-Stakes AI Evaluation
Paul Bricman
4 Dec 2023 7:31 UTC
12
points
6
comments
16
min read
LW
link
(arxiv.org)
Elements of Computational Philosophy, Vol. I: Truth
Paul Bricman
and
Tom Feeney
1 Jul 2023 11:44 UTC
12
points
6
comments
1
min read
LW
link
(compphil.github.io)
Cataloguing Priors in Theory and Practice
Paul Bricman
13 Oct 2022 12:36 UTC
13
points
8
comments
7
min read
LW
link
(Structural) Stability of Coupled Optimizers
Paul Bricman
30 Sep 2022 11:28 UTC
25
points
0
comments
10
min read
LW
link
Interlude: But Who Optimizes The Optimizer?
Paul Bricman
23 Sep 2022 15:30 UTC
15
points
0
comments
10
min read
LW
link
Representational Tethers: Tying AI Latents To Human Ones
Paul Bricman
16 Sep 2022 14:45 UTC
30
points
0
comments
16
min read
LW
link
Ideological Inference Engines: Making Deontology Differentiable*
Paul Bricman
12 Sep 2022 12:00 UTC
6
points
0
comments
14
min read
LW
link
Oversight Leagues: The Training Game as a Feature
Paul Bricman
9 Sep 2022 10:08 UTC
20
points
6
comments
10
min read
LW
link
Benchmarking Proposals on Risk Scenarios
Paul Bricman
20 Aug 2022 10:01 UTC
25
points
2
comments
14
min read
LW
link
Steelmining via Analogy
Paul Bricman
13 Aug 2022 9:59 UTC
24
points
0
comments
2
min read
LW
link
(paulbricman.com)
[Linkpost] diffusion magnetizes manifolds (DALL-E 2 intuition building)
Paul Bricman
7 May 2022 11:01 UTC
1
point
0
comments
1
min read
LW
link
(paulbricman.com)
[Linkpost] Value extraction via language model abduction
Paul Bricman
1 May 2022 19:11 UTC
5
points
3
comments
1
min read
LW
link
(paulbricman.com)
Back to top