Quantilization

Tag. Last edit: Dec 2, 2024, 5:35 PM by Mateusz Bagiński

A quantilizer is a proposed AI design that aims to reduce the harms from Goodhart’s law and specification gaming. Instead of maximizing over all actions, it samples from the top fraction q, ranked by expected utility, of a base distribution of human-like actions, so it selects reasonably effective actions without straying far from what a human might do. It is more a theoretical tool for exploring ways around these problems than a practical, buildable design.
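As a rough illustration of the idea, here is a minimal sketch of a quantilizer over a finite action set. It assumes the base distribution is given as a dictionary of probabilities and that a utility function is available; the names `quantilizer_distribution`, `quantilize`, `base_probs`, `utility`, and `q` are hypothetical and chosen only for this example. The sketch keeps the top `q` fraction of base probability mass, ranked by utility, and renormalizes it; it is not taken from any of the posts listed below.

```python
import random


def quantilizer_distribution(base_probs, utility, q):
    """Return the q-quantilized distribution over a finite action set.

    base_probs: dict mapping each action to its base probability
                (assumed to sum to 1).
    utility:    function mapping an action to a number.
    q:          fraction of base probability mass to keep, in (0, 1].
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    # Rank actions from highest to lowest utility (ties keep dict order).
    ranked = sorted(base_probs, key=utility, reverse=True)
    remaining = q  # base probability mass still to be allocated
    result = {}
    for action in ranked:
        if remaining <= 0:
            break
        # Take the whole action, or only the part that fits into the top q.
        mass = min(base_probs[action], remaining)
        if mass > 0:
            result[action] = mass / q  # renormalize so the result sums to 1
        remaining -= mass
    return result


def quantilize(base_probs, utility, q, rng=random):
    """Sample one action from the q-quantilized distribution."""
    dist = quantilizer_distribution(base_probs, utility, q)
    actions = list(dist)
    weights = [dist[a] for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]


# Toy example: "a" is the most human-typical action but has the lowest utility.
base = {"a": 0.5, "b": 0.25, "c": 0.25}
scores = {"a": 1.0, "b": 3.0, "c": 2.0}
top_half = quantilizer_distribution(base, scores.get, q=0.5)
print(top_half)  # {'b': 0.5, 'c': 0.5}
print(quantilize(base, scores.get, q=0.5, rng=random.Random(0)))
```

With `q = 1` the sketch reproduces the base distribution, and as `q` shrinks it approaches a plain maximizer, which is the trade-off the parameter is meant to expose.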

Quantilizers maximize expected utility subject to a conservative cost constraint

jessicata

Sep 28, 2015, 2:17 AM

33 points

3 comments, 5 min read, LW link

Another view of quantilizers: avoiding Goodhart’s Law

jessicata

Jan 9, 2016, 4:02 AM

26 points

2 comments, 2 min read, LW link

When to use quantilization

RyanCarey

Feb 5, 2019, 5:17 PM

65 points

5 comments, 4 min read, LW link

Computing an exact quantilal policy

Vanessa Kosoy

Apr 12, 2018, 9:23 AM

9 points

0 comments, 2 min read, LW link

Quantilal control for finite MDPs

Vanessa Kosoy

Apr 12, 2018, 9:21 AM

14 points

0 comments, 13 min read, LW link

Soft optimization makes the value target bigger

Jeremy Gillen

Jan 2, 2023, 4:06 PM

119 points

20 comments, 12 min read, LW link

Quantilizers and Generative Models

Adam Jermyn

Jul 18, 2022, 4:32 PM

24 points

5 comments, 4 min read, LW link

[Question] Why don’t quantilizers also cut off the upper end of the distribution?

Alex_Altair

May 15, 2023, 1:40 AM

25 points

2 comments, 1 min read, LW link

[Aspiration-based designs] 1. Informal introduction

B Jacobs, Jobst Heitzig, Simon Fischer and Simon Dima

Apr 28, 2024, 1:00 PM

44 points

4 comments, 8 min read, LW link

Hedonic Loops and Taming RL

beren

Jul 19, 2023, 3:12 PM

20 points

14 comments, 9 min read, LW link

Quantilizer ≡ Optimizer with a Bounded Amount of Output

itaibn0

Nov 16, 2021, 1:03 AM

11 points

4 comments, 2 min read, LW link

The murderous shortcut: a toy model of instrumental convergence

Thomas Kwa

Oct 2, 2024, 6:48 AM

37 points

0 comments, 2 min read, LW link

How to safely use an optimizer

Simon Fischer

Mar 28, 2024, 4:11 PM

47 points

21 comments, 7 min read, LW link

AISC team report: Soft-optimization, Bayes and Goodhart

Simon Fischer, benjaminko, jazcarretao, DFNaiff and Jeremy Gillen

Jun 27, 2023, 6:05 AM

38 points

2 comments, 15 min read, LW link

Gravitizing Quantization

dmcg224

Jun 1, 2025, 9:05 AM

1 point

0 comments, 8 min read, LW link

[Aspiration-based designs] 2. Formal framework, basic algorithm

Jobst Heitzig, Simon Dima and Simon Fischer

Apr 28, 2024, 1:02 PM

18 points

2 comments, 16 min read, LW link

AISC project: SatisfIA – AI that satisfies without overdoing it

Jobst Heitzig

Nov 11, 2023, 6:22 PM

12 points

0 comments, 1 min read, LW link

(docs.google.com)

Recursive Quantilizers II

abramdemski

Dec 2, 2020, 3:26 PM

30 points

15 comments, 13 min read, LW link

Stable Pointers to Value III: Recursive Quantilization

abramdemski

Jul 21, 2018, 8:06 AM

20 points

4 comments, 4 min read, LW link
