In the case of MNIST, how good is the judge itself—for example, if you were to pick the six pixels optimally to give it the most information, how well would it perform?
I agree with Lanrian. A perhaps better metric is the chance that randomly selected pixels of a randomly selected image will cause the judge to guess the label correctly. This corresponds to “judge accuracy (random pixels)” in Table 2 of the original paper: 48.2% with 4 revealed pixels and 59.4% with 6.
What do you mean by picking pixels optimally? For nearly all images, I expect there to exist six pixels such that, if they are revealed, the judge identifies the correct label. That doesn’t seem like a meaningful metric, though.
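For concreteness, the “judge accuracy (random pixels)” metric is just an average over (image, random pixel subset) pairs. The sketch below shows how one might estimate it. Everything here is a stand-in: the synthetic data and the nearest-centroid `judge` are hypothetical placeholders for MNIST test images and the paper’s trained sparse-pixel judge network; only the sampling loop reflects the metric itself.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in data (NOT MNIST): 10 classes of noisy 28x28 "images"
# around random centroids, just to make the script self-contained.
n_classes, n_per_class, n_pixels = 10, 50, 28 * 28
centroids = rng.normal(size=(n_classes, n_pixels))
images = np.repeat(centroids, n_per_class, axis=0) + 0.5 * rng.normal(
    size=(n_classes * n_per_class, n_pixels)
)
labels = np.repeat(np.arange(n_classes), n_per_class)

def judge(image, mask):
    # Hypothetical judge: nearest centroid, comparing only revealed pixels.
    # In the paper this would be a classifier trained on sparse inputs.
    dists = ((centroids[:, mask] - image[mask]) ** 2).sum(axis=1)
    return int(np.argmin(dists))

def random_pixel_accuracy(k, n_trials=2000):
    # Estimate P(judge correct | k uniformly random pixels revealed),
    # averaging over random images and random pixel subsets.
    correct = 0
    for _ in range(n_trials):
        i = rng.integers(len(images))
        mask = np.zeros(n_pixels, dtype=bool)
        mask[rng.choice(n_pixels, size=k, replace=False)] = True
        correct += judge(images[i], mask) == labels[i]
    return correct / n_trials

print(f"4 random pixels: {random_pixel_accuracy(4):.3f}")
print(f"6 random pixels: {random_pixel_accuracy(6):.3f}")
```

On this easy synthetic data the toy judge scores far above the 10% chance baseline; the paper’s 48.2%/59.4% figures come from the same kind of average, but with a real judge on real MNIST digits.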