I was pretty taken aback by the article claiming that KataGo apparently has something like a human-exploitable, distorted concept of "liberties".
If we could somehow ask KataGo how it defines "liberties", I suspect it would have become clear much sooner that its concept was messed up. But of course, a huge part of The Problem is that we have no idea what these neural nets are actually doing.
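For reference, "liberties" has a crisp definition that a few lines of code capture: the liberties of a group are the empty points adjacent to any stone in the group, found by flood fill. The Python below is a minimal sketch of that standard definition (board as a dict mapping `(x, y)` to a color string, with empty points absent), not a claim about KataGo's internal representation:

```python
def liberties(board, start, size=19):
    """Return the set of liberties of the group containing `start`.

    `board` maps (x, y) -> "b" or "w"; points not in the dict are empty.
    """
    color = board[start]
    group, frontier, libs = {start}, [start], set()
    while frontier:
        x, y = frontier.pop()
        for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            if not (0 <= nx < size and 0 <= ny < size):
                continue
            if (nx, ny) not in board:
                libs.add((nx, ny))  # empty neighbor: a liberty
            elif board[(nx, ny)] == color and (nx, ny) not in group:
                group.add((nx, ny))  # same-color stone joins the group
                frontier.append((nx, ny))
    return libs
```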
So I propose the following challenge: build a hybrid KataGo/LLM AI that makes the same mistake and outputs reasoning text in which the mistake is recognizable.
It would be funny if the Go part continued making the same mistake, and the LLM part just made up bullshit explanations.
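For concreteness, here's a rough sketch of the kind of pipeline I have in mind. `go_engine.best_move` and `llm.complete` are hypothetical stand-ins, not real APIs; the actual wiring to KataGo's analysis engine and to whatever LLM you pick is the hard part of the challenge:

```python
def move_with_rationale(go_engine, llm, board_state):
    """Pick a move with a Go engine, then have an LLM narrate the reasoning."""
    move = go_engine.best_move(board_state)  # the part that blunders
    prompt = (
        "You are playing Go. Current position:\n"
        f"{board_state}\n"
        f"You chose the move {move}. Explain your reasoning, including "
        "a count of the liberties of every group involved."
    )
    rationale = llm.complete(prompt)  # the part that may confabulate
    return move, rationale
```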