I’m also interested in what goes on the other side of the equation. How are you defining what to search for in the first place? If you point your abstraction detector at an AI and it outputs “this AI has a concept of trees,” how do you gain confidence that the “trees” according to the AI (and according to your abstraction detector) are more or less what you mean by trees?
Some ad-hoc methods spring to mind, but I’m not sure what John would say.