AI alignment might be an ontology problem. It seems to me that the referent of “human values” is messy and fractal exactly in the way that our default modes of inquiry can’t get a grip on. If you stick to those modes of inquiry, it’s hard to even notice.